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The  Mars  Climate  Orbiter  is  pre¬ 
pared  for  its  mission.  The  laws 
of  physics  are  the  same  every¬ 
where,  even  on  Mars,  so  the 
probe  could  be  designed  based 
on  the  laws  of  physics  as  discov¬ 
ered  on  earth.  There  is  unfor¬ 
tunately  another  reason  why  this 
spacecraft  is  relevant  to  the  top¬ 
ics  of  this  chapter:  it  was  de¬ 
stroyed  attempting  to  enter  Mars’ 
atmosphere  because  engineers 
at  Lockheed  Martin  forgot  to  con¬ 
vert  data  on  engine  thrusts  from 
pounds  into  the  metric  unit  of 
force  (newtons)  before  giving  the 
information  to  NASA.  Conver¬ 
sions  are  important! 


Chapter  0 

Introduction  and  Review 


0.1  Introduction  and  Review 

If  you  drop  your  shoe  and  a  coin  side  by  side,  they  hit  the  ground  at 
the  same  time.  Why  doesn’t  the  shoe  get  there  first,  since  gravity  is 
pulling  harder  on  it?  How  does  the  lens  of  your  eye  work,  and  why 
do  your  eye’s  muscles  need  to  squash  its  lens  into  different  shapes  in 
order  to  focus  on  objects  nearby  or  far  away?  These  are  the  kinds 
of  questions  that  physics  tries  to  answer  about  the  behavior  of  light 
and  matter,  the  two  things  that  the  universe  is  made  of. 

0.1.1  The  scientific  method 

Until  very  recently  in  history,  no  progress  was  made  in  answering 
questions  like  these.  Worse  than  that,  the  wrong  answers  written 
by  thinkers  like  the  ancient  Greek  physicist  Aristotle  were  accepted 
without  question  for  thousands  of  years.  Why  is  it  that  scientific 
knowledge  has  progressed  more  since  the  Renaissance  than  it  had 
in  all  the  preceding  millennia  since  the  beginning  of  recorded  his¬ 
tory?  Undoubtedly  the  industrial  revolution  is  part  of  the  answer. 
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a  /  Science  is  a  cycle  of  the¬ 
ory  and  experiment. 


b  /  A  satirical  drawing  of  an 
alchemist’s  laboratory.  H.  Cock, 
after  a  drawing  by  Peter  Brueghel 
the  Elder  (16th  century). 


Building  its  centerpiece,  the  steam  engine,  required  improved  tech¬ 
niques  for  precise  construction  and  measurement.  (Early  on,  it  was 
considered  a  major  advance  when  English  machine  shops  learned  to 
build  pistons  and  cylinders  that  fit  together  with  a  gap  narrower 
than  the  thickness  of  a  penny.)  But  even  before  the  industrial  rev¬ 
olution,  the  pace  of  discovery  had  picked  up,  mainly  because  of  the 
introduction  of  the  modern  scientific  method.  Although  it  evolved 
over  time,  most  scientists  today  would  agree  on  something  like  the 
following  list  of  the  basic  principles  of  the  scientific  method: 

(1)  Science  is  a  cycle  of  theory  and  experiment.  Scientific  the¬ 
ories  are  created  to  explain  the  results  of  experiments  that  were 
created  under  certain  conditions.  A  successful  theory  will  also  make 
new  predictions  about  new  experiments  under  new  conditions.  Even¬ 
tually,  though,  it  always  seems  to  happen  that  a  new  experiment 
comes  along,  showing  that  under  certain  conditions  the  theory  is 
not  a  good  approximation  or  is  not  valid  at  all.  The  ball  is  then 
back  in  the  theorists’  court.  If  an  experiment  disagrees  with  the 
current  theory,  the  theory  has  to  be  changed,  not  the  experiment. 

(2)  Theories  should  both  predict  and  explain.  The  requirement  of 
predictive  power  means  that  a  theory  is  only  meaningful  if  it  predicts 
something  that  can  be  checked  against  experimental  measurements 
that  the  theorist  did  not  already  have  at  hand.  That  is,  a  theory 
should  be  testable.  Explanatory  value  means  that  many  phenomena 
should  be  accounted  for  with  few  basic  principles.  If  you  answer 
every  “why”  question  with  “because  that’s  the  way  it  is,”  then  your 
theory  has  no  explanatory  value.  Collecting  lots  of  data  without 
being  able  to  find  any  basic  underlying  principles  is  not  science. 

(3)  Experiments  should  be  reproducible.  An  experiment  should 
be  treated  with  suspicion  if  it  only  works  for  one  person,  or  only 
in  one  part  of  the  world.  Anyone  with  the  necessary  skills  and 
equipment  should  be  able  to  get  the  same  results  from  the  same 
experiment.  This  implies  that  science  transcends  national  and  eth¬ 
nic  boundaries;  you  can  be  sure  that  nobody  is  doing  actual  science 
who  claims  that  their  work  is  “Aryan,  not  Jewish,”  “Marxist,  not 
bourgeois,”  or  “Christian,  not  atheistic.”  An  experiment  cannot  be 
reproduced  if  it  is  secret,  so  science  is  necessarily  a  public  enterprise. 

As  an  example  of  the  cycle  of  theory  and  experiment,  a  vital  step 
toward  modern  chemistry  was  the  experimental  observation  that  the 
chemical  elements  could  not  be  transformed  into  each  other,  e.g., 
lead  could  not  be  turned  into  gold.  This  led  to  the  theory  that 
chemical  reactions  consisted  of  rearrangements  of  the  elements  in 
different  combinations,  without  any  change  in  the  identities  of  the 
elements  themselves.  The  theory  worked  for  hundreds  of  years,  and 
was  confirmed  experimentally  over  a  wide  range  of  pressures  and 
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temperatures  and  with  many  combinations  of  elements.  Only  in 
the  twentieth  century  did  we  learn  that  one  element  could  be  trans¬ 
formed  into  one  another  under  the  conditions  of  extremely  high 
pressure  and  temperature  existing  in  a  nuclear  bomb  or  inside  a  star. 
That  observation  didn’t  completely  invalidate  the  original  theory  of 
the  immutability  of  the  elements,  but  it  showed  that  it  was  only  an 
approximation,  valid  at  ordinary  temperatures  and  pressures. 

self-check  A 

A  psychic  conducts  seances  in  which  the  spirits  of  the  dead  speak  to 
the  participants.  He  says  he  has  special  psychic  powers  not  possessed 
by  other  people,  which  allow  him  to  “channel”  the  communications  with 
the  spirits.  What  part  of  the  scientific  method  is  being  violated  here? 
>  Answer,  p.  961 

The  scientific  method  as  described  here  is  an  idealization,  and 
should  not  be  understood  as  a  set  procedure  for  doing  science.  Sci¬ 
entists  have  as  many  weaknesses  and  character  flaws  as  any  other 
group,  and  it  is  very  common  for  scientists  to  try  to  discredit  other 
people’s  experiments  when  the  results  run  contrary  to  their  own  fa¬ 
vored  point  of  view.  Successful  science  also  has  more  to  do  with 
luck,  intuition,  and  creativity  than  most  people  realize,  and  the 
restrictions  of  the  scientific  method  do  not  stifle  individuality  and 
self-expression  any  more  than  the  fugue  and  sonata  forms  stifled 
Bach  and  Haydn.  There  is  a  recent  tendency  among  social  scien¬ 
tists  to  go  even  further  and  to  deny  that  the  scientific  method  even 
exists,  claiming  that  science  is  no  more  than  an  arbitrary  social  sys¬ 
tem  that  determines  what  ideas  to  accept  based  on  an  in-group’s 
criteria.  I  think  that’s  going  too  far.  If  science  is  an  arbitrary  social 
ritual,  it  would  seem  difficult  to  explain  its  effectiveness  in  building 
such  useful  items  as  airplanes,  CD  players,  and  sewers.  If  alchemy 
and  astrology  were  no  less  scientific  in  their  methods  than  chem¬ 
istry  and  astronomy,  what  was  it  that  kept  them  from  producing 
anything  useful? 

Discussion  Questions 

Consider  whether  or  not  the  scientific  method  is  being  applied  in  the  fol¬ 
lowing  examples.  If  the  scientific  method  is  not  being  applied,  are  the 
people  whose  actions  are  being  described  performing  a  useful  human 
activity,  albeit  an  unscientific  one? 

A  Acupuncture  is  a  traditional  medical  technique  of  Asian  origin  in 
which  small  needles  are  inserted  in  the  patient’s  body  to  relieve  pain. 
Many  doctors  trained  in  the  west  consider  acupuncture  unworthy  of  ex¬ 
perimental  study  because  if  it  had  therapeutic  effects,  such  effects  could 
not  be  explained  by  their  theories  of  the  nervous  system.  Who  is  being 
more  scientific,  the  western  or  eastern  practitioners? 
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B  Goethe,  a  German  poet,  is  less  well  known  for  his  theory  of  color. 
He  published  a  book  on  the  subject,  in  which  he  argued  that  scientific 
apparatus  for  measuring  and  quantifying  color,  such  as  prisms,  lenses 
and  colored  filters,  could  not  give  us  full  insight  into  the  ultimate  meaning 
of  color,  for  instance  the  cold  feeling  evoked  by  blue  and  green  or  the 
heroic  sentiments  inspired  by  red.  Was  his  work  scientific? 

C  A  child  asks  why  things  fall  down,  and  an  adult  answers  “because  of 
gravity.”  The  ancient  Greek  philosopher  Aristotle  explained  that  rocks  fell 
because  it  was  their  nature  to  seek  out  their  natural  place,  in  contact  with 
the  earth.  Are  these  explanations  scientific? 

D  Buddhism  is  partly  a  psychological  explanation  of  human  suffering, 
and  psychology  is  of  course  a  science.  The  Buddha  could  be  said  to 
have  engaged  in  a  cycle  of  theory  and  experiment,  since  he  worked  by 
trial  and  error,  and  even  late  in  his  life  he  asked  his  followers  to  challenge 
his  ideas.  Buddhism  could  also  be  considered  reproducible,  since  the 
Buddha  told  his  followers  they  could  find  enlightenment  for  themselves 
if  they  followed  a  certain  course  of  study  and  discipline.  Is  Buddhism  a 
scientific  pursuit? 

0.1.2  What  is  physics? 

Given  for  one  instant  an  intelligence  which  could  comprehend  all 
the  forces  by  which  nature  is  animated  and  the  respective  positions 
of  the  things  which  compose  it. ..nothing  would  be  uncertain,  and 
the  future  as  the  past  would  be  laid  out  before  its  eyes. 

Pierre  Simon  de  Laplace 

Physics  is  the  use  of  the  scientific  method  to  find  out  the  basic 
principles  governing  light  and  matter,  and  to  discover  the  implica¬ 
tions  of  those  laws.  Part  of  what  distinguishes  the  modern  outlook 
from  the  ancient  mind-set  is  the  assumption  that  there  are  rules  by 
which  the  universe  functions,  and  that  those  laws  can  be  at  least  par¬ 
tially  understood  by  humans.  From  the  Age  of  Reason  through  the 
nineteenth  century,  many  scientists  began  to  be  convinced  that  the 
laws  of  nature  not  only  could  be  known  but,  as  claimed  by  Laplace, 
those  laws  could  in  principle  be  used  to  predict  everything  about 
the  universe’s  future  if  complete  information  was  available  about 
the  present  state  of  all  light  and  matter.  In  subsequent  sections, 
I’ll  describe  two  general  types  of  limitations  on  prediction  using  the 
laws  of  physics,  which  were  only  recognized  in  the  twentieth  century. 

Matter  can  be  defined  as  anything  that  is  affected  by  gravity, 
i.e.,  that  has  weight  or  would  have  weight  if  it  was  near  the  Earth 
or  another  star  or  planet  massive  enough  to  produce  measurable 
gravity.  Light  can  be  defined  as  anything  that  can  travel  from  one 
place  to  another  through  empty  space  and  can  influence  matter,  but 
has  no  weight.  For  example,  sunlight  can  influence  your  body  by 
heating  it  or  by  damaging  your  DNA  and  giving  you  skin  cancer. 
The  physicist’s  definition  of  light  includes  a  variety  of  phenomena 
that  are  not  visible  to  the  eye,  including  radio  waves,  microwaves, 
x-rays,  and  gamma  rays.  These  are  the  “colors”  of  light  that  do  not 
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happen  to  fall  within  the  narrow  violet-to-red  range  of  the  rainbow 
that  we  can  see. 

self-check  B 

At  the  turn  of  the  20th  century,  a  strange  new  phenomenon  was  discov¬ 
ered  in  vacuum  tubes:  mysterious  rays  of  unknown  origin  and  nature. 
These  rays  are  the  same  as  the  ones  that  shoot  from  the  back  of  your 
TV’s  picture  tube  and  hit  the  front  to  make  the  picture.  Physicists  in 
1895  didn’t  have  the  faintest  idea  what  the  rays  were,  so  they  simply 
named  them  “cathode  rays,”  after  the  name  for  the  electrical  contact 
from  which  they  sprang.  A  fierce  debate  raged,  complete  with  national¬ 
istic  overtones,  over  whether  the  rays  were  a  form  of  light  or  of  matter. 
What  would  they  have  had  to  do  in  order  to  settle  the  issue?  > 

Answer,  p.  961 

Many  physical  phenomena  are  not  themselves  light  or  matter, 
but  are  properties  of  light  or  matter  or  interactions  between  light 
and  matter.  For  instance,  motion  is  a  property  of  all  light  and  some 
matter,  but  it  is  not  itself  light  or  matter.  The  pressure  that  keeps 
a  bicycle  tire  blown  up  is  an  interaction  between  the  air  and  the 
tire.  Pressure  is  not  a  form  of  matter  in  and  of  itself.  It  is  as 
much  a  property  of  the  tire  as  of  the  air.  Analogously,  sisterhood 
and  employment  are  relationships  among  people  but  are  not  people 
themselves. 

Some  things  that  appear  weightless  actually  do  have  weight,  and 
so  qualify  as  matter.  Air  has  weight,  and  is  thus  a  form  of  matter 
even  though  a  cubic  inch  of  air  weighs  less  than  a  grain  of  sand.  A 
helium  balloon  has  weight,  but  is  kept  from  falling  by  the  force  of  the 
surrounding  more  dense  air,  which  pushes  up  on  it.  Astronauts  in 
orbit  around  the  Earth  have  weight,  and  are  falling  along  a  curved 
arc,  but  they  are  moving  so  fast  that  the  curved  arc  of  their  fall 
is  broad  enough  to  carry  them  all  the  way  around  the  Earth  in  a 
circle.  They  perceive  themselves  as  being  weightless  because  their 
space  capsule  is  falling  along  with  them,  and  the  floor  therefore  does 
not  push  up  on  their  feet. 

Optional  Topic:  Modern  Changes  in  the  Definition  of  Light  and 
Matter 

Einstein  predicted  as  a  consequence  of  his  theory  of  relativity  that  light 
would  after  all  be  affected  by  gravity,  although  the  effect  would  be  ex¬ 
tremely  weak  under  normal  conditions.  His  prediction  was  borne  out 
by  observations  of  the  bending  of  light  rays  from  stars  as  they  passed 
close  to  the  sun  on  their  way  to  the  Earth.  Einstein’s  theory  also  implied 
the  existence  of  black  holes,  stars  so  massive  and  compact  that  their 
intense  gravity  would  not  even  allow  light  to  escape.  (These  days  there 
is  strong  evidence  that  black  holes  exist.) 

Einstein’s  interpretation  was  that  light  doesn’t  really  have  mass,  but 
that  energy  is  affected  by  gravity  just  like  mass  is.  The  energy  in  a  light 
beam  is  equivalent  to  a  certain  amount  of  mass,  given  by  the  famous 
equation  E  =  me2,  where  c  is  the  speed  of  light.  Because  the  speed 


c  /  This  telescope  picture  shows 
two  images  of  the  same  distant 
object,  an  exotic,  very  luminous 
object  called  a  quasar.  This  is 
interpreted  as  evidence  that  a 
massive,  dark  object,  possibly 
a  black  hole,  happens  to  be 
between  us  and  it.  Light  rays  that 
would  otherwise  have  missed  the 
earth  on  either  side  have  been 
bent  by  the  dark  object’s  gravity 
so  that  they  reach  us.  The  actual 
direction  to  the  quasar  is  presum¬ 
ably  in  the  center  of  the  image, 
but  the  light  along  that  central  line 
doesn’t  get  to  us  because  it  is 
absorbed  by  the  dark  object.  The 
quasar  is  known  by  its  catalog 
number,  MG1 131 +0456,  or  more 
informally  as  Einstein’s  Ring. 
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of  light  is  such  a  big  number,  a  large  amount  of  energy  is  equivalent  to 
only  a  very  small  amount  of  mass,  so  the  gravitational  force  on  a  light 
ray  can  be  ignored  for  most  practical  purposes. 

There  is  however  a  more  satisfactory  and  fundamental  distinction 
between  light  and  matter,  which  should  be  understandable  to  you  if  you 
have  had  a  chemistry  course.  In  chemistry,  one  learns  that  electrons 
obey  the  Pauli  exclusion  principle,  which  forbids  more  than  one  electron 
from  occupying  the  same  orbital  if  they  have  the  same  spin.  The  Pauli 
exclusion  principle  is  obeyed  by  the  subatomic  particles  of  which  matter 
is  composed,  but  disobeyed  by  the  particles,  called  photons,  of  which  a 
beam  of  light  is  made. 

Einstein’s  theory  of  relativity  is  discussed  more  fully  in  book  6  of  this 
series. 

The  boundary  between  physics  and  the  other  sciences  is  not 
always  clear.  For  instance,  chemists  study  atoms  and  molecules, 
which  are  what  matter  is  built  from,  and  there  are  some  scientists 
who  would  be  equally  willing  to  call  themselves  physical  chemists 
or  chemical  physicists.  It  might  seem  that  the  distinction  between 
physics  and  biology  would  be  clearer,  since  physics  seems  to  deal 
with  inanimate  objects.  In  fact,  almost  all  physicists  would  agree 
that  the  basic  laws  of  physics  that  apply  to  molecules  in  a  test  tube 
work  equally  well  for  the  combination  of  molecules  that  constitutes 
a  bacterium.  (Some  might  believe  that  something  more  happens  in 
the  minds  of  humans,  or  even  those  of  cats  and  dogs.)  What  differ¬ 
entiates  physics  from  biology  is  that  many  of  the  scientific  theories 
that  describe  living  things,  while  ultimately  resulting  from  the  fun¬ 
damental  laws  of  physics,  cannot  be  rigorously  derived  from  physical 
principles. 

Isolated  systems  and  reductionism 

To  avoid  having  to  study  everything  at  once,  scientists  isolate  the 
things  they  are  trying  to  study.  For  instance,  a  physicist  who  wants 
to  study  the  motion  of  a  rotating  gyroscope  would  probably  prefer 
that  it  be  isolated  from  vibrations  and  air  currents.  Even  in  biology, 
where  field  work  is  indispensable  for  understanding  how  living  things 
relate  to  their  entire  environment,  it  is  interesting  to  note  the  vital 
historical  role  played  by  Darwin’s  study  of  the  Galapagos  Islands, 
which  were  conveniently  isolated  from  the  rest  of  the  world.  Any 
part  of  the  universe  that  is  considered  apart  from  the  rest  can  be 
called  a  “system.” 

Physics  has  had  some  of  its  greatest  successes  by  carrying  this 
process  of  isolation  to  extremes,  subdividing  the  universe  into  smaller 
and  smaller  parts.  Matter  can  be  divided  into  atoms,  and  the  be¬ 
havior  of  individual  atoms  can  be  studied.  Atoms  can  be  split  apart 
into  their  constituent  neutrons,  protons  and  electrons.  Protons  and 
neutrons  appear  to  be  made  out  of  even  smaller  particles  called 
quarks,  and  there  have  even  been  some  claims  of  experimental  ev- 
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idence  that  quarks  have  smaller  parts  inside  them.  This  method 
of  splitting  things  into  smaller  and  smaller  parts  and  studying  how 
those  parts  influence  each  other  is  called  reductionism.  The  hope  is 
that  the  seemingly  complex  rules  governing  the  larger  units  can  be 
better  understood  in  terms  of  simpler  rules  governing  the  smaller 
units.  To  appreciate  what  reductionism  has  done  for  science,  it  is 
only  necessary  to  examine  a  19th-century  chemistry  textbook.  At 
that  time,  the  existence  of  atoms  was  still  doubted  by  some,  elec¬ 
trons  were  not  even  suspected  to  exist,  and  almost  nothing  was 
understood  of  what  basic  rules  governed  the  way  atoms  interacted 
with  each  other  in  chemical  reactions.  Students  had  to  memorize 
long  lists  of  chemicals  and  their  reactions,  and  there  was  no  way  to 
understand  any  of  it  systematically.  Today,  the  student  only  needs 
to  remember  a  small  set  of  rules  about  how  atoms  interact,  for  in¬ 
stance  that  atoms  of  one  element  cannot  be  converted  into  another 
via  chemical  reactions,  or  that  atoms  from  the  right  side  of  the  pe¬ 
riodic  table  tend  to  form  strong  bonds  with  atoms  from  the  left 
side. 

Discussion  Questions 

A  I’ve  suggested  replacing  the  ordinary  dictionary  definition  of  light 
with  a  more  technical,  more  precise  one  that  involves  weightlessness.  It’s 
still  possible,  though,  that  the  stuff  a  lightbulb  makes,  ordinarily  called 
“light,”  does  have  some  small  amount  of  weight.  Suggest  an  experiment 
to  attempt  to  measure  whether  it  does. 

B  Heat  is  weightless  (i.e.,  an  object  becomes  no  heavier  when  heated), 
and  can  travel  across  an  empty  room  from  the  fireplace  to  your  skin, 
where  it  influences  you  by  heating  you.  Should  heat  therefore  be  con¬ 
sidered  a  form  of  light  by  our  definition?  Why  or  why  not? 

C  Similarly,  should  sound  be  considered  a  form  of  light? 

0.1.3  How  to  learn  physics 

For  as  knowledges  are  now  delivered,  there  is  a  kind  of  contract  of 
error  between  the  deliverer  and  the  receiver;  for  he  that  delivereth 
knowledge  desireth  to  deliver  it  in  such  a  form  as  may  be  best  be¬ 
lieved,  and  not  as  may  be  best  examined;  and  he  that  receiveth 
knowledge  desireth  rather  present  satisfaction  than  expectant  in¬ 
quiry. 

Francis  Bacon 

Many  students  approach  a  science  course  with  the  idea  that  they 
can  succeed  by  memorizing  the  formulas,  so  that  when  a  problem 
is  assigned  on  the  homework  or  an  exam,  they  will  be  able  to  plug 
numbers  in  to  the  formula  and  get  a  numerical  result  on  their  cal¬ 
culator.  Wrong!  That’s  not  what  learning  science  is  about!  There 
is  a  big  difference  between  memorizing  formulas  and  understanding 
concepts.  To  start  with,  different  formulas  may  apply  in  different 
situations.  One  equation  might  represent  a  definition,  which  is  al¬ 
ways  true.  Another  might  be  a  very  specific  equation  for  the  speed 
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of  an  object  sliding  down  an  inclined  plane,  which  would  not  be  true 
if  the  object  was  a  rock  drifting  down  to  the  bottom  of  the  ocean. 
If  you  don’t  work  to  understand  physics  on  a  conceptual  level,  you 
won’t  know  which  formulas  can  be  used  when. 

Most  students  taking  college  science  courses  for  the  first  time 
also  have  very  little  experience  with  interpreting  the  meaning  of  an 
equation.  Consider  the  equation  w  =  A/h  relating  the  width  of  a 
rectangle  to  its  height  and  area.  A  student  who  has  not  developed 
skill  at  interpretation  might  view  this  as  yet  another  equation  to 
memorize  and  plug  in  to  when  needed.  A  slightly  more  savvy  stu¬ 
dent  might  realize  that  it  is  simply  the  familiar  formula  A  =  wh 
in  a  different  form.  When  asked  whether  a  rectangle  would  have 
a  greater  or  smaller  width  than  another  with  the  same  area  but 
a  smaller  height,  the  unsophisticated  student  might  be  at  a  loss, 
not  having  any  numbers  to  plug  in  on  a  calculator.  The  more  ex¬ 
perienced  student  would  know  how  to  reason  about  an  equation 
involving  division  —  if  h  is  smaller,  and  A  stays  the  same,  then  w 
must  be  bigger.  Often,  students  fail  to  recognize  a  sequence  of  equa¬ 
tions  as  a  derivation  leading  to  a  final  result,  so  they  think  all  the 
intermediate  steps  are  equally  important  formulas  that  they  should 
memorize. 

When  learning  any  subject  at  all,  it  is  important  to  become  as 
actively  involved  as  possible,  rather  than  trying  to  read  through 
all  the  information  quickly  without  thinking  about  it.  It  is  a  good 
idea  to  read  and  think  about  the  questions  posed  at  the  end  of  each 
section  of  these  notes  as  you  encounter  them,  so  that  you  know  you 
have  understood  what  you  were  reading. 

Many  students’  difficulties  in  physics  boil  down  mainly  to  diffi¬ 
culties  with  math.  Suppose  you  feel  confident  that  you  have  enough 
mathematical  preparation  to  succeed  in  this  course,  but  you  are 
having  trouble  with  a  few  specific  things.  In  some  areas,  the  brief 
review  given  in  this  chapter  may  be  sufficient,  but  in  other  areas 
it  probably  will  not.  Once  you  identify  the  areas  of  math  in  which 
you  are  having  problems,  get  help  in  those  areas.  Don’t  limp  along 
through  the  whole  course  with  a  vague  feeling  of  dread  about  some¬ 
thing  like  scientific  notation.  The  problem  will  not  go  away  if  you 
ignore  it.  The  same  applies  to  essential  mathematical  skills  that  you 
are  learning  in  this  course  for  the  first  time,  such  as  vector  addition. 

Sometimes  students  tell  me  they  keep  trying  to  understand  a 
certain  topic  in  the  book,  and  it  just  doesn’t  make  sense.  The  worst 
thing  you  can  possibly  do  in  that  situation  is  to  keep  on  staring 
at  the  same  page.  Every  textbook  explains  certain  things  badly  — 
even  mine!  —  so  the  best  thing  to  do  in  this  situation  is  to  look 
at  a  different  book.  Instead  of  college  textbooks  aimed  at  the  same 
mathematical  level  as  the  course  you’re  taking,  you  may  in  some 
cases  find  that  high  school  books  or  books  at  a  lower  math  level 
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give  clearer  explanations. 

Finally,  when  reviewing  for  an  exam,  don’t  simply  read  back 
over  the  text  and  your  lecture  notes.  Instead,  try  to  use  an  active 
method  of  reviewing,  for  instance  by  discussing  some  of  the  discus¬ 
sion  questions  with  another  student,  or  doing  homework  problems 
you  hadn’t  done  the  first  time. 

0.1.4  Velocity  and  acceleration 

Calculus  was  invented  by  a  physicist,  Isaac  Newton,  because 
he  needed  it  as  a  tool  for  calculating  velocity  and  acceleration;  in 
your  introductory  calculus  course,  velocity  and  acceleration  were 
probably  presented  as  some  of  the  first  applications. 

If  an  object’s  position  as  a  function  of  time  is  given  by  the  func¬ 
tion  x(f),  then  its  velocity  and  acceleration  are  given  by  the  first 
and  second  derivatives  with  respect  to  time, 


dx 


and 

cP  x 
dt2' 

The  notation  relates  in  a  logical  way  to  the  units  of  the  quantities. 
Velocity  has  units  of  m/s,  and  that  makes  sense  because  dx  is  inter¬ 
preted  as  an  infinitesimally  small  distance,  with  units  of  meters,  and 
dt  as  an  infinitesimally  small  time,  with  units  of  seconds.  The  seem¬ 
ingly  weird  and  inconsistent  placement  of  the  superscripted  twos  in 
the  notation  for  the  acceleration  is  likewise  meant  to  suggest  the 
units:  something  on  top  with  units  of  meters,  and  something  on  the 
bottom  with  units  of  seconds  squared. 

Velocity  and  acceleration  have  completely  different  physical  in¬ 
terpretations.  Velocity  is  a  matter  of  opinion.  Right  now  as  you  sit 
in  a  chair  and  read  this  book,  you  could  say  that  your  velocity  was 
zero,  but  an  observer  watching  the  Earth  rotate  would  say  that  you 
had  a  velocity  of  hundreds  of  miles  an  hour.  Acceleration  represents 
a  change  in  velocity,  and  it’s  not  a  matter  of  opinion.  Accelerations 
produce  physical  effects,  and  don’t  occur  unless  there’s  a  force  to 
cause  them.  For  example,  gravitational  forces  on  Earth  cause  falling 
objects  to  have  an  acceleration  of  9.8  m/s2. 

Constant  acceleration  example  1 

>  How  high  does  a  diving  board  have  to  be  above  the  water  if  the 
diver  is  to  have  as  much  as  1 .0  s  in  the  air? 

o  The  diver  starts  at  rest,  and  has  an  acceleration  of  9.8  m/s2. 
We  need  to  find  a  connection  between  the  distance  she  travels 
and  time  it  takes.  In  other  words,  we’re  looking  for  information 
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about  the  function  x(t),  given  information  about  the  acceleration. 
To  go  from  acceleration  to  position,  we  need  to  integrate  twice: 


adf  df 


{at  +  v0)  d t  [v0  is  a  constant  of  integration.] 
atdt  [v0  is  zero  because  she’s  dropping  from  rest.] 


2af  +*o 


[x0  is  a  constant  of  integration.] 


=  -ar  [x0  can  be  zero  if  we  define  it  that  way.] 


Note  some  of  the  good  problem-solving  habits  demonstrated  here. 
We  solve  the  problem  symbolically,  and  only  plug  in  numbers  at 
the  very  end,  once  all  the  algebra  and  calculus  are  done.  One 
should  also  make  a  habit,  after  finding  a  symbolic  result,  of  check¬ 
ing  whether  the  dependence  on  the  variables  make  sense.  A 
greater  value  of  t  in  this  expression  would  lead  to  a  greater  value 
for  x;  that  makes  sense,  because  if  you  want  more  time  in  the 
air,  you’re  going  to  have  to  jump  from  higher  up.  A  greater  ac¬ 
celeration  also  leads  to  a  greater  height;  this  also  makes  sense, 
because  the  stronger  gravity  is,  the  more  height  you’ll  need  in  or¬ 
der  to  stay  in  the  air  for  a  given  amount  of  time.  Now  we  plug  in 
numbers. 


x  =  1  ^9.8  m/s2j  (1 .0  s)2 
=  4.9  m 


Note  that  when  we  put  in  the  numbers,  we  check  that  the  units 
work  out  correctly,  (m/s2)  (s)2  =  m.  We  should  also  check  that 
the  result  makes  sense:  4.9  meters  is  pretty  high,  but  not  unrea¬ 
sonable. 


The  notation  dg  in  calculus  represents  an  infinitesimally  small 
change  in  the  variable  q.  The  corresponding  notation  for  a  finite 
change  in  a  variable  is  A q.  For  example,  if  q  represents  the  value 
of  a  certain  stock  on  the  stock  market,  and  the  value  falls  from 
q0  =  5  dollars  initially  to  qq  =  3  dollars  finally,  then  A q  =  —2 
dollars.  When  we  study  linear  functions,  whose  slopes  are  constant, 
the  derivative  is  synonymous  with  the  slope  of  the  line,  and  d y/ dx 
is  the  same  thing  as  Ay/ Ax,  the  rise  over  the  run. 


Under  conditions  of  constant  acceleration,  we  can  relate  velocity 
and  time, 

Av 
a  =  w— , 

A  f  ’ 

or,  as  in  the  example  1,  position  and  time, 


1  2 

x  =  -at  +  vat  +  xQ. 
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It  can  also  be  handy  to  have  a  relation  involving  velocity  and  posi¬ 
tion,  eliminating  time.  Straightforward  algebra  gives 


vj:  =  +  2aAx, 

where  Vf  is  the  final  velocity,  v0  the  initial  velocity,  and  Ax  the 
distance  traveled. 


>  Solved  problem:  Dropping  a  rock  on  Mars  page  50,  problem  1 7 


i>  Solved  problem:  The  Dodge  Viper 


page  51,  problem  19 


0.1.5  Self-evaluation 

The  introductory  part  of  a  book  like  this  is  hard  to  write,  because 
every  student  arrives  at  this  starting  point  with  a  different  prepara¬ 
tion.  One  student  may  have  grown  up  outside  the  U.S.  and  so  may 
be  completely  comfortable  with  the  metric  system,  but  may  have 
had  an  algebra  course  in  which  the  instructor  passed  too  quickly 
over  scientific  notation.  Another  student  may  have  already  taken 
vector  calculus,  but  may  have  never  learned  the  metric  system.  The 
following  self-evaluation  is  a  checklist  to  help  you  figure  out  what 
you  need  to  study  to  be  prepared  for  the  rest  of  the  course. 


If  you  disagree  with  this  state¬ 
ment.  . . 

you  should  study  this  section: 

I  am  familiar  with  the  basic  metric 
units  of  meters,  kilograms,  and  sec¬ 
onds,  and  the  most  common  metric 
prefixes:  rnilli-  (nr),  kilo-  (k),  and 
centi-  (c). 

subsection  0.1.6  Basic  of  the  Metric 
System 

I  am  familiar  with  these  less  com¬ 
mon  metric  prefixes:  mega-  (M), 
micro-  (p),  and  nano-  (n). 

subsection  0.1.7  Less  Common  Met¬ 
ric  Prefixes 

I  am  comfortable  with  scientific  no¬ 
tation. 

subsection  0.1.8  Scientific  Notation 

I  can  confidently  do  metric  conver¬ 
sions. 

subsection  0.1.9  Conversions 

I  understand  the  purpose  and  use  of 
significant  figures. 

subsection  0.1.10  Significant  Figures 

It  wouldn’t  hurt  you  to  skim  the  sections  you  think  you  already 
know  about,  and  to  do  the  self-checks  in  those  sections. 
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0.1.6  Basics  of  the  metric  system 

The  metric  system 

Units  were  not  standardized  until  fairly  recently  in  history,  so 
when  the  physicist  Isaac  Newton  gave  the  result  of  an  experiment 
with  a  pendulum,  he  had  to  specify  not  just  that  the  string  was  37 
/  8  inches  long  but  that  it  was  “37  7 /s  London  inches  long.”  The 
inch  as  defined  in  Yorkshire  would  have  been  different.  Even  after 
the  British  Empire  standardized  its  units,  it  was  still  very  inconve¬ 
nient  to  do  calculations  involving  money,  volume,  distance,  time,  or 
weight,  because  of  all  the  odd  conversion  factors,  like  16  ounces  in 
a  pound,  and  5280  feet  in  a  mile.  Through  the  nineteenth  century, 
schoolchildren  squandered  most  of  their  mathematical  education  in 
preparing  to  do  calculations  such  as  making  change  when  a  customer 
in  a  shop  offered  a  one-crown  note  for  a  book  costing  two  pounds, 
thirteen  shillings  and  tuppence.  The  dollar  has  always  been  decimal, 
and  British  money  went  decimal  decades  ago,  but  the  United  States 
is  still  saddled  with  the  antiquated  system  of  feet,  inches,  pounds, 
ounces  and  so  on. 

Every  country  in  the  world  besides  the  U.S.  uses  a  system  of 
units  known  in  English  as  the  “metric  system.1”  This  system  is 
entirely  decimal,  thanks  to  the  same  eminently  logical  people  who 
brought  about  the  French  Revolution.  In  deference  to  France,  the 
system’s  official  name  is  the  Systeme  International,  or  SI,  meaning 
International  System.  (The  phrase  “SI  system”  is  therefore  redun¬ 
dant.) 

The  wonderful  thing  about  the  SI  is  that  people  who  live  in 
countries  more  modern  than  ours  do  not  need  to  memorize  how 
many  ounces  there  are  in  a  pound,  how  many  cups  in  a  pint,  how 
many  feet  in  a  mile,  etc.  The  whole  system  works  with  a  single, 
consistent  set  of  Greek  and  Latin  prefixes  that  modify  the  basic 
units.  Each  prefix  stands  for  a  power  of  ten,  and  has  an  abbreviation 
that  can  be  combined  with  the  symbol  for  the  unit.  For  instance, 
the  meter  is  a  unit  of  distance.  The  prefix  kilo-  stands  for  103,  so  a 
kilometer,  1  km,  is  a  thousand  meters. 

The  basic  units  of  the  metric  system  are  the  meter  for  distance, 


the  second  for  time,  and  the  gram 

for  mass. 

The  following 
memorize  them. 

are  the  most  common  metric  prefixes.  You  should 

prefix 

meaning 

example 

kilo-  k 

103  60  kg 

=  a  person’s  mass 

centi-  c 

10~2  28  cm 

=  height  of  a  piece  of  paper 

milli-  m 

10~3  1  ms 

=  time  for  one  vibration  of  a  guitar 
string  playing  the  note  D 

1  Liberia  and  Myanmar  have  not  legally  adopted  metric  units,  but  use  them 
in  everyday  life. 
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The  prefix  centi-,  meaning  10-2,  is  only  used  in  the  centimeter; 
a  hundredth  of  a  gram  would  not  be  written  as  1  eg  but  as  10  mg. 
The  centi-  prefix  can  be  easily  remembered  because  a  cent  is  10~2 
dollars.  The  official  SI  abbreviation  for  seconds  is  “s”  (not  “sec”) 
and  grams  are  “g”  (not  “gm”). 

The  second 

When  I  stated  briefly  above  that  the  second  was  a  unit  of  time,  it 
may  not  have  occurred  to  you  that  this  was  not  much  of  a  definition. 
We  can  make  a  dictionary-style  definition  of  a  term  like  “time,”  or 
give  a  general  description  like  Isaac  Newton’s:  “Absolute,  true,  and 
mathematical  time,  of  itself,  and  from  its  own  nature,  flows  equably 
without  relation  to  anything  external. . .  ”  Newton’s  characterization 
sounds  impressive,  but  physicists  today  would  consider  it  useless  as 
a  definition  of  time.  Today,  the  physical  sciences  are  based  on  oper¬ 
ational  definitions,  which  means  definitions  that  spell  out  the  actual 
steps  (operations)  required  to  measure  something  numerically. 

In  an  era  when  our  toasters,  pens,  and  coffee  pots  tell  us  the 
time,  it  is  far  from  obvious  to  most  people  what  is  the  fundamental 
operational  definition  of  time.  Until  recently,  the  hour,  minute,  and 
second  were  defined  operationally  in  terms  of  the  time  required  for 
the  earth  to  rotate  about  its  axis.  Unfortunately,  the  Earth’s  ro¬ 
tation  is  slowing  down  slightly,  and  by  1967  this  was  becoming  an 
issue  in  scientific  experiments  requiring  precise  time  measurements. 
The  second  was  therefore  redefined  as  the  time  required  for  a  cer¬ 
tain  number  of  vibrations  of  the  light  waves  emitted  by  a  cesium 
atoms  in  a  lamp  constructed  like  a  familiar  neon  sign  but  with  the 
neon  replaced  by  cesium.  The  new  definition  not  only  promises  to 
stay  constant  indefinitely,  but  for  scientists  is  a  more  convenient 
way  of  calibrating  a  clock  than  having  to  carry  out  astronomical 
measurements. 

self-check  C 

What  is  a  possible  operational  definition  of  how  strong  a  person  is?  c> 

Answer,  p.  961 

The  meter 

The  French  originally  defined  the  meter  as  1CU7  times  the  dis¬ 
tance  from  the  equator  to  the  north  pole,  as  measured  through  Paris 
(of  course).  Even  if  the  definition  was  operational,  the  operation  of 
traveling  to  the  north  pole  and  laying  a  surveying  chain  behind  you 
was  not  one  that  most  working  scientists  wanted  to  carry  out.  Fairly 
soon,  a  standard  was  created  in  the  form  of  a  metal  bar  with  two 
scratches  on  it.  This  was  replaced  by  an  atomic  standard  in  1960, 
and  finally  in  1983  by  the  current  definition,  which  is  that  the  me¬ 
ter  is  the  distance  traveled  by  light  in  a  vacuum  over  a  period  of 
(1/299792458)  seconds. 


e/The  original  definition  of 
the  meter. 
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f  /  A  duplicate  of  the  Paris 
kilogram,  maintained  at  the  Dan¬ 
ish  National  Metrology  Institute. 


The  kilogram 

The  third  base  unit  of  the  SI  is  the  kilogram,  a  unit  of  mass. 
Mass  is  intended  to  be  a  measure  of  the  amount  of  a  substance, 
but  that  is  not  an  operational  definition.  Bathroom  scales  work  by 
measuring  our  planet’s  gravitational  attraction  for  the  object  being 
weighed,  but  using  that  type  of  scale  to  define  mass  operationally 
would  be  undesirable  because  gravity  varies  in  strength  from  place 
to  place  on  the  earth. 

There’s  a  surprising  amount  of  disagreement  among  physics  text¬ 
books  about  how  mass  should  be  defined,  but  here’s  how  it’s  actually 
handled  by  the  few  working  physicists  who  specialize  in  ultra-high- 
precision  measurements.  They  maintain  a  physical  object  in  Paris, 
which  is  the  standard  kilogram,  a  cylinder  made  of  platinum-iridium 
alloy.  Duplicates  are  checked  against  this  mother  of  all  kilograms 
by  putting  the  original  and  the  copy  on  the  two  opposite  pans  of  a 
balance.  Although  this  method  of  comparison  depends  on  gravity, 
the  problems  associated  with  differences  in  gravity  in  different  geo¬ 
graphical  locations  are  bypassed,  because  the  two  objects  are  being 
compared  in  the  same  place.  The  duplicates  can  then  be  removed 
from  the  Parisian  kilogram  shrine  and  transported  elsewhere  in  the 
world.  It  would  be  desirable  to  replace  this  at  some  point  with  a 
universally  accessible  atomic  standard  rather  than  one  based  on  a 
specific  artifact,  but  as  of  2010  the  technology  for  automated  count¬ 
ing  of  large  numbers  of  atoms  has  not  gotten  good  enough  to  make 
that  work  with  the  desired  precision. 

Combinations  of  metric  units 

Just  about  anything  you  want  to  measure  can  be  measured  with 
some  combination  of  meters,  kilograms,  and  seconds.  Speed  can  be 
measured  in  m/s,  volume  in  m3,  and  density  in  kg/m3.  Part  of  what 
makes  the  SI  great  is  this  basic  simplicity.  No  more  funny  units  like 
a  cord  of  wood,  a  bolt  of  cloth,  or  a  jigger  of  whiskey.  No  more 
liquid  and  dry  measure.  Just  a  simple,  consistent  set  of  units.  The 
SI  measures  put  together  from  meters,  kilograms,  and  seconds  make 
up  the  inks  system.  For  example,  the  inks  unit  of  speed  is  m/s,  not 
km/ hr. 

Checking  units 

A  useful  technique  for  finding  mistakes  in  one’s  algebra  is  to 
analyze  the  units  associated  with  the  variables. 

Checking  units  example  2 

>  Jae  starts  from  the  formula  V  =  \Ah  for  the  volume  of  a  cone, 
where  A  is  the  area  of  its  base,  and  h  is  its  height.  He  wants  to 
find  an  equation  that  will  tell  him  how  tall  a  conical  tent  has  to  be 
in  order  to  have  a  certain  volume,  given  its  radius.  His  algebra 
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goes  like  this: 


[1] 

[2] 

[3] 

[4] 


v-lAh 

A  =  nr2 


V  =  - nr2h 
3 


Is  his  algebra  correct?  If  not,  find  the  mistake. 

>  Line  4  is  supposed  to  be  an  equation  for  the  height,  so  the  units 
of  the  expression  on  the  right-hand  side  had  better  equal  meters. 
The  pi  and  the  3  are  unitless,  so  we  can  ignore  them.  In  terms  of 
units,  line  4  becomes 


m2  _  1 
m3  m ' 


This  is  false,  so  there  must  be  a  mistake  in  the  algebra.  The  units 
of  lines  1 , 2,  and  3  check  out,  so  the  mistake  must  be  in  the  step 
from  line  3  to  line  4.  In  fact  the  result  should  have  been 


3 

7tr2 


Now  the  units  check:  m  =  m3/m2. 

Discussion  Question 

A  Isaac  Newton  wrote,  “. . .  the  natural  days  are  truly  unequal,  though 
they  are  commonly  considered  as  equal,  and  used  for  a  measure  of 
time. . .  It  may  be  that  there  is  no  such  thing  as  an  equable  motion,  whereby 
time  may  be  accurately  measured.  All  motions  may  be  accelerated  or  re¬ 
tarded.  . .”  Newton  was  right.  Even  the  modern  definition  of  the  second 
in  terms  of  light  emitted  by  cesium  atoms  is  subject  to  variation.  For  in¬ 
stance,  magnetic  fields  could  cause  the  cesium  atoms  to  emit  light  with 
a  slightly  different  rate  of  vibration.  What  makes  us  think,  though,  that  a 
pendulum  clock  is  more  accurate  than  a  sundial,  or  that  a  cesium  atom 
is  a  more  accurate  timekeeper  than  a  pendulum  clock?  That  is,  how  can 
one  test  experimentally  how  the  accuracies  of  different  time  standards 
compare? 
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Nine  little 


10  -9 

nano 

* — [nuns 

10  -6 

micro 

—/mix 

10  -3 

milli 

<.^milky 

10  3 

kilo 

10  6 

mega 

—/mugs 

g  /  This 

is  a 

mnemonic 

help  you  remember  the  most  im¬ 
portant  metric  prefixes.  The  word 
“little”  is  to  remind  you  that  the 
list  starts  with  the  prefixes  used 
for  small  quantities  and  builds 
upward.  The  exponent  changes 
by  3,  except  that  of  course  that 
we  do  not  need  a  special  prefix 
for  10°,  which  equals  one. 


0.1.7  Less  common  metric  prefixes 


The  following  are  three  metric  prefixes  which,  while  less  common 
than  the  ones  discussed  previously,  are  well  worth  memorizing. 


prefix 

meaning 

example 

mega-  M 

106 

6.4  Mm 

=  radius  of  the  earth 

micro-  /i 

10~6 

10  /jrn 

=  size  of  a  white  blood  cell 

nano-  n 

10”9 

0.154  nm 

=  distance  between  carbon 

nuclei  in  an  ethane  molecule 


Note  that  the  abbreviation  for  micro  is  the  Greek  letter  mu,  /r 
—  a  common  mistake  is  to  confuse  it  with  nr  (milli)  or  M  (mega). 

There  are  other  prefixes  even  less  common,  used  for  extremely 
large  and  small  quantities.  For  instance,  1  femtonreter  =  10~15  nr  is 
a  convenient  unit  of  distance  in  nuclear  physics,  and  1  gigabyte  = 
109  bytes  is  used  for  computers’  hard  disks.  The  international  com¬ 
mittee  that  makes  decisions  about  the  SI  has  recently  even  added 
some  new  prefixes  that  sound  like  jokes,  e.g.,  1  yoctogram  =  10~24  g 
is  about  half  the  mass  of  a  proton.  In  the  immediate  future,  how¬ 
ever,  you’re  unlikely  to  see  prefixes  like  “yocto-”  and  “zepto-”  used 
except  perhaps  in  trivia  contests  at  science-fiction  conventions  or 
other  geekfests. 

self-check  D 

Suppose  you  could  slow  down  time  so  that  according  to  your  perception, 
a  beam  of  light  would  move  across  a  room  at  the  speed  of  a  slow  walk. 
If  you  perceived  a  nanosecond  as  if  it  was  a  second,  how  would  you 
perceive  a  microsecond?  >  Answer,  p.  961 

0.1.8  Scientific  notation 

Most  of  the  interesting  phenomena  in  our  universe  are  not  on 
the  human  scale.  It  would  take  about  1,000,000,000,000,000,000,000 
bacteria  to  equal  the  mass  of  a  human  body.  When  the  physicist 
Thomas  Young  discovered  that  light  was  a  wave,  it  was  back  in  the 
bad  old  days  before  scientific  notation,  and  he  was  obliged  to  write 
that  the  time  required  for  one  vibration  of  the  wave  was  1/500  of 
a  millionth  of  a  millionth  of  a  second.  Scientific  notation  is  a  less 
awkward  way  to  write  very  large  and  very  small  numbers  such  as 
these.  Here’s  a  quick  review. 

Scientific  notation  means  writing  a  number  in  terms  of  a  product 
of  something  from  1  to  10  and  something  else  that  is  a  power  of  ten. 
For  instance, 

32  =  3.2  x  101 
320  =  3.2  x  102 
3200  =  3.2  x  103  ... 

Each  number  is  ten  times  bigger  than  the  previous  one. 

Since  101  is  ten  times  smaller  than  102  ,  it  makes  sense  to  use 
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the  notation  10°  to  stand  for  one,  the  number  that  is  in  turn  ten 
times  smaller  than  101  .  Continuing  on,  we  can  write  10_1  to  stand 
for  0.1,  the  number  ten  times  smaller  than  10°  .  Negative  exponents 
are  used  for  small  numbers: 


3.2  =  3.2  x  10° 

0.32  =  3.2  x  10_1 
0.032  =  3.2  x  10~2  . . . 

A  common  source  of  confusion  is  the  notation  used  on  the  dis¬ 
plays  of  many  calculators.  Examples: 


3.2  x  106  (written  notation) 

3.2E+6  (notation  on  some  calculators) 

3.26  (notation  on  some  other  calculators) 

The  last  example  is  particularly  unfortunate,  because  3.26  really 
stands  for  the  number  3.2  x  3.2  x  3.2  x  3.2  x  3.2  x  3.2  =  1074,  a 
totally  different  number  from  3.2  x  106  =  3200000.  The  calculator 
notation  should  never  be  used  in  writing.  It’s  just  a  way  for  the 
manufacturer  to  save  money  by  making  a  simpler  display. 

self-check  E 

A  student  learns  that  1 04  bacteria,  standing  in  line  to  register  for  classes 
at  Paramecium  Community  College,  would  form  a  queue  of  this  size: 


The  student  concludes  that  1 02  bacteria  would  form  a  line  of  this  length: 


Why  is  the  student  incorrect?  >  Answer,  p.  961 

0.1.9  Conversions 

Conversions  are  one  of  the  three  essential  mathematical  skills, 
summarized  on  pp. 945-947,  that  you  need  for  success  in  this  course. 

I  suggest  you  avoid  memorizing  lots  of  conversion  factors  be¬ 
tween  SI  units  and  U.S.  units,  but  two  that  do  come  in  handy  are: 

1  inch  =  2.54  cm 

An  object  with  a  weight  on  Earth  of  2.2  pounds-force  has  a 
mass  of  1  kg. 

The  first  one  is  the  present  definition  of  the  inch,  so  it’s  exact.  The 
second  one  is  not  exact,  but  is  good  enough  for  most  purposes.  (U.S. 
units  of  force  and  mass  are  confusing,  so  it’s  a  good  thing  they’re 
not  used  in  science.  In  U.S.  units,  the  unit  of  force  is  the  pound- 
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force,  and  the  best  unit  to  use  for  mass  is  the  slug,  which  is  about 
14.6  kg.) 

More  important  than  memorizing  conversion  factors  is  under¬ 
standing  the  right  method  for  doing  conversions.  Even  within  the 
SI,  you  may  need  to  convert,  say,  from  grams  to  kilograms.  Differ¬ 
ent  people  have  different  ways  of  thinking  about  conversions,  but 
the  method  I’ll  describe  here  is  systematic  and  easy  to  understand. 
The  idea  is  that  if  1  kg  and  1000  g  represent  the  same  mass,  then 
we  can  consider  a  fraction  like 

103  g 
1  kg 

to  be  a  way  of  expressing  the  number  one.  This  may  bother  you.  For 
instance,  if  you  type  1000/1  into  your  calculator,  you  will  get  1000, 
not  one.  Again,  different  people  have  different  ways  of  thinking 
about  it,  but  the  justification  is  that  it  helps  us  to  do  conversions, 
and  it  works!  Now  if  we  want  to  convert  0.7  kg  to  units  of  grams, 
we  can  multiply  kg  by  the  number  one: 


0.7  kg  x 


103  g 
1  kg 


If  you’re  willing  to  treat  symbols  such  as  “kg”  as  if  they  were  vari¬ 
ables  as  used  in  algebra  (which  they’re  really  not),  you  can  then 
cancel  the  kg  on  top  with  the  kg  on  the  bottom,  resulting  in 


o.7  x 


103  g 

i  y4> 


700  g. 


To  convert  grams  to  kilograms,  you  would  simply  flip  the  fraction 
upside  down. 


One  advantage  of  this  method  is  that  it  can  easily  be  applied  to 
a  series  of  conversions.  For  instance,  to  convert  one  year  to  units  of 
seconds, 


1  year  x 


365  claye 
1  year 


24  h&afs 
1  jfajf 


60  jnifi  60  s 

— ; -  x  - ^  = 

1  Jaercrf  1  jam 

=  3.15  x  107  s. 


Should  that  exponent  be  positive,  or  negative? 

A  common  mistake  is  to  write  the  conversion  fraction  incorrectly. 
For  instance  the  fraction 

103  kg 

-  ( mcorrec 

1  g 

does  not  equal  one,  because  103  kg  is  the  mass  of  a  car,  and  1  g  is 
the  mass  of  a  raisin.  One  correct  way  of  setting  up  the  conversion 
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factor  would  be 


(correct). 


10  3  kg 
1  g 

You  can  usually  detect  such  a  mistake  if  you  take  the  time  to  check 
your  answer  and  see  if  it  is  reasonable. 

If  common  sense  doesn’t  rule  out  either  a  positive  or  a  negative 
exponent,  here’s  another  way  to  make  sure  you  get  it  right.  There 
are  big  prefixes  and  small  prefixes: 

big  prefixes:  k  M 

small  prefixes:  m  //  n 

(It’s  not  hard  to  keep  straight  which  are  which,  since  “mega”  and 
“micro”  are  evocative,  and  it’s  easy  to  remember  that  a  kilometer 
is  bigger  than  a  meter  and  a  millimeter  is  smaller.)  In  the  example 
above,  we  want  the  top  of  the  fraction  to  be  the  same  as  the  bottom. 
Since  k  is  a  big  prefix,  we  need  to  compensate  by  putting  a  small 
number  like  10~3  in  front  of  it,  not  a  big  number  like  103. 

>  Solved  problem:  a  simple  conversion  page  48,  problem  6 

>  Solved  problem:  the  geometric  mean  page  48,  problem  8 

Discussion  Question 

A  Each  of  the  following  conversions  contains  an  error.  In  each  case, 
explain  what  the  error  is. 

(a)  1000  kgx^  =  1g 

(b)  50  m  x  =  o.5cm 

(c)  "Nano”  is  10~9,  so  there  are  10~9  nm  in  a  meter. 

(d)  “Micro”  is  1 0  6,  so  1  kg  is  106  gg. 

0.1.10  Significant  figures 

The  international  governing  body  for  football  (“soccer”  in  the 
US)  says  the  ball  should  have  a  circumference  of  68  to  70  cm.  Tak¬ 
ing  the  middle  of  this  range  and  dividing  by  it  gives  a  diameter  of 
approximately  21.96338214668155633610595934540698196  cm.  The 
digits  after  the  first  few  are  completely  meaningless.  Since  the  cir¬ 
cumference  could  have  varied  by  about  a  centimeter  in  either  direc¬ 
tion,  the  diameter  is  fuzzy  by  something  like  a  third  of  a  centimeter. 
We  say  that  the  additional,  random  digits  are  not  significant  figures. 
If  you  write  down  a  number  with  a  lot  of  gratuitous  insignificant  fig¬ 
ures,  it  shows  a  lack  of  scientific  literacy  and  imples  to  other  people 
a  greater  precision  than  you  really  have. 

As  a  rule  of  thumb,  the  result  of  a  calculation  has  as  many 
significant  figures,  or  “sig  figs,”  as  the  least  accurate  piece  of  data 
that  went  in.  In  the  example  with  the  soccer  ball,  it  didn’t  do  us  any 
good  to  know  n  to  dozens  of  digits,  because  the  bottleneck  in  the 
precision  of  the  result  was  the  figure  for  the  circumference,  which 
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was  two  sig  figs.  The  result  is  22  cm.  The  rule  of  thumb  works  best 
for  multiplication  and  division. 

For  calculations  involving  multiplication  and  division,  a  given 
fractional  or  “percent”  error  in  one  of  the  inputs  causes  the  same 
fractional  error  in  the  output.  The  number  of  digits  in  a  number 
provides  a  rough  measure  of  its  possible  fractional  error.  These  are 
called  significant  figures  or  “sig  figs.”  Examples: 


3.14 

3  sig  figs 

3.1 

2  sig  figs 

0.03 

1  sig  fig,  because  the  zeroes  are  just  placeholders 

3.0  x  101 

2  sig  figs 

30 

could  be  1  or  2  sig  figs,  since  we  can’t  tell  if  the 

0  is  a  placeholder  or  a  real  sig  fig 

In  such  calculations,  your  result  should  not  have  more  than  the 
number  of  sig  figs  in  the  least  accurate  piece  of  data  you  started 
with. 

Sig  figs  in  the  area  of  a  triangle  example  3 

t>  A  triangle  has  an  area  of  6.45  m2  and  a  base  with  a  width  of 
4.0138  m.  Find  its  height. 

>  The  area  is  related  to  the  base  and  height  by  A  =  bh/2. 


=  3.21 391 200358762  m  (calculator  output) 

=  3.21  m 

The  given  data  were  3  sig  figs  and  5  sig  figs.  We’re  limited  by  the 
less  accurate  piece  of  data,  so  the  final  result  is  3  sig  figs.  The 
additional  digits  on  the  calculator  don’t  mean  anything,  and  if  we 
communicated  them  to  another  person,  we  would  create  the  false 
impression  of  having  determined  h  with  more  precision  than  we 
really  obtained. 

self-check  F 

The  following  quote  is  taken  from  an  editorial  by  Norimitsu  Onishi  in  the 
New  York  Times,  August  1 8,  2002. 

Consider  Nigeria.  Everyone  agrees  it  is  Africa’s  most  populous 
nation.  But  what  is  its  population?  The  United  Nations  says 
114  million;  the  State  Department,  120  million.  The  World  Bank 
says  126.9  million,  while  the  Central  Intelligence  Agency  puts  it 
at  126,635,626. 

What  should  bother  you  about  this?  >  Answer,  p.  961 

Dealing  correctly  with  significant  figures  can  save  you  time!  Of- 
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ten,  students  copy  down  numbers  from  their  calculators  with  eight 
significant  figures  of  precision,  then  type  them  back  in  for  a  later 
calculation.  That’s  a  waste  of  time,  unless  your  original  data  had 
that  kind  of  incredible  precision. 

self-check  G 

How  many  significant  figures  are  there  in  each  of  the  following  mea¬ 
surements? 

(1)  9.937  m 

(2)  4.0  s 

(3)  0.0000000000000037  kg  >  Answer,  p.  961 

The  rules  about  significant  figures  are  only  rules  of  thumb,  and 
are  not  a  substitute  for  careful  thinking.  For  instance,  $20.00  + 
$0.05  is  $20.05.  It  need  not  and  should  not  be  rounded  off  to  $20. 
In  general,  the  sig  fig  rules  work  best  for  multiplication  and  division, 
and  we  sometimes  also  apply  them  when  doing  a  complicated  calcu¬ 
lation  that  involves  many  types  of  operations.  For  simple  addition 
and  subtraction,  it  makes  more  sense  to  maintain  a  fixed  number  of 
digits  after  the  decimal  point. 

When  in  doubt,  don’t  use  the  sig  fig  rules  at  all.  Instead,  in¬ 
tentionally  change  one  piece  of  your  initial  data  by  the  maximum 
amount  by  which  you  think  it  could  have  been  off,  and  recalculate 
the  final  result.  The  digits  on  the  end  that  are  completely  reshuffled 
are  the  ones  that  are  meaningless,  and  should  be  omitted. 

A  nonlinear  function  example  4 

>  How  many  sig  figs  are  there  in  sin  88.7°? 

>  We’re  using  a  sine  function,  which  isn’t  addition,  subtraction, 
multiplication,  or  division.  It  would  be  reasonable  to  guess  that 
since  the  input  angle  had  3  sig  figs,  so  would  the  output.  But  if 
this  was  an  important  calculation  and  we  really  needed  to  know, 
we  would  do  the  following: 

sin  88.7°  =  0.999742609322698 
sin  88.8°  =  0.999780683474846 

Surprisingly,  the  result  appears  to  have  as  many  as  5  sig  figs,  not 
just  3: 

sin  88.7°  =  0.99974, 

where  the  final  4  is  uncertain  but  may  have  some  significance. 
The  unexpectedly  high  precision  of  the  result  is  because  the  sine 
function  is  nearing  its  maximum  at  90  degrees,  where  the  graph 
flattens  out  and  becomes  insensitive  to  the  input  angle. 

0.1 .1 1  A  note  about  diagrams 

A  quick  note  about  diagrams.  Often  when  you  solve  a  problem, 
the  best  way  to  get  started  and  organize  your  thoughts  is  by  draw¬ 
ing  a  diagram.  For  an  artist,  it’s  desirable  to  be  able  to  draw  a 


h  /  A  diagram  of  a  tomato. 
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recognizable,  realistic,  perspective  picture  of  a  tomato,  like  the  one 
at  the  top  of  figure  h.  But  in  science  and  engineering,  we  usually 
don’t  draw  solid  figures  in  perspective,  because  that  would  make  it 
difficult  to  label  distances  and  angles.  Usually  we  want  views  or 
cross-sections  that  project  the  object  into  its  planes  of  symmetry, 
as  in  the  line  drawings  in  the  figure. 
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0.2  Scaling  and  Order-of-Magnitude  Estimates 

0.2.1  Introduction 

Why  can’t  an  insect  be  the  size  of  a  dog?  Some  skinny  stretched- 
out  cells  in  your  spinal  cord  are  a  meter  tall  —  why  does  nature 
display  no  single  cells  that  are  not  just  a  meter  tall,  but  a  meter 
wide,  and  a  meter  thick  as  well?  Believe  it  or  not,  these  are  questions 
that  can  be  answered  fairly  easily  without  knowing  much  more  about 
physics  than  you  already  do.  The  only  mathematical  technique  you 
really  need  is  the  humble  conversion,  applied  to  area  and  volume. 

Area  and  volume 

Area  can  be  defined  by  saying  that  we  can  copy  the  shape  of 
interest  onto  graph  paper  with  1  cm  x  1  cm  squares  and  count  the 
number  of  squares  inside.  Fractions  of  squares  can  be  estimated  by 
eye.  We  then  say  the  area  equals  the  number  of  squares,  in  units  of 
square  cm.  Although  this  might  seem  less  “pure”  than  computing 
areas  using  formulae  like  A  =  tt r2  for  a  circle  or  A  =  wh/2  for  a 
triangle,  those  formulae  are  not  useful  as  definitions  of  area  because 
they  cannot  be  applied  to  irregularly  shaped  areas. 

Units  of  square  cm  are  more  commonly  written  as  cm2  in  science. 
Of  course,  the  unit  of  measurement  symbolized  by  “cm”  is  not  an 
algebra  symbol  standing  for  a  number  that  can  be  literally  multiplied 
by  itself.  But  it  is  advantageous  to  write  the  units  of  area  that  way 
and  treat  the  units  as  if  they  were  algebra  symbols.  For  instance, 
if  you  have  a  rectangle  with  an  area  of  6m2  and  a  width  of  2  nr, 
then  calculating  its  length  as  (6  nr2)/ (2  nr)  =  3  nr  gives  a  result 
that  makes  sense  both  numerically  and  in  terms  of  units.  This 
algebra-style  treatment  of  the  units  also  ensures  that  our  methods 
of  converting  units  work  out  correctly.  For  instance,  if  we  accept 
the  fraction 

100  cnr 
1  nr 

as  a  valid  way  of  writing  the  number  one,  then  one  times  one  equals 
one,  so  we  should  also  say  that  one  can  be  represented  by 

100  cnr  100  cnr 

- x - , 

1  nr  1  nr 

which  is  the  same  as 

10000  cnr2 
1  nr2 

That  means  the  conversion  factor  from  square  meters  to  square  cen¬ 
timeters  is  a  factor  of  104,  i.e.,  a  square  meter  has  104  square  cen¬ 
timeters  in  it. 

All  of  the  above  can  be  easily  applied  to  volume  as  well,  using 
one-cubic-centinreter  blocks  instead  of  squares  on  graph  paper. 

To  many  people,  it  seems  hard  to  believe  that  a  square  meter 
equals  10000  square  centimeters,  or  that  a  cubic  meter  equals  a 
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million  cubic  centimeters  —  they  think  it  would  make  more  sense  if 
there  were  100  cm2  in  1  m2,  and  100  cm3  in  1  m3,  but  that  would  be 
incorrect.  The  examples  shown  in  figure  b  aim  to  make  the  correct 
answer  more  believable,  using  the  traditional  U.S.  units  of  feet  and 
yards.  (One  foot  is  12  inches,  and  one  yard  is  three  feet.) 


b  /  Visualizing  conversions  of 
area  and  volume  using  traditional 
U.S.  units. 

1  ft  1  yd  =  3  ft 


self-check  H 

Based  on  figure  b,  convince  yourself  that  there  are  9  ft2  in  a  square  yard, 
and  27  ft3  in  a  cubic  yard,  then  demonstrate  the  same  thing  symbolically 
(i.e.,  with  the  method  using  fractions  that  equal  one).  >  Answer,  p. 
961 

>  Solved  problem:  converting  mm2  to  cm2  page  52,  problem  31 

>  Solved  problem:  scaling  a  liter  page  53,  problem  40 

Discussion  Question 

A  How  many  square  centimeters  are  there  in  a  square  inch?  (1  inch  = 
2.54  cm)  First  find  an  approximate  answer  by  making  a  drawing,  then  de¬ 
rive  the  conversion  factor  more  accurately  using  the  symbolic  method. 

0.2.2  Scaling  of  area  and  volume 

Great  fleas  have  lesser  fleas 
Upon  their  backs  to  bite  ’em. 

And  lesser  fleas  have  lesser  still, 

And  so  ad  infinitum. 

Jonathan  Swift 

Now  how  do  these  conversions  of  area  and  volume  relate  to  the 
questions  I  posed  about  sizes  of  living  things?  Well,  imagine  that 
you  are  shrunk  like  Alice  in  Wonderland  to  the  size  of  an  insect. 
One  way  of  thinking  about  the  change  of  scale  is  that  what  used 
to  look  like  a  centimeter  now  looks  like  perhaps  a  meter  to  you, 
because  you’re  so  much  smaller.  If  area  and  volume  scaled  according 
to  most  people’s  intuitive,  incorrect  expectations,  with  1  m2  being 
the  same  as  100  cm2,  then  there  would  be  no  particular  reason 
why  nature  should  behave  any  differently  on  your  new,  reduced 
scale.  But  nature  does  behave  differently  now  that  you’re  small. 
For  instance,  you  will  find  that  you  can  walk  on  water,  and  jump 
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to  many  times  your  own  height.  The  physicist  Galileo  Galilei  had 
the  basic  insight  that  the  scaling  of  area  and  volume  determines 
how  natural  phenomena  behave  differently  on  different  scales.  He 
first  reasoned  about  mechanical  structures,  but  later  extended  his 
insights  to  living  things,  taking  the  then-radical  point  of  view  that  at 
the  fundamental  level,  a  living  organism  should  follow  the  same  laws 
of  nature  as  a  machine.  We  will  follow  his  lead  by  first  discussing 
machines  and  then  living  things. 

Galileo  on  the  behavior  of  nature  on  large  and  small  scales 

One  of  the  world’s  most  famous  pieces  of  scientific  writing  is 
Galileo’s  Dialogues  Concerning  the  Two  New  Sciences.  Galileo  was 
an  entertaining  writer  who  wanted  to  explain  things  clearly  to  laypeo- 
ple,  and  he  livened  up  his  work  by  casting  it  in  the  form  of  a  dialogue 
among  three  people.  Salviati  is  really  Galileo’s  alter  ego.  Simplicio 
is  the  stupid  character,  and  one  of  the  reasons  Galileo  got  in  trouble 
with  the  Church  was  that  there  were  rumors  that  Simplicio  repre¬ 
sented  the  Pope.  Sagredo  is  the  earnest  and  intelligent  student,  with 
whom  the  reader  is  supposed  to  identify.  (The  following  excerpts 
are  from  the  1914  translation  by  Crew  and  de  Salvio.) 

Sagredo:  Yes,  that  is  what  I  mean;  and  I  refer  especially  to 
his  last  assertion  which  I  have  always  regarded  as  false. . . ; 
namely,  that  in  speaking  of  these  and  other  similar  machines 
one  cannot  argue  from  the  small  to  the  large,  because  many 
devices  which  succeed  on  a  small  scale  do  not  work  on  a 
large  scale.  Now,  since  mechanics  has  its  foundations  in  ge¬ 
ometry,  where  mere  size  [  is  unimportant],  I  do  not  see  that 
the  properties  of  circles,  triangles,  cylinders,  cones  and  other 
solid  figures  will  change  with  their  size.  If,  therefore,  a  large 
machine  be  constructed  in  such  a  way  that  its  parts  bear  to 
one  another  the  same  ratio  as  in  a  smaller  one,  and  if  the 
smaller  is  sufficiently  strong  for  the  purpose  for  which  it  is 
designed,  I  do  not  see  why  the  larger  should  not  be  able  to 
withstand  any  severe  and  destructive  tests  to  which  it  may  be 
subjected. 

Salviati  contradicts  Sagredo: 

Salviati:  . . .  Please  observe,  gentlemen,  how  facts  which 
at  first  seem  improbable  will,  even  on  scant  explanation,  drop 
the  cloak  which  has  hidden  them  and  stand  forth  in  naked  and 
simple  beauty.  Who  does  not  know  that  a  horse  falling  from  a 
height  of  three  or  four  cubits  will  break  his  bones,  while  a  dog 
falling  from  the  same  height  or  a  cat  from  a  height  of  eight 
or  ten  cubits  will  suffer  no  injury?  Equally  harmless  would  be 
the  fall  of  a  grasshopper  from  a  tower  or  the  fall  of  an  ant  from 
the  distance  of  the  moon. 

The  point  Galileo  is  making  here  is  that  small  things  are  sturdier 


c  /  Galileo  Galilei  (1 564-1 642). 


d/The  small  boat  holds  up 
just  fine. 


e/A  larger  boat  built  with 
the  same  proportions  as  the 
small  one  will  collapse  under  its 
own  weight. 


f  /  A  boat  this  large  needs  to 
have  timbers  that  are  thicker 
compared  to  its  size. 
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in  proportion  to  their  size.  There  are  a  lot  of  objections  that  could  be 
raised,  however.  After  all,  what  does  it  really  mean  for  something  to 
be  “strong” ,  to  be  “strong  in  proportion  to  its  size,”  or  to  be  strong 
“out  of  proportion  to  its  size?”  Galileo  hasn’t  given  operational 
definitions  of  things  like  “strength,”  i.e.,  definitions  that  spell  out 
how  to  measure  them  numerically. 

Also,  a  cat  is  shaped  differently  from  a  horse  —  an  enlarged 
photograph  of  a  cat  would  not  be  mistaken  for  a  horse,  even  if  the 
photo-doctoring  experts  at  the  National  Inquirer  made  it  look  like  a 
person  was  riding  on  its  back.  A  grasshopper  is  not  even  a  mammal, 
and  it  has  an  exoskeleton  instead  of  an  internal  skeleton.  The  whole 
argument  would  be  a  lot  more  convincing  if  we  could  do  some  iso¬ 
lation  of  variables,  a  scientific  term  that  means  to  change  only  one 
thing  at  a  time,  isolating  it  from  the  other  variables  that  might  have 
an  effect.  If  size  is  the  variable  whose  effect  we’re  interested  in  see¬ 
ing,  then  we  don’t  really  want  to  compare  things  that  are  different 
in  size  but  also  different  in  other  ways. 

Salviati:  ...  we  asked  the  reason  why  [shipbuilders]  em¬ 

ployed  stocks,  scaffolding,  and  bracing  of  larger  dimensions 
for  launching  a  big  vessel  than  they  do  for  a  small  one;  and 
[an  old  man]  answered  that  they  did  this  in  order  to  avoid  the 
danger  of  the  ship  parting  under  its  own  heavy  weight,  a  dan¬ 
ger  to  which  small  boats  are  not  subject? 

After  this  entertaining  but  not  scientifically  rigorous  beginning, 
Galileo  starts  to  do  something  worthwhile  by  modern  standards. 
He  simplifies  everything  by  considering  the  strength  of  a  wooden 
plank.  The  variables  involved  can  then  be  narrowed  down  to  the 
type  of  wood,  the  width,  the  thickness,  and  the  length.  He  also 
gives  an  operational  definition  of  what  it  means  for  the  plank  to 
have  a  certain  strength  “in  proportion  to  its  size,”  by  introducing 
the  concept  of  a  plank  that  is  the  longest  one  that  would  not  snap 
under  its  own  weight  if  supported  at  one  end.  If  you  increased 
its  length  by  the  slightest  amount,  without  increasing  its  width  or 
thickness,  it  would  break.  He  says  that  if  one  plank  is  the  same 
shape  as  another  but  a  different  size,  appearing  like  a  reduced  or 
enlarged  photograph  of  the  other,  then  the  planks  would  be  strong 
“in  proportion  to  their  sizes”  if  both  were  just  barely  able  to  support 
their  own  weight. 


38 


Chapter  0 


Introduction  and  Review 


2 


g/1.  This  plank  is  as  long  as  it  can  be  without  collapsing  under 
its  own  weight.  If  it  was  a  hundredth  of  an  inch  longer,  it  would  collapse. 
2.  This  plank  is  made  out  of  the  same  kind  of  wood.  It  is  twice  as  thick, 
twice  as  long,  and  twice  as  wide.  It  will  collapse  under  its  own  weight. 


Also,  Galileo  is  doing  something  that  would  be  frowned  on  in 
modern  science:  he  is  mixing  experiments  whose  results  he  has  ac¬ 
tually  observed  (building  boats  of  different  sizes),  with  experiments 
that  he  could  not  possibly  have  done  (dropping  an  ant  from  the 
height  of  the  moon).  He  now  relates  how  he  has  done  actual  ex¬ 
periments  with  such  planks,  and  found  that,  according  to  this  op¬ 
erational  definition,  they  are  not  strong  in  proportion  to  their  sizes. 
The  larger  one  breaks.  He  makes  sure  to  tell  the  reader  how  impor¬ 
tant  the  result  is,  via  Sagredo’s  astonished  response: 

Sagredo:  My  brain  already  reels.  My  mind,  like  a  cloud 

momentarily  illuminated  by  a  lightning  flash,  is  for  an  instant 
filled  with  an  unusual  light,  which  now  beckons  to  me  and 
which  now  suddenly  mingles  and  obscures  strange,  crude 
ideas.  From  what  you  have  said  it  appears  to  me  impossible 
to  build  two  similar  structures  of  the  same  material,  but  of 
different  sizes  and  have  them  proportionately  strong. 

In  other  words,  this  specific  experiment,  using  things  like  wooden 
planks  that  have  no  intrinsic  scientific  interest,  has  very  wide  impli¬ 
cations  because  it  points  out  a  general  principle,  that  nature  acts 
differently  on  different  scales. 

To  finish  the  discussion,  Galileo  gives  an  explanation.  He  says 
that  the  strength  of  a  plank  (defined  as,  say,  the  weight  of  the  heav¬ 
iest  boulder  you  could  put  on  the  end  without  breaking  it)  is  pro¬ 
portional  to  its  cross-sectional  area,  that  is,  the  surface  area  of  the 
fresh  wood  that  would  be  exposed  if  you  sawed  through  it  in  the 
middle.  Its  weight,  however,  is  proportional  to  its  volume.2 

How  do  the  volume  and  cross-sectional  area  of  the  longer  plank 
compare  with  those  of  the  shorter  plank?  We  have  already  seen, 


2Galileo  makes  a  slightly  more  complicated  argument,  taking  into  account 
the  effect  of  leverage  (torque).  The  result  I’m  referring  to  comes  out  the  same 
regardless  of  this  effect. 


h  /  Galileo  discusses  planks 
made  of  wood,  but  the  concept 
may  be  easier  to  imagine  with 
clay.  All  three  clay  rods  in  the 
figure  were  originally  the  same 
shape.  The  medium-sized  one 
was  twice  the  height,  twice  the 
length,  and  twice  the  width  of 
the  small  one,  and  similarly  the 
large  one  was  twice  as  big  as 
the  medium  one  in  all  its  linear 
dimensions.  The  big  one  has 
four  times  the  linear  dimensions 
of  the  small  one,  16  times  the 
cross-sectional  area  when  cut 
perpendicular  to  the  page,  and 
64  times  the  volume.  That  means 
that  the  big  one  has  64  times  the 
weight  to  support,  but  only  16 
times  the  strength  compared  to 
the  smallest  one. 
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i  /  The  area  of  a  shape  is 
proportional  to  the  square  of  its 
linear  dimensions,  even  if  the 
shape  is  irregular. 


while  discussing  conversions  of  the  units  of  area  and  volume,  that 
these  quantities  don’t  act  the  way  most  people  naively  expect.  You 
might  think  that  the  volume  and  area  of  the  longer  plank  would  both 
be  doubled  compared  to  the  shorter  plank,  so  they  would  increase 
in  proportion  to  each  other,  and  the  longer  plank  would  be  equally 
able  to  support  its  weight.  You  would  be  wrong,  but  Galileo  knows 
that  this  is  a  common  misconception,  so  he  has  Salviati  address  the 
point  specifically: 

Salviati:  ...Take,  for  example,  a  cube  two  inches  on  a 

side  so  that  each  face  has  an  area  of  four  square  inches 
and  the  total  area,  i.e. ,  the  sum  of  the  six  faces,  amounts 
to  twenty-four  square  inches;  now  imagine  this  cube  to  be 
sawed  through  three  times  [with  cuts  in  three  perpendicular 
planes]  so  as  to  divide  it  into  eight  smaller  cubes,  each  one 
inch  on  the  side,  each  face  one  inch  square,  and  the  total 
surface  of  each  cube  six  square  inches  instead  of  twenty- 
four  in  the  case  of  the  larger  cube.  It  is  evident  therefore, 
that  the  surface  of  the  little  cube  is  only  one-fourth  that  of 
the  larger,  namely,  the  ratio  of  six  to  twenty-four;  but  the  vol¬ 
ume  of  the  solid  cube  itself  is  only  one-eighth;  the  volume, 
and  hence  also  the  weight,  diminishes  therefore  much  more 
rapidly  than  the  surface. . .  You  see,  therefore,  Simplicio,  that 
I  was  not  mistaken  when  ...  I  said  that  the  surface  of  a  small 
solid  is  comparatively  greater  than  that  of  a  large  one. 

The  same  reasoning  applies  to  the  planks.  Even  though  they 
are  not  cubes,  the  large  one  could  be  sawed  into  eight  small  ones, 
each  with  half  the  length,  half  the  thickness,  and  half  the  width. 
The  small  plank,  therefore,  has  more  surface  area  in  proportion  to 
its  weight,  and  is  therefore  able  to  support  its  own  weight  while  the 
large  one  breaks. 

Scaling  of  area  and  volume  for  irregularly  shaped  objects 

You  probably  are  not  going  to  believe  Galileo’s  claim  that  this 
has  deep  implications  for  all  of  nature  unless  you  can  be  convinced 
that  the  same  is  true  for  any  shape.  Every  drawing  you’ve  seen  so 
far  has  been  of  squares,  rectangles,  and  rectangular  solids.  Clearly 
the  reasoning  about  sawing  things  up  into  smaller  pieces  would  not 
prove  anything  about,  say,  an  egg,  which  cannot  be  cut  up  into  eight 
smaller  egg-shaped  objects  with  half  the  length. 

Is  it  always  true  that  something  half  the  size  has  one  quarter 
the  surface  area  and  one  eighth  the  volume,  even  if  it  has  an  irreg¬ 
ular  shape?  Take  the  example  of  a  child’s  violin.  Violins  are  made 
for  small  children  in  smaller  size  to  accomodate  their  small  bodies. 
Figure  i  shows  a  full-size  violin,  along  with  two  violins  made  with 
half  and  3/4  of  the  normal  length.3  Let’s  study  the  surface  area  of 


3The  customary  terms  “half-size”  and  “3/4-size”  actually  don’t  describe  the 
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the  front  panels  of  the  three  violins. 

Consider  the  square  in  the  interior  of  the  panel  of  the  full-size 
violin.  In  the  3/4-size  violin,  its  height  and  width  are  both  smaller 
by  a  factor  of  3/4,  so  the  area  of  the  corresponding,  smaller  square 
becomes  3/4x3/4  =  9/16  of  the  original  area,  not  3/4  of  the  original 
area.  Similarly,  the  corresponding  square  on  the  smallest  violin  has 
half  the  height  and  half  the  width  of  the  original  one,  so  its  area  is 
1/4  the  original  area,  not  half. 

The  same  reasoning  works  for  parts  of  the  panel  near  the  edge, 
such  as  the  part  that  only  partially  fills  in  the  other  square.  The 
entire  square  scales  down  the  same  as  a  square  in  the  interior,  and 
in  each  violin  the  same  fraction  (about  70%)  of  the  square  is  full,  so 
the  contribution  of  this  part  to  the  total  area  scales  down  just  the 
same. 

Since  any  small  square  region  or  any  small  region  covering  part 
of  a  square  scales  down  like  a  square  object,  the  entire  surface  area 
of  an  irregularly  shaped  object  changes  in  the  same  manner  as  the 
surface  area  of  a  square:  scaling  it  down  by  3/4  reduces  the  area  by 
a  factor  of  9/16,  and  so  on. 

In  general,  we  can  see  that  any  time  there  are  two  objects  with 
the  same  shape,  but  different  linear  dimensions  (i.e. ,  one  looks  like  a 
reduced  photo  of  the  other) ,  the  ratio  of  their  areas  equals  the  ratio 
of  the  squares  of  their  linear  dimensions: 

M  =  (LA2 
A-2  \  L'2  ) 

Note  that  it  doesn’t  matter  where  we  choose  to  measure  the  linear 
size,  L,  of  an  object.  In  the  case  of  the  violins,  for  instance,  it  could 
have  been  measured  vertically,  horizontally,  diagonally,  or  even  from 
the  bottom  of  the  left  f-hole  to  the  middle  of  the  right  f-hole.  We 
just  have  to  measure  it  in  a  consistent  way  on  each  violin.  Since  all 
the  parts  are  assumed  to  shrink  or  expand  in  the  same  manner,  the 
ratio  L\/L2  is  independent  of  the  choice  of  measurement. 

It  is  also  important  to  realize  that  it  is  completely  unnecessary 
to  have  a  formula  for  the  area  of  a  violin.  It  is  only  possible  to 
derive  simple  formulas  for  the  areas  of  certain  shapes  like  circles, 
rectangles,  triangles  and  so  on,  but  that  is  no  impediment  to  the 
type  of  reasoning  we  are  using. 

Sometimes  it  is  inconvenient  to  write  all  the  equations  in  terms 
of  ratios,  especially  when  more  than  two  objects  are  being  compared. 
A  more  compact  way  of  rewriting  the  previous  equation  is 

A  oc  L2. 

sizes  in  any  accurate  way.  They’re  really  just  standard,  arbitrary  marketing 
labels. 


j/The  muffin  comes  out  of 
the  oven  too  hot  to  eat.  Breaking 
it  up  into  four  pieces  increases 
its  surface  area  while  keeping 
the  total  volume  the  same.  It 
cools  faster  because  of  the 
greater  surface-to-volume  ratio. 
In  general,  smaller  things  have 
greater  surface-to-volume  ratios, 
but  in  this  example  there  is  no 
easy  way  to  compute  the  effect 
exactly,  because  the  small  pieces 
aren’t  the  same  shape  as  the 
original  muffin. 
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The  symbol  “oc”  means  “is  proportional  to.”  Scientists  and  engi¬ 
neers  often  speak  about  such  relationships  verbally  using  the  phrases 
“scales  like”  or  “goes  like,”  for  instance  “area  goes  like  length  squared.” 

All  of  the  above  reasoning  works  just  as  well  in  the  case  of  vol¬ 
ume.  Volume  goes  like  length  cubed: 

V  oc  L3. 


self-check  I 

When  a  car  or  truck  travels  over  a  road,  there  is  wear  and  tear  on  the 
road  surface,  which  incurs  a  cost.  Studies  show  that  the  cost  C  per  kilo¬ 
meter  of  travel  is  related  to  the  weight  per  axle  w  by  C  oc  w4.  Translate 
this  into  a  statement  about  ratios.  >  Answer,  p.  961 


If  different  objects  are  made  of  the  same  material  with  the  same 
density,  p  =  m/V,  then  their  masses,  m  =  pV,  are  proportional  to 
L3.  (The  symbol  for  density  is  p,  the  lower-case  Greek  letter  “rho.”) 


k/ Example  5.  The  big  trian¬ 
gle  has  four  times  more  area  than 
the  little  one. 


I  /  A  tricky  way  of  solving  ex¬ 
ample  5,  explained  in  solution  #2. 


An  important  point  is  that  all  of  the  above  reasoning  about 
scaling  only  applies  to  objects  that  are  the  same  shape.  For  instance, 
a  piece  of  paper  is  larger  than  a  pencil,  but  has  a  much  greater 
surface-to- volume  ratio. 

Scaling  of  the  area  of  a  triangle  example  5 

t>  In  figure  k,  the  larger  triangle  has  sides  twice  as  long.  How 
many  times  greater  is  its  area? 

Correct  solution  #1 :  Area  scales  in  proportion  to  the  square  of  the 
linear  dimensions,  so  the  larger  triangle  has  four  times  more  area 
(22  =  4). 

Correct  solution  #2:  You  could  cut  the  larger  triangle  into  four  of 
the  smaller  size,  as  shown  in  fig.  (b),  so  its  area  is  four  times 
greater.  (This  solution  is  correct,  but  it  would  not  work  for  a  shape 
like  a  circle,  which  can’t  be  cut  up  into  smaller  circles.) 

Correct  solution  #3:  The  area  of  a  triangle  is  given  by 

A  =  bh/2,  where  b  is  the  base  and  h  is  the  height.  The  areas  of 
the  triangles  are 

At  =  bi  ^  /2 

A2  =  b2h2/2 
=  (2di)(2f7i)/2 

=  2  bi  hi 

A2/Ai=(2bihi)/(bihi/2) 

=  4 


(Although  this  solution  is  correct,  it  is  a  lot  more  work  than  solution 
#1 ,  and  it  can  only  be  used  in  this  case  because  a  triangle  is  a 
simple  geometric  shape,  and  we  happen  to  know  a  formula  for  its 
area.) 
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Correct  solution  #4:  The  area  of  a  triangle  is  A  =  bh/2.  The 
comparison  of  the  areas  will  come  out  the  same  as  long  as  the 
ratios  of  the  linear  sizes  of  the  triangles  is  as  specified,  so  let’s 
just  say  b-\  =  1 .00  m  and  b2  =  2.00  m.  The  heights  are  then  also 
h-\  =  1.00  m  and  h2  =  2.00  m,  giving  areas  Ai  =  0.50  m2  and 
A2  =  2.00  m2,  so  A2/Ai  =  4.00. 

(The  solution  is  correct,  but  it  wouldn’t  work  with  a  shape  for 
whose  area  we  don’t  have  a  formula.  Also,  the  numerical  cal¬ 
culation  might  make  the  answer  of  4.00  appear  inexact,  whereas 
solution  #1  makes  it  clear  that  it  is  exactly  4.) 

Incorrect  solution:  The  area  of  a  triangle  is  A  =  bh/2,  and  if  you 
plug  in  b  =  2.00  m  and  h  =  2.00  m,  you  get  A  =  2.00  m2,  so 
the  bigger  triangle  has  2.00  times  more  area.  (This  solution  is 
incorrect  because  no  comparison  has  been  made  with  the  smaller 
triangle.) 

Scaling  of  the  volume  of  a  sphere  example  6 

>  In  figure  m,  the  larger  sphere  has  a  radius  that  is  five  times 
greater.  How  many  times  greater  is  its  volume? 

Correct  solution  #1 :  Volume  scales  like  the  third  power  of  the 
linear  size,  so  the  larger  sphere  has  a  volume  that  is  125  times 
greater  (53  =  125). 

Correct  solution  #2:  The  volume  of  a  sphere  is  V  =  (4/3)nr3,  so 


Incorrect  solution:  The  volume  of  a  sphere  is  V  =  (4/3)7tr3,  so 


(The  solution  is  incorrect  because  (5ri)3  is  not  the  same  as  5rf.) 


O 

m  /  Example  6.  The  big  sphere 
has  125  times  more  volume  than 
the  little  one. 


n  /  Example  7.  The  48-point 
“S”  has  1.78  times  more  area 
than  the  36-point  “S.” 
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Scaling  of  a  more  complex  shape  example  7 

>  The  first  letter  “S”  in  figure  n  is  in  a  36-point  font,  the  second  in 
48-point.  How  many  times  more  ink  is  required  to  make  the  larger 
“S”?  (Points  are  a  unit  of  length  used  in  typography.) 

Correct  solution:  The  amount  of  ink  depends  on  the  area  to  be 
covered  with  ink,  and  area  is  proportional  to  the  square  of  the 
linear  dimensions,  so  the  amount  of  ink  required  for  the  second 
“S”  is  greater  by  a  factor  of  (48/36)2  =  1 .78. 

Incorrect  solution:  The  length  of  the  curve  of  the  second  “S”  is 
longer  by  a  factor  of  48/36  =  1.33,  so  1.33  times  more  ink  is 
required. 

(The  solution  is  wrong  because  it  assumes  incorrectly  that  the 
width  of  the  curve  is  the  same  in  both  cases.  Actually  both  the 
width  and  the  length  of  the  curve  are  greater  by  a  factor  of  48/36, 
so  the  area  is  greater  by  a  factor  of  (48/36)2  =  1 .78.) 

Reasoning  about  ratios  and  proportionalities  is  one  of  the  three 
essential  mathematical  skills,  summarized  on  pp. 945-947,  that  you 
need  for  success  in  this  course. 

>  Solved  problem:  a  telescope  gathers  light  page  52,  problem  32 
i>  Solved  problem:  distance  from  an  earthquake  page  52,  problem  33 

Discussion  Questions 

A  A  toy  fire  engine  is  1/30  the  size  of  the  real  one,  but  is  constructed 
from  the  same  metal  with  the  same  proportions.  How  many  times  smaller 
is  its  weight?  How  many  times  less  red  paint  would  be  needed  to  paint 
it? 

B  Galileo  spends  a  lot  of  time  in  his  dialog  discussing  what  really 
happens  when  things  break.  He  discusses  everything  in  terms  of  Aristo¬ 
tle’s  now-discredited  explanation  that  things  are  hard  to  break,  because 
if  something  breaks,  there  has  to  be  a  gap  between  the  two  halves  with 
nothing  in  between,  at  least  initially.  Nature,  according  to  Aristotle,  “ab¬ 
hors  a  vacuum,”  i.e.,  nature  doesn’t  “like”  empty  space  to  exist.  Of  course, 
air  will  rush  into  the  gap  immediately,  but  at  the  very  moment  of  breaking, 
Aristotle  imagined  a  vacuum  in  the  gap.  Is  Aristotle’s  explanation  of  why 
it  is  hard  to  break  things  an  experimentally  testable  statement?  If  so,  how 
could  it  be  tested  experimentally? 

0.2.3  Order-of-magnitude  estimates 

It  is  the  mark  of  an  instructed  mind  to  rest  satisfied  with  the  degree 
of  precision  that  the  nature  of  the  subject  permits  and  not  to  seek 
an  exactness  where  only  an  approximation  of  the  truth  is  possible. 

Aristotle 

It  is  a  common  misconception  that  science  must  be  exact.  For 
instance,  in  the  Star  Trek  TV  series,  it  would  often  happen  that 
Captain  Kirk  would  ask  Mr.  Spock,  “Spock,  we’re  in  a  pretty  bad 
situation.  What  do  you  think  are  our  chances  of  getting  out  of 
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here?”  The  scientific  Mr.  Spock  would  answer  with  something  like, 
“Captain,  I  estimate  the  odds  as  237.345  to  one.”  In  reality,  he 
could  not  have  estimated  the  odds  with  six  significant  figures  of 
accuracy,  but  nevertheless  one  of  the  hallmarks  of  a  person  with  a 
good  education  in  science  is  the  ability  to  make  estimates  that  are 
likely  to  be  at  least  somewhere  in  the  right  ballpark.  In  many  such 
situations,  it  is  often  only  necessary  to  get  an  answer  that  is  off  by  no 
more  than  a  factor  of  ten  in  either  direction.  Since  things  that  differ 
by  a  factor  of  ten  are  said  to  differ  by  one  order  of  magnitude,  such 
an  estimate  is  called  an  order-of-magnitude  estimate.  The  tilde, 
~,  is  used  to  indicate  that  things  are  only  of  the  same  order  of 
magnitude,  but  not  exactly  equal,  as  in 

odds  of  survival  ~  100  to  one. 

The  tilde  can  also  be  used  in  front  of  an  individual  number  to  em¬ 
phasize  that  the  number  is  only  of  the  right  order  of  magnitude. 

Although  making  order-of-magnitude  estimates  seems  simple  and 
natural  to  experienced  scientists,  it’s  a  mode  of  reasoning  that  is 
completely  unfamiliar  to  most  college  students.  Some  of  the  typical 
mental  steps  can  be  illustrated  in  the  following  example. 

Cost  of  transporting  tomatoes  (incorrect  solution)  example  8 
>  Roughly  what  percentage  of  the  price  of  a  tomato  comes  from 
the  cost  of  transporting  it  in  a  truck? 

o  The  following  incorrect  solution  illustrates  one  of  the  main  ways 
you  can  go  wrong  in  order-of-magnitude  estimates. 

Incorrect  solution:  Let’s  say  the  trucker  needs  to  make  a  $400 
profit  on  the  trip.  Taking  into  account  her  benefits,  the  cost  of  gas, 
and  maintenance  and  payments  on  the  truck,  let’s  say  the  total 
cost  is  more  like  $2000.  I’d  guess  about  5000  tomatoes  would  fit 
in  the  back  of  the  truck,  so  the  extra  cost  per  tomato  is  40  cents. 
That  means  the  cost  of  transporting  one  tomato  is  comparable  to 
the  cost  of  the  tomato  itself.  Transportation  really  adds  a  lot  to  the 
cost  of  produce,  I  guess. 

The  problem  is  that  the  human  brain  is  not  very  good  at  esti¬ 
mating  area  or  volume,  so  it  turns  out  the  estimate  of  5000  tomatoes 
fitting  in  the  truck  is  way  off.  That’s  why  people  have  a  hard  time 
at  those  contests  where  you  are  supposed  to  estimate  the  number  of 
jellybeans  in  a  big  jar.  Another  example  is  that  most  people  think 
their  families  use  about  10  gallons  of  water  per  day,  but  in  reality 
the  average  is  about  300  gallons  per  day.  When  estimating  area 
or  volume,  you  are  much  better  off  estimating  linear  dimensions, 
and  computing  volume  from  the  linear  dimensions.  Here’s  a  better 
solution  to  the  problem  about  the  tomato  truck: 

Cost  of  transporting  tomatoes  (correct  solution)  example  9 
As  in  the  previous  solution,  say  the  cost  of  the  trip  is  $2000.  The 


o  /  Can  you  guess  how  many 
jelly  beans  are  in  the  jar?  If  you 
try  to  guess  directly,  you  will 
almost  certainly  underestimate. 
The  right  way  to  do  it  is  to  esti¬ 
mate  the  linear  dimensions,  then 
get  the  volume  indirectly.  See 
problem  44,  p.  54. 


k 


1  m 


>\ 


p  /  Consider  a  spherical  cow. 
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dimensions  of  the  bin  are  probably  4  m  x  2  m  x  1  m,  for  a  vol¬ 
ume  of  8  m3.  Since  the  whole  thing  is  just  an  order-of-magnitude 
estimate,  let’s  round  that  off  to  the  nearest  power  of  ten,  10  m3. 
The  shape  of  a  tomato  is  complicated,  and  I  don’t  know  any  for¬ 
mula  for  the  volume  of  a  tomato  shape,  but  since  this  is  just  an 
estimate,  let’s  pretend  that  a  tomato  is  a  cube,  0.05  m  x  0.05  m  x 
0.05  m,  for  a  volume  of  1 .25  x  1 0-4  m3.  Since  this  is  just  a  rough 
estimate,  let’s  round  that  to  10~4m3.  We  can  find  the  total  num¬ 
ber  of  tomatoes  by  dividing  the  volume  of  the  bin  by  the  volume 
of  one  tomato:  10  m3/10~4  m3  =  105  tomatoes.  The  transporta¬ 
tion  cost  per  tomato  is  $2000/1 05  tomatoes=$0. 02/tomato.  That 
means  that  transportation  really  doesn’t  contribute  very  much  to 
the  cost  of  a  tomato. 

Approximating  the  shape  of  a  tomato  as  a  cube  is  an  example  of 
another  general  strategy  for  making  order-of-magnitude  estimates. 
A  similar  situation  would  occur  if  you  were  trying  to  estimate  how 
many  m2  of  leather  could  be  produced  from  a  herd  of  ten  thousand 
cattle.  There  is  no  point  in  trying  to  take  into  account  the  shape  of 
the  cows’  bodies.  A  reasonable  plan  of  attack  might  be  to  consider 
a  spherical  cow.  Probably  a  cow  has  roughly  the  same  surface  area 
as  a  sphere  with  a  radius  of  about  1  m,  which  would  be  4vr(l  m)2. 
Using  the  well-known  facts  that  pi  equals  three,  and  four  times  three 
equals  about  ten,  we  can  guess  that  a  cow  has  a  surface  area  of  about 
10  m2,  so  the  herd  as  a  whole  might  yield  105  m2  of  leather. 

Estimating  mass  indirectly  example  1 0 

Usually  the  best  way  to  estimate  mass  is  to  estimate  linear  di¬ 
mensions,  then  use  those  to  infer  volume,  and  then  get  the  mass 
based  on  the  volume.  For  example,  Amphicoelias,  shown  in  the 
figure,  may  have  been  the  largest  land  animal  ever  to  live.  Fossils 
tell  us  the  linear  dimensions  of  an  animal,  but  we  can  only  indi¬ 
rectly  guess  its  mass.  Given  the  length  scale  in  the  figure,  let’s 
estimate  the  mass  of  an  Amphicoelias. 

Its  torso  looks  like  it  can  be  approximated  by  a  rectangular  box 
with  dimensions  10mx5mx3m,  giving  about  2  x  1 02  m3.  Living 
things  are  mostly  made  of  water,  so  we  assume  the  animal  to 
have  the  density  of  water,  1  g/cm3,  which  converts  to  1 03  kg/m3. 
This  gives  a  mass  of  about  2  x  105  kg,  or  200  metric  tons. 


The  following  list  summarizes  the  strategies  for  getting  a  good 
order-of-magnitude  estimate. 


46 


Chapter  0 


Introduction  and  Review 


1.  Don’t  even  attempt  more  than  one  significant  figure  of  preci¬ 
sion. 

2.  Don’t  guess  area,  volume,  or  mass  directly.  Guess  linear  di¬ 
mensions  and  get  area,  volume,  or  mass  from  them. 

3.  When  dealing  with  areas  or  volumes  of  objects  with  complex 
shapes,  idealize  them  as  if  they  were  some  simpler  shape,  a 
cube  or  a  sphere,  for  example. 

4.  Check  your  final  answer  to  see  if  it  is  reasonable.  If  you  esti¬ 
mate  that  a  herd  of  ten  thousand  cattle  would  yield  0.01  m2 
of  leather,  then  you  have  probably  made  a  mistake  with  con¬ 
version  factors  somewhere. 
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The  symbols  C,  ^ ,  etc.  are  explained  on  page  54. 

1  Correct  use  of  a  calculator:  (a)  Calculate  539^2+97554  011  a  ca^" 
culator.  [Self-check:  The  most  common  mistake  results  in  97555.40.] 

V 

(b)  Which  would  be  more  like  the  price  of  a  TV,  and  which  would 
be  more  like  the  price  of  a  house,  $3.5  x  105  or  $3.55?  - 

2  Compute  the  following  things.  If  they  don’t  make  sense  be¬ 
cause  of  units,  say  so. 

(a)  3  cm  +  5  cm 

(b)  1.11  m  +  22  cm 

(c)  120  miles  +  2.0  hours 

(d)  120  miles  /  2.0  hours 

■ 

3  Your  backyard  has  brick  walls  on  both  ends.  You  measure  a 

distance  of  23.4  m  from  the  inside  of  one  wall  to  the  inside  of  the 
other.  Each  wall  is  29.4  cm  thick.  How  far  is  it  from  the  outside 
of  one  wall  to  the  outside  of  the  other?  Pay  attention  to  significant 
figures.  - 

4  The  speed  of  light  is  3.0  x  108  m/s.  Convert  this  to  furlongs 

per  fortnight.  A  furlong  is  220  yards,  and  a  fortnight  is  14  days.  An 
inch  is  2.54  cm.  V 

5  Express  each  of  the  following  quantities  in  micrograms: 

(a)  10  mg,  (b)  104  g,  (c)  10  kg,  (d)  100  x  103  g,  (e)  1000  ng.  V 

■ 

6  Convert  134  mg  to  units  of  kg,  writing  your  answer  in  scientific 

notation.  >  Solution,  p.  970  - 

7  In  the  last  century,  the  average  age  of  the  onset  of  puberty  for 
girls  has  decreased  by  several  years.  Urban  folklore  has  it  that  this 
is  because  of  hormones  fed  to  beef  cattle,  but  it  is  more  likely  to  be 
because  modern  girls  have  more  body  fat  on  the  average  and  pos¬ 
sibly  because  of  estrogen-mimicking  chemicals  in  the  environment 
from  the  breakdown  of  pesticides.  A  hamburger  from  a  hormone- 
implanted  steer  has  about  0.2  ng  of  estrogen  (about  double  the 
amount  of  natural  beef).  A  serving  of  peas  contains  about  300 
ng  of  estrogen.  An  adult  woman  produces  about  0.5  mg  of  estrogen 
per  day  (note  the  different  unit!),  (a)  How  many  hamburgers  would 
a  girl  have  to  eat  in  one  day  to  consume  as  much  estrogen  as  an 
adult  woman’s  daily  production?  (b)  How  many  servings  of  peas? 

V 
■ 

8  The  usual  definition  of  the  mean  (average)  of  two  numbers  a 
and  b  is  (a  +  b) /2.  This  is  called  the  arithmetic  mean.  The  geometric 
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mean,  however,  is  defined  as  ( ab )1//2  (i.e. ,  the  square  root  of  ab).  For 
the  sake  of  definiteness,  let’s  say  both  numbers  have  units  of  mass, 
(a)  Compute  the  arithmetic  mean  of  two  numbers  that  have  units 
of  grams.  Then  convert  the  numbers  to  units  of  kilograms  and 
recompute  their  mean.  Is  the  answer  consistent?  (b)  Do  the  same 
for  the  geometric  mean,  (c)  If  a  and  b  both  have  units  of  grams, 
what  should  we  call  the  units  of  ab ?  Does  your  answer  make  sense 
when  you  take  the  square  root?  (d)  Suppose  someone  proposes  to 
you  a  third  kind  of  mean,  called  the  superduper  mean,  defined  as 
(ab)1/3.  Is  this  reasonable?  >  Solution,  p.  971  - 

9  In  an  article  on  the  SARS  epidemic,  the  May  7,  2003  New 

York  Times  discusses  conflicting  estimates  of  the  disease’s  incuba¬ 
tion  period  (the  average  time  that  elapses  from  infection  to  the  first 
symptoms).  “The  study  estimated  it  to  be  6.4  days.  But  other  sta¬ 
tistical  calculations  ...  showed  that  the  incubation  period  could  be 
as  long  as  14.22  days.”  What’s  wrong  here?  - 

10  The  photo  shows  the  corner  of  a  bag  of  pretzels.  What’s 

wrong  here?  ■ 

11  The  distance  to  the  horizon  is  given  by  the  expression  y/2 rh, 

where  r  is  the  radius  of  the  Earth,  and  h  is  the  observer’s  height 
above  the  Earth’s  surface.  (This  can  be  proved  using  the  Pythagorean 
theorem.)  Show  that  the  units  of  this  expression  make  sense.  Don’t 
try  to  prove  the  result,  just  check  its  units.  (See  example  2  on  p. 
26  for  an  example  of  how  to  do  this.)  - 

12  (a)  Based  on  the  definitions  of  the  sine,  cosine,  and  tangent, 
what  units  must  they  have?  (b)  A  cute  formula  from  trigonometry 
lets  you  find  any  angle  of  a  triangle  if  you  know  the  lengths  of 
its  sides.  Using  the  notation  shown  in  the  figure,  and  letting  s  = 
(a  +  b  +  c)/ 2  be  half  the  perimeter,  we  have 


tan  A/2 


(s-b)(s-c) 
s(s  —  a) 


Show  that  the  units  of  this  equation  make  sense.  In  other  words, 
check  that  the  units  of  the  right-hand  side  are  the  same  as  your 
answer  to  part  a  of  the  question.  >  Solution,  p.  971  - 

13  A  physics  homework  question  asks,  “If  you  start  from  rest 
and  accelerate  at  1.54  m/s2  for  3.29  s,  how  far  do  you  travel  by  the 
end  of  that  time?”  A  student  answers  as  follows: 


1.54  x  3.29  =  5.07  m 

His  Aunt  Wanda  is  good  with  numbers,  but  has  never  taken  physics. 
She  doesn’t  know  the  formula  for  the  distance  traveled  under  con¬ 
stant  acceleration  over  a  given  amount  of  time,  but  she  tells  her 
nephew  his  answer  cannot  be  right.  How  does  she  know? 


NET  WT.  3  Vt  oi.  (99.2  g) 
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14  You  are  looking  into  a  deep  well.  It  is  dark,  and  you  cannot 

see  the  bottom.  You  want  to  find  out  how  deep  it  is,  so  you  drop 
a  rock  in,  and  you  hear  a  splash  3.0  seconds  later.  How  deep  is  the 
well?  V  3 

15  You  take  a  trip  in  your  spaceship  to  another  star.  Setting  off, 
you  increase  your  speed  at  a  constant  acceleration.  Once  you  get 
half-way  there,  you  start  decelerating,  at  the  same  rate,  so  that  by 
the  time  you  get  there,  you  have  slowed  down  to  zero  speed.  You  see 
the  tourist  attractions,  and  then  head  home  by  the  same  method. 

(a)  Find  a  formula  for  the  time,  T,  required  for  the  round  trip,  in 
terms  of  d ,  the  distance  from  our  sun  to  the  star,  and  a,  the  magni¬ 
tude  of  the  acceleration.  Note  that  the  acceleration  is  not  constant 
over  the  whole  trip,  but  the  trip  can  be  broken  up  into  constant- 
acceleration  parts. 

(b)  The  nearest  star  to  the  Earth  (other  than  our  own  sun)  is  Prox- 
ima  Centauri,  at  a  distance  of  d  =  4  x  1016  m.  Suppose  you  use  an 
acceleration  of  a  =  10  m/s2,  just  enough  to  compensate  for  the  lack 
of  true  gravity  and  make  you  feel  comfortable.  How  long  does  the 
round  trip  take,  in  years? 

(c)  Using  the  same  numbers  for  d  and  a,  find  your  maximum  speed. 

Compare  this  to  the  speed  of  light,  which  is  3.0  x  10s  m/s.  (Later 
in  this  course,  you  will  learn  that  there  are  some  new  things  going 
on  in  physics  when  one  gets  close  to  the  speed  of  light,  and  that  it 
is  impossible  to  exceed  the  speed  of  light.  For  now,  though,  just  use 
the  simpler  ideas  you’ve  learned  so  far.)  V  ^ 

16  You  climb  half-way  up  a  tree,  and  drop  a  rock.  Then  you 

climb  to  the  top,  and  drop  another  rock.  How  many  times  greater 
is  the  velocity  of  the  second  rock  on  impact?  Explain.  (The  answer 
is  not  two  times  greater.)  - 

17  If  the  acceleration  of  gravity  on  Mars  is  1/3  that  on  Earth, 

how  many  times  longer  does  it  take  for  a  rock  to  drop  the  same 
distance  on  Mars?  Ignore  air  resistance.  i>  Solution,  p.  971  - 

18  A  person  is  parachute  jumping.  During  the  time  between 
when  she  leaps  out  of  the  plane  and  when  she  opens  her  chute,  her 
altitude  is  given  by  an  equation  of  the  form 

y  =  b-  c(t  +  ke~^k^j  , 

where  e  is  the  base  of  natural  logarithms,  and  b,  c,  and  k  are  con¬ 
stants.  Because  of  air  resistance,  her  velocity  does  not  increase  at  a 
steady  rate  as  it  would  for  an  object  falling  in  vacuum. 

(a)  What  units  would  6,  c,  and  k  have  to  have  for  the  equation  to 
make  sense? 

(b)  Find  the  person’s  velocity,  v,  as  a  function  of  time.  [You  will 
need  to  use  the  chain  rule,  and  the  fact  that  d{ex)/ dx  =  ex .]  V 

(c)  Use  your  answer  from  part  (b)  to  get  an  interpretation  of  the 
constant  c.  [Hint:  e~x  approaches  zero  for  large  values  of  x.] 
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(d)  Find  the  person’s  acceleration,  a,  as  a  function  of  time.  V 

(e)  Use  your  answer  from  part  (d)  to  show  that  if  she  waits  long 
enough  to  open  her  chute,  her  acceleration  will  become  very  small. 

■ 

19  In  July  1999,  Popular  Mechanics  carried  out  tests  to  find 

which  car  sold  by  a  major  auto  maker  could  cover  a  quarter  mile 
(402  meters)  in  the  shortest  time,  starting  from  rest.  Because  the 
distance  is  so  short,  this  type  of  test  is  designed  mainly  to  favor  the 
car  with  the  greatest  acceleration,  not  the  greatest  maximum  speed 
(which  is  irrelevant  to  the  average  person).  The  winner  was  the 
Dodge  Viper,  with  a  time  of  12.08  s.  The  car’s  top  (and  presumably 
final)  speed  was  118.51  miles  per  hour  (52.98  m/s),  (a)  If  a  car, 
starting  from  rest  and  moving  with  constant  acceleration,  covers 
a  quarter  mile  in  this  time  interval,  what  is  its  acceleration?  (b) 
What  would  be  the  final  speed  of  a  car  that  covered  a  quarter  mile 
with  the  constant  acceleration  you  found  in  part  a?  (c)  Based  on 
the  discrepancy  between  your  answer  in  part  b  and  the  actual  final 
speed  of  the  Viper,  what  do  you  conclude  about  how  its  acceleration 
changed  over  time?  >  Solution,  p.  971  ■ 

20  The  speed  required  for  a  low-earth  orbit  is  7.9  x  103  m/s. 

When  a  rocket  is  launched  into  orbit,  it  goes  up  a  little  at  first  to  get 
above  almost  all  of  the  atmosphere,  but  then  tips  over  horizontally 
to  build  up  to  orbital  speed.  Suppose  the  horizontal  acceleration  is 
limited  to  3 g  to  keep  from  damaging  the  cargo  (or  hurting  the  crew, 
for  a  crewed  flight),  (a)  What  is  the  minimum  distance  the  rocket 
must  travel  downrange  before  it  reaches  orbital  speed?  How  much 
does  it  matter  whether  you  take  into  account  the  initial  eastward 
velocity  due  to  the  rotation  of  the  earth?  (b)  Rather  than  a  rocket 
ship,  it  might  be  advantageous  to  use  a  railgun  design,  in  which  the 
craft  would  be  accelerated  to  orbital  speeds  along  a  railroad  track. 
This  has  the  advantage  that  it  isn’t  necessary  to  lift  a  large  mass  of 
fuel,  since  the  energy  source  is  external.  Based  on  your  answer  to 
part  a,  comment  on  the  feasibility  of  this  design  for  crewed  launches 
from  the  earth’s  surface.  - 

21  Consider  the  following  passage  from  Alice  in  Wonderland,  in 
which  Alice  has  been  falling  for  a  long  time  down  a  rabbit  hole: 

Down,  down,  down.  Would  the  fall  never  come  to  an  end?  “I 
wonder  how  many  miles  I’ve  fallen  by  this  time?”  she  said  aloud. 
“I  must  be  getting  somewhere  near  the  center  of  the  earth.  Let  me 
see:  that  would  be  four  thousand  miles  down,  I  think”  (for,  you  see, 
Alice  had  learned  several  things  of  this  sort  in  her  lessons  in  the 
schoolroom,  and  though  this  was  not  a  very  good  opportunity  for 
showing  off  her  knowledge,  as  there  was  no  one  to  listen  to  her,  still 
it  was  good  practice  to  say  it  over)... 

Alice  doesn’t  know  much  physics,  but  let’s  try  to  calculate  the 
amount  of  time  it  would  take  to  fall  four  thousand  miles,  starting 


Problems 


51 


from  rest  with  an  acceleration  of  10  m/s2.  This  is  really  only  a  lower 
limit;  if  there  really  was  a  hole  that  deep,  the  fall  would  actually 
take  a  longer  time  than  the  one  you  calculate,  both  because  there 
is  air  friction  and  because  gravity  gets  weaker  as  you  get  deeper  (at 
the  center  of  the  earth,  g  is  zero,  because  the  earth  is  pulling  you 
equally  in  every  direction  at  once).  V  ^ 

22  How  many  cubic  inches  are  there  in  a  cubic  foot?  The  answer 

is  not  12.  V  ■ 

23  Assume  a  dog’s  brain  is  twice  as  great  in  diameter  as  a  cat’s, 
but  each  animal’s  brain  cells  are  the  same  size  and  their  brains  are 
the  same  shape.  In  addition  to  being  a  far  better  companion  and 
much  nicer  to  come  home  to,  how  many  times  more  brain  cells  does 
a  dog  have  than  a  cat?  The  answer  is  not  2. 

24  The  population  density  of  Los  Angeles  is  about  4000  people/km2 . 

That  of  San  Francisco  is  about  6000  people/km2.  How  many  times 
farther  away  is  the  average  person’s  nearest  neighbor  in  LA  than  in 
San  Francisco?  The  answer  is  not  1.5.  V  ^ 

25  A  hunting  dog’s  nose  has  about  10  square  inches  of  active 

surface.  How  is  this  possible,  since  the  dog’s  nose  is  only  about  1  in 
x  1  in  x  1  in  =  1  in3?  After  all,  10  is  greater  than  1,  so  how  can  it 
fit?  ■ 

26  Estimate  the  number  of  blades  of  grass  on  a  football  field. 

B 

27  In  a  computer  memory  chip,  each  bit  of  information  (a  0  or 

a  1)  is  stored  in  a  single  tiny  circuit  etched  onto  the  surface  of  a 
silicon  chip.  The  circuits  cover  the  surface  of  the  chip  like  lots  in  a 
housing  development.  A  typical  chip  stores  64  Mb  (megabytes)  of 
data,  where  a  byte  is  8  bits.  Estimate  (a)  the  area  of  each  circuit, 
and  (b)  its  linear  size.  - 

28  Suppose  someone  built  a  gigantic  apartment  building,  mea¬ 

suring  10  km  x  10  km  at  the  base.  Estimate  how  tall  the  building 
would  have  to  be  to  have  space  in  it  for  the  entire  world’s  population 
to  live.  - 

29  A  hamburger  chain  advertises  that  it  has  sold  10  billion 
Bongo  Burgers.  Estimate  the  total  mass  of  feed  required  to  raise 
the  cows  used  to  make  the  burgers. 

30  Estimate  the  volume  of  a  human  body,  in  cm3. 

31  How  many  cm2  is  1  mm2?  [>  Solution,  p.  971  - 

32  Compare  the  light-gathering  powers  of  a  3-cm-diameter  tele¬ 
scope  and  a  30-cm  telescope.  t>  Solution,  p.  971  - 

33  One  step  on  the  Richter  scale  corresponds  to  a  factor  of  100 
in  terms  of  the  energy  absorbed  by  something  on  the  surface  of  the 
Earth,  e.g.,  a  house.  For  instance,  a  9.3-magnitude  quake  would 
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release  100  times  more  energy  than  an  8.3.  The  energy  spreads  out 
from  the  epicenter  as  a  wave,  and  for  the  sake  of  this  problem  we’ll 
assume  we’re  dealing  with  seismic  waves  that  spread  out  in  three 
dimensions,  so  that  we  can  visualize  them  as  hemispheres  spreading 
out  under  the  surface  of  the  earth.  If  a  certain  7.6-magnitude  earth¬ 
quake  and  a  certain  5.6-magnitude  earthquake  produce  the  same 
amount  of  vibration  where  I  live,  compare  the  distances  from  my 
house  to  the  two  epicenters.  >  Solution,  p.  971 

34  In  Europe,  a  piece  of  paper  of  the  standard  size,  called  A4, 

is  a  little  narrower  and  taller  than  its  American  counterpart.  The 
ratio  of  the  height  to  the  width  is  the  square  root  of  2,  and  this  has 
some  useful  properties.  For  instance,  if  you  cut  an  A4  sheet  from  left 
to  right,  you  get  two  smaller  sheets  that  have  the  same  proportions. 
You  can  even  buy  sheets  of  this  smaller  size,  and  they’re  called  A5. 
There  is  a  whole  series  of  sizes  related  in  this  way,  all  with  the  same 
proportions,  (a)  Compare  an  A5  sheet  to  an  A4  in  terms  of  area  and 
linear  size,  (b)  The  series  of  paper  sizes  starts  from  an  AO  sheet, 
which  has  an  area  of  one  square  meter.  Suppose  we  had  a  series 
of  boxes  defined  in  a  similar  way:  the  BO  box  has  a  volume  of  one 
cubic  meter,  two  B1  boxes  fit  exactly  inside  an  BO  box,  and  so  on. 
What  would  be  the  dimensions  of  a  BO  box?  V  - 

35  Estimate  the  mass  of  one  of  the  hairs  in  Albert  Einstein’s 

moustache,  in  units  of  kg.  ■ 

36  According  to  folklore,  every  time  you  take  a  breath,  you  are 

inhaling  some  of  the  atoms  exhaled  in  Caesar’s  last  words.  Is  this 
true?  If  so,  how  many?  ^ 

37  The  Earth’s  surface  is  about  70%  water.  Mars’s  diameter  is 

about  half  the  Earth’s,  but  it  has  no  surface  water.  Compare  the 
land  areas  of  the  two  planets.  %  ^ 

38  The  traditional  Martini  glass  is  shaped  like  a  cone  with 

the  point  at  the  bottom.  Suppose  you  make  a  Martini  by  pouring 
vermouth  into  the  glass  to  a  depth  of  3  cm,  and  then  adding  gin 
to  bring  the  depth  to  6  cm.  What  are  the  proportions  of  gin  and 
vermouth?  >  Solution,  p.  972  ^ 

39  The  central  portion  of  a  CD  is  taken  up  by  the  hole  and  some 

surrounding  clear  plastic,  and  this  area  is  unavailable  for  storing 
data.  The  radius  of  the  central  circle  is  about  35%  of  the  outer 
radius  of  the  data-storing  area.  What  percentage  of  the  CD’s  area 
is  therefore  lost?  %  - 

40  The  one-liter  cube  in  the  photo  has  been  marked  off  into 
smaller  cubes,  with  linear  dimensions  one  tenth  those  of  the  big 
one.  What  is  the  volume  of  each  of  the  small  cubes? 

>  Solution,  p.  972  - 


Albert  Einstein,  and  his  mous¬ 
tache,  problem  35. 


Problem  40. 


Problems 
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Problem  42. 


Problem  43. 


41  Estimate  the  number  of  man-hours  required  for  building  the 

Great  Wall  of  China.  >  Solution,  p.  972  ^ 

42  (a)  Using  the  microscope  photo  in  the  figure,  estimate  the 

mass  of  a  one  cell  of  the  E.  coli  bacterium,  which  is  one  of  the 
most  common  ones  in  the  human  intestine.  Note  the  scale  at  the 
lower  right  corner,  which  is  1  /im.  Each  of  the  tubular  objects  in 
the  column  is  one  cell,  (b)  The  feces  in  the  human  intestine  are 
mostly  bacteria  (some  dead,  some  alive),  of  which  E.  coli  is  a  large 
and  typical  component.  Estimate  the  number  of  bacteria  in  your 
intestines,  and  compare  with  the  number  of  human  cells  in  your 
body,  which  is  believed  to  be  roughly  on  the  order  of  1013.  (c) 

Interpreting  your  result  from  part  b,  what  does  this  tell  you  about 
the  size  of  a  typical  human  cell  compared  to  the  size  of  a  typical 
bacterial  cell?  - 

43  The  figure  shows  a  practical,  simple  experiment  for  determin¬ 
ing  g  to  high  precision.  Two  steel  balls  are  suspended  from  electro¬ 
magnets,  and  are  released  simultaneously  when  the  electric  current 
is  shut  off.  They  fall  through  unequal  heights  Ax\  and  Ax2 ■  A 
computer  records  the  sounds  through  a  microphone  as  first  one  ball 
and  then  the  other  strikes  the  floor.  From  this  recording,  we  can 
accurately  determine  the  quantity  T  defined  as  T  =  At.2  —  Ati,  i.e., 
the  time  lag  between  the  first  and  second  impacts.  Note  that  since 
the  balls  do  not  make  any  sound  when  they  are  released,  we  have 
no  way  of  measuring  the  individual  times  At2  and  At\. 

(a)  Find  an  equation  for  g  in  terms  of  the  measured  quantities  T, 

Ax\  and  Ax2 .  V 

(b)  Check  the  units  of  your  equation. 

(c)  Check  that  your  equation  gives  the  correct  result  in  the  case 
where  Ax\  is  very  close  to  zero.  However,  is  this  case  realistic? 

(d)  What  happens  when  Ax\  =  A22?  Discuss  this  both  mathemat¬ 
ically  and  physically.  B 

44  Estimate  the  number  of  jellybeans  in  figure  o  on  p.  45. 

>  Solution,  p.  972  - 

45  Let  the  function  x  be  defined  by  x(t)  =  Aebt,  where  t  has 

units  of  seconds  and  x  has  units  of  meters.  (For  b  <  0,  this  could  be 
a  fairly  accurate  model  of  the  motion  of  a  bullet  shot  into  a  tank  of 
oil.)  Show  that  the  Taylor  series  of  this  function  makes  sense  if  and 
only  if  A  and  b  have  certain  units.  ^ 


Key  to  symbols: 

easy  -  typical  B  challenging  B  difficult  B  very  difficult 
V  An  answer  check  is  available  at  www.lightandmatter.com. 
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■  The  universe  has  been  recycling 

its  contents  ever  since  the  Big 
Bang,  13.7  billion  years  ago. 


Chapter  1 

Conservation  of  Mass 


It  took  just  a  moment  for  that  head  to  fall,  but  a  hundred  years 
might  not  produce  another  like  it. 

Joseph-Louis  Lagrange,  referring  to  the  execution  of  Lavoisier 
on  May  8,  1 79 4 

1.1  Mass 

Change  is  impossible,  claimed  the  ancient  Greek  philosopher  Par¬ 
menides.  His  work  was  nonscientihc,  since  he  didn’t  state  his  ideas 
in  a  form  that  would  allow  them  to  be  tested  experimentally,  but 
modern  science  nevertheless  has  a  strong  Parmenidean  flavor.  His 
main  argument  that  change  is  an  illusion  was  that  something  can’t 
be  turned  into  nothing,  and  likewise  if  you  have  nothing,  you  can’t 
turn  it  into  something.  To  make  this  into  a  scientific  theory,  we  have 
to  decide  on  a  way  to  measure  what  “something”  is,  and  we  can  then 
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a  /  Portrait  of  Monsieur  Lavoisier 
and  His  Wife,  by  Jacques-Louis 
David,  1788.  Lavoisier  invented 
the  concept  of  conservation  of 
mass.  The  husband  is  depicted 
with  his  scientific  apparatus, 
while  in  the  background  on  the 
left  is  the  portfolio  belonging 
to  Madame  Lavoisier,  who  is 
thought  to  have  been  a  student  of 
David’s. 


b/A  measurement  of  gravi¬ 
tational  mass:  the  sphere  has 
a  gravitational  mass  of  five 
kilograms. 


check  by  measurements  whether  the  total  amount  of  “something”  in 
the  universe  really  stays  constant.  How  much  “something”  is  there 
in  a  rock?  Does  a  sunbeam  count  as  “something?”  Does  heat  count? 
Motion?  Thoughts  and  feelings? 

If  you  look  at  the  table  of  contents  of  this  book,  you’ll  see  that 
the  first  four  chapters  have  the  word  “conservation”  in  them.  In 
physics,  a  conservation  law  is  a  statement  that  the  total  amount  of 
a  certain  physical  quantity  always  stays  the  same.  This  chapter  is 
about  conservation  of  mass.  The  metric  system  is  designed  around 
a  unit  of  distance,  the  meter,  a  unit  of  mass,  the  kilogram,  and  a 
time  unit,  the  second.  Numerical  measurement  of  distance  and  time 
probably  date  back  almost  as  far  into  prehistory  as  counting  money, 
but  mass  is  a  more  modern  concept.  Until  scientists  figured  out  that 
mass  was  conserved,  it  wasn’t  obvious  that  there  could  be  a  single, 
consistent  way  of  measuring  an  amount  of  matter,  hence  jiggers  of 
whiskey  and  cords  of  wood.  You  may  wonder  why  conservation  of 
mass  wasn’t  discovered  until  relatively  modern  times,  but  it  wasn’t 
obvious,  for  example,  that  gases  had  mass,  and  that  the  apparent 
loss  of  mass  when  wood  was  burned  was  exactly  matched  by  the 
mass  of  the  escaping  gases. 

Once  scientists  were  on  the  track  of  the  conservation  of  mass 
concept,  they  began  looking  for  a  way  to  define  mass  in  terms  of  a 
definite  measuring  procedure.  If  they  tried  such  a  procedure,  and  the 
result  was  that  it  led  to  nonconservation  of  mass,  then  they  would 
throw  it  out  and  try  a  different  procedure.  For  instance,  we  might 
be  tempted  to  define  mass  using  kitchen  measuring  cups,  i.e.,  as  a 
measure  of  volume.  Mass  would  then  be  perfectly  conserved  for  a 
process  like  mixing  marbles  with  peanut  butter,  but  there  would  be 
processes  like  freezing  water  that  led  to  a  net  increase  in  mass,  and 
others  like  soaking  up  water  with  a  sponge  that  caused  a  decrease. 
If,  with  the  benefit  of  hindsight,  it  seems  like  the  measuring  cup 
definition  was  just  plain  silly,  then  here’s  a  more  subtle  example  of 
a  wrong  definition  of  mass.  Suppose  we  define  it  using  a  bathroom 
scale,  or  a  more  precise  device  such  as  a  postal  scale  that  works  on 
the  same  principle  of  using  gravity  to  compress  or  twist  a  spring. 
The  trouble  is  that  gravity  is  not  equally  strong  all  over  the  surface 
of  the  earth,  so  for  instance  there  would  be  nonconservation  of  mass 
when  you  brought  an  object  up  to  the  top  of  a  mountain,  where 
gravity  is  a  little  weaker. 

Although  some  of  the  obvious  possibilities  have  problems,  there 
do  turn  out  to  be  at  least  two  approaches  to  defining  mass  that  lead 
to  its  being  a  conserved  quantity,  so  we  consider  these  definitions 
to  be  “right”  in  the  pragmatic  sense  that  what’s  correct  is  what’s 
useful. 

One  definition  that  works  is  to  use  balances,  but  compensate 
for  the  local  strength  of  gravity.  This  is  the  method  that  is  used 


56 


Chapter  1 


Conservation  of  Mass 


by  scientists  who  actually  specialize  in  ultraprecise  measurements. 
A  standard  kilogram,  in  the  form  of  a  platinum-iridium  cylinder, 
is  kept  in  a  special  shrine  in  Paris.  Copies  are  made  that  balance 
against  the  standard  kilogram  in  Parisian  gravity,  and  they  are  then 
transported  to  laboratories  in  other  parts  of  the  world,  where  they 
are  compared  with  other  masses  in  the  local  gravity.  The  quantity 
defined  in  this  way  is  called  gravitational  mass. 

A  second  and  completely  different  approach  is  to  measure  how 
hard  it  is  to  change  an  object’s  state  of  motion.  This  tells  us  its  in¬ 
ertial  mass.  For  example,  I’d  be  more  willing  to  stand  in  the  way  of 
an  oncoming  poodle  than  in  the  path  of  a  freight  train,  because  my 
body  will  have  a  harder  time  convincing  the  freight  train  to  stop. 
This  is  a  dictionary-style  conceptual  definition,  but  in  physics  we 
need  to  back  up  a  conceptual  definition  with  an  operational  defini¬ 
tion,  which  is  one  that  spells  out  the  operations  required  in  order 
to  measure  the  quantity  being  defined.  We  can  operationalize  our 
definition  of  inertial  mass  by  throwing  a  standard  kilogram  at  an  ob¬ 
ject  at  a  speed  of  1  m/s  (one  meter  per  second)  and  measuring  the 
recoiling  object’s  velocity.  Suppose  we  want  to  measure  the  mass 
of  a  particular  block  of  cement.  We  put  the  block  in  a  toy  wagon 
on  the  sidewalk,  and  throw  a  standard  kilogram  at  it.  Suppose  the 
standard  kilogram  hits  the  wagon,  and  then  drops  straight  down 
to  the  sidewalk,  having  lost  all  its  velocity,  and  the  wagon  and  the 
block  inside  recoil  at  a  velocity  of  0.23  m/s.  We  then  repeat  the 
experiment  with  the  block  replaced  by  various  numbers  of  standard 
kilograms,  and  find  that  we  can  reproduce  the  recoil  velocity  of  0.23 
m/s  with  four  standard  kilograms  in  the  wagon.  We  have  deter¬ 
mined  the  mass  of  the  block  to  be  four  kilograms.1  Although  this 
definition  of  inertial  mass  has  an  appealing  conceptual  simplicity,  it 
is  obviously  not  very  practical,  at  least  in  this  crude  form.  Never¬ 
theless,  this  method  of  collision  is  very  much  like  the  methods  used 
for  measuring  the  masses  of  subatomic  particles,  which,  after  all, 
can’t  be  put  on  little  postal  scales! 

Astronauts  spending  long  periods  of  time  in  space  need  to  mon¬ 
itor  their  loss  of  bone  and  muscle  mass,  and  here  as  well,  it’s  im¬ 
possible  to  measure  gravitational  mass.  Since  they  don’t  want  to 
have  standard  kilograms  thrown  at  them,  they  use  a  slightly  differ¬ 
ent  technique  (figures  d  and  e).  They  strap  themselves  to  a  chair 
which  is  attached  to  a  large  spring,  and  measure  the  time  it  takes 
for  one  cycle  of  vibration. 


1You  might  think  intuitively  that  the  recoil  velocity  should  be  exactly  one 
fourth  of  a  meter  per  second,  and  you’d  be  right  except  that  the  wagon  has  some 
mass  as  well.  Our  present  approach,  however,  only  requires  that  we  give  a  way 
to  test  for  equality  of  masses.  To  predict  the  recoil  velocity  from  scratch,  we’d 
need  to  use  conservation  of  momentum,  which  is  discussed  in  a  later  chapter. 


c/A  measurement  of  inertial 
mass:  the  wagon  recoils  with 
the  same  velocity  in  experiments 
1  and  2,  establishing  that  the 
inertial  mass  of  the  cement  block 
is  four  kilograms. 


d  /  The  time  for  one  cycle  of 
vibration  is  related  to  the  object’s 
inertial  mass. 


e  /  Astronaut  Tamara  Jernigan 
measures  her  inertial  mass 
aboard  the  Space  Shuttle. 


Section  1.1  Mass 
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1.1.1  Problem-solving  techniques 

How  do  we  use  a  conservation  law,  such  as  conservation  of  mass, 
to  solve  problems?  There  are  two  basic  techniques. 

As  an  analogy,  consider  conservation  of  money,  which  makes  it 
illegal  for  you  to  create  dollar  bills  using  your  own  inkjet  printer. 
(Most  people  don’t  intentionally  destroy  their  dollar  bills,  either!) 
Suppose  the  police  notice  that  a  particular  store  doesn’t  seem  to 
have  any  customers,  but  the  owner  wears  lots  of  gold  jewelry  and 
drives  a  BMW.  They  suspect  that  the  store  is  a  front  for  some  kind 
of  crime,  perhaps  counterfeiting.  With  intensive  surveillance,  there 
are  two  basic  approaches  they  could  use  in  their  investigation.  One 
method  would  be  to  have  undercover  agents  try  to  find  out  how 
much  money  goes  in  the  door,  and  how  much  money  comes  back 
out  at  the  end  of  the  day,  perhaps  by  arranging  through  some  trick 
to  get  access  to  the  owner’s  briefcase  in  the  morning  and  evening.  If 
the  amount  of  money  that  comes  out  every  day  is  greater  than  the 
amount  that  went  in,  and  if  they’re  convinced  there  is  no  safe  on  the 
premises  holding  a  large  reservoir  of  money,  then  the  owner  must 
be  counterfeiting.  This  inflow-equals-outflow  technique  is  useful  if 
we  are  sure  that  there  is  a  region  of  space  within  which  there  is  no 
supply  of  mass  that  is  being  built  up  or  depleted. 

A  stream  of  water  example  1 

If  you  watch  water  flowing  out  of  the  end  of  a  hose,  you’ll  see 
that  the  stream  of  water  is  fatter  near  the  mouth  of  the  hose,  and 
skinnier  lower  down.  This  is  because  the  water  speeds  up  as  it 
falls.  If  the  cross-sectional  area  of  the  stream  was  equal  all  along 
its  length,  then  the  rate  of  flow  (kilograms  per  second)  through  a 
lower  cross-section  would  be  greater  than  the  rate  of  flow  through 
a  cross-section  higher  up.  Since  the  flow  is  steady,  the  amount 
of  water  between  the  two  cross-sections  stays  constant.  Conser¬ 
vation  of  mass  therefore  requires  that  the  cross-sectional  area  of 
the  stream  shrink  in  inverse  proportion  to  the  increasing  speed  of 
the  falling  water. 

self-check  A 

Suppose  the  you  point  the  hose  straight  up,  so  that  the  water  is  rising 
rather  than  falling.  What  happens  as  the  velocity  gets  smaller?  What 
happens  when  the  velocity  becomes  zero?  t>  Answer,  p.  961 

,  r-  .  .  How  can  we  apply  a  conservation  law,  such  as  conservation  of 

mass,  in  a  situation  where  mass  might  be  stored  up  somewhere?  To 
use  a  crime  analogy  again,  a  prison  could  contain  a  certain  number 
of  prisoners,  who  are  not  allowed  to  flow  in  or  out  at  will.  In  physics, 
this  is  known  as  a  closed  system.  A  guard  might  notice  that  a  certain 
prisoner’s  cell  is  empty,  but  that  doesn’t  mean  he’s  escaped.  He 
could  be  sick  in  the  infirmary,  or  hard  at  work  in  the  shop  earning 
cigarette  money.  What  prisons  actually  do  is  to  count  all  their 
prisoners  every  day,  and  make  sure  today’s  total  is  the  same  as 
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yesterday’s.  One  way  of  stating  a  conservation  law  is  that  for  a 
closed  system,  the  total  amount  of  stuff  (mass,  in  this  chapter)  stays 
constant. 

Lavoisier  and  chemical  reactions  in  a  closed  system  example  2 
The  French  chemist  Antoine-Laurent  Lavoisier  is  considered  the 
inventor  of  the  concept  of  conservation  of  mass.  Before  Lavoisier, 
chemists  had  never  systematically  weighed  their  chemicals  to 
quantify  the  amount  of  each  substance  that  was  undergoing  re¬ 
actions.  They  also  didn’t  completely  understand  that  gases  were 
just  another  state  of  matter,  and  hadn’t  tried  performing  reactions 
in  sealed  chambers  to  determine  whether  gases  were  being  con¬ 
sumed  from  or  released  into  the  air.  For  this  they  had  at  least 
one  practical  excuse,  which  is  that  if  you  perform  a  gas-releasing 
reaction  in  a  sealed  chamber  with  no  room  for  expansion,  you  get 
an  explosion!  Lavoisier  invented  a  balance  that  was  capable  of 
measuring  milligram  masses,  and  figured  out  how  to  do  reactions 
in  an  upside-down  bowl  in  a  basin  of  water,  so  that  the  gases 
could  expand  by  pushing  out  some  of  the  water.  In  a  crucial  ex¬ 
periment,  Lavoisier  heated  a  red  mercury  compound,  which  we 
would  now  describe  as  mercury  oxide  (FlgO),  in  such  a  sealed 
chamber.  A  gas  was  produced  (Lavoisier  later  named  it  “oxy¬ 
gen”),  driving  out  some  of  the  water,  and  the  red  compound  was 
transformed  into  silvery  liquid  mercury  metal.  The  crucial  point 
was  that  the  total  mass  of  the  entire  apparatus  was  exactly  the 
same  before  and  after  the  reaction.  Based  on  many  observations 
of  this  type,  Lavoisier  proposed  a  general  law  of  nature,  that  mass 
is  always  conserved.  (In  earlier  experiments,  in  which  closed  sys¬ 
tems  were  not  used,  chemists  had  become  convinced  that  there 
was  a  mysterious  substance,  phlogiston,  involved  in  combustion 
and  oxidation  reactions,  and  that  phlogiston’s  mass  could  be  pos¬ 
itive,  negative,  or  zero  depending  on  the  situation!) 

1.1.2  Delta  notation 

A  convenient  notation  used  throughout  physics  is  A,  the  up¬ 
percase  Greek  letter  delta,  which  indicates  “change  in”  or  “after 
minus  before.”  For  example,  if  b  represents  how  much  money  you 
have  in  the  bank,  then  a  deposit  of  $100  could  be  represented  as 
A b  =  $100.  That  is,  the  change  in  your  balance  was  $100,  or  the 
balance  after  the  transaction  minus  the  balance  before  the  transac¬ 
tion  equals  $100.  A  withdrawal  would  be  indicated  by  A b  <  0.  We 
represent  “before”  and  “after”  using  the  subscripts  i  (initial)  and 
/  (final),  e.g.,  A b  =  bf  —  bt.  Often  the  delta  notation  allows  more 
precision  than  English  words.  For  instance,  “time”  can  be  used  to 
mean  a  point  in  time  (“now’s  the  time”),  t,  or  it  could  mean  a  period 
of  time  (“the  whole  time,  he  had  spit  on  his  chin”),  At. 

This  notation  is  particularly  convenient  for  discussing  conserved 
quantities.  The  law  of  conservation  of  mass  can  be  stated  simply  as 
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a /The  two  pendulum  bobs 
are  constructed  with  equal  grav¬ 
itational  masses.  If  their  inertial 
masses  are  also  equal,  then  each 
pendulum  should  take  exactly  the 
same  amount  of  time  per  swing. 


b  /  If  the  cylinders  have  slightly 
unequal  ratios  of  inertial  to  grav¬ 
itational  mass,  their  trajectories 
will  be  a  little  different. 


M  si1 

c  /  A  simplified  drawing  of  an 
Eotvos-style  experiment.  If 
the  two  masses,  made  out  of 
two  different  substances,  have 
slightly  different  ratios  of  inertial 
to  gravitational  mass,  then  the 
apparatus  will  twist  slightly  as  the 
earth  spins. 


Am  =  0,  where  m  is  the  total  mass  of  any  closed  system. 

self-check  B 

If  x  represents  the  location  of  an  object  moving  in  one  dimension,  then 
how  would  positive  and  negative  signs  of  Ax  be  interpreted?  > 

Answer,  p.  961 

Discussion  Questions 

A  If  an  object  had  a  straight-line  x  -  t  graph  with  Ax  =  0  and  At  f  0, 
what  would  be  true  about  its  velocity?  What  would  this  look  like  on  a 
graph?  What  about  At  =  0  and  Ax  f  0? 

1 .2  Equivalence  of  Gravitational  and  Inertial  Mass 

We  find  experimentally  that  both  gravitational  and  inertial  mass 
are  conserved  to  a  high  degree  of  precision  for  a  great  number  of 
processes,  including  chemical  reactions,  melting,  boiling,  soaking  up 
water  with  a  sponge,  and  rotting  of  meat  and  vegetables.  Now  it’s 
logically  possible  that  both  gravitational  and  inertial  mass  are  con¬ 
served,  but  that  there  is  no  particular  relationship  between  them,  in 
which  case  we  would  say  that  they  are  separately  conserved.  On  the 
other  hand,  the  two  conservation  laws  may  be  redundant,  like  having 
one  law  against  murder  and  another  law  against  killing  people! 

Here’s  an  experiment  that  gets  at  the  issue:  stand  up  now  and 
drop  a  coin  and  one  of  your  shoes  side  by  side.  I  used  a  400-gram 
shoe  and  a  2-gram  penny,  and  they  hit  the  floor  at  the  same  time 
as  far  as  I  could  tell  by  eye.  This  is  an  interesting  result,  but  a 
physicist  and  an  ordinary  person  will  find  it  interesting  for  different 
reasons. 

The  layperson  is  surprised,  since  it  would  seem  logical  that 
heaver  objects  would  always  fall  faster  than  light  ones.  However, 
it’s  fairly  easy  to  prove  that  if  air  friction  is  negligible,  any  two  ob¬ 
jects  made  of  the  same  substance  must  have  identical  motion  when 
they  fall.  For  instance,  a  2-kg  copper  mass  must  exhibit  the  same 
falling  motion  as  a  1-kg  copper  mass,  because  nothing  would  be 
changed  by  physically  joining  together  two  1-kg  copper  masses  to 
make  a  single  2-kg  copper  mass.  Suppose,  for  example,  that  they 
are  joined  with  a  dab  of  glue;  the  glue  isn’t  under  any  strain,  be¬ 
cause  the  two  masses  are  doing  the  same  thing  side  by  side.  Since 
the  glue  isn’t  really  doing  anything,  it  makes  no  difference  whether 
the  masses  fall  separately  or  side  by  side.2 

What  a  physicist  finds  remarkable  about  the  shoe-and-penny  ex¬ 
periment  is  that  it  came  out  the  way  it  did  even  though  the  shoe 
and  the  penny  are  made  of  different  substances.  There  is  absolutely 
no  theoretical  reason  why  this  should  be  true.  We  could  say  that  it 

2 The  argument  only  fails  for  objects  light  enough  to  be  affected  appreciably 
by  air  friction:  a  bunch  of  feathers  falls  differently  if  you  wad  them  up  because 
the  pattern  of  air  flow  is  altered  by  putting  them  together. 
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happens  because  the  greater  gravitational  mass  of  the  shoe  is  exactly 
counteracted  by  its  greater  inertial  mass,  which  makes  it  harder  for 
gravity  to  get  it  moving,  but  that  just  leaves  us  wondering  why  in¬ 
ertial  mass  and  gravitational  mass  are  always  in  proportion  to  each 
other.  It’s  possible  that  they  are  only  approximately  equivalent. 
Most  of  the  mass  of  ordinary  matter  comes  from  neutrons  and  pro¬ 
tons,  and  we  could  imagine,  for  instance,  that  neutrons  and  protons 
do  not  have  exactly  the  same  ratio  of  gravitational  to  inertial  mass. 
This  would  show  up  as  a  different  ratio  of  gravitational  to  inertial 
mass  for  substances  containing  different  proportions  of  neutrons  and 
protons. 

Galileo  did  the  first  numerical  experiments  on  this  issue  in  the 
seventeenth  century  by  rolling  balls  down  inclined  planes,  although 
he  didn’t  think  about  his  results  in  these  terms.  A  fairly  easy  way  to 
improve  on  Galileo’s  accuracy  is  to  use  pendulums  with  bobs  made 
of  different  materials.  Suppose,  for  example,  that  we  construct  an 
aluminum  bob  and  a  brass  bob,  and  use  a  double-pan  balance  to 
verify  to  good  precision  that  their  gravitational  masses  are  equal.  If 
we  then  measure  the  time  required  for  each  pendulum  to  perform 
a  hundred  cycles,  we  can  check  whether  the  results  are  the  same. 
If  their  inertial  masses  are  unequal,  then  the  one  with  a  smaller 
inertial  mass  will  go  through  each  cycle  faster,  since  gravity  has 
an  easier  time  accelerating  and  decelerating  it.  With  this  type  of 
experiment,  one  can  easily  verify  that  gravitational  and  inertial  mass 
are  proportional  to  each  other  to  an  accuracy  of  10-3  or  10-4. 

In  1889,  the  Hungarian  physicist  Roland  Eotvos  used  a  slightly 
different  approach  to  verify  the  equivalence  of  gravitational  and  in¬ 
ertial  mass  for  various  substances  to  an  accuracy  of  about  10-8,  and 
the  best  such  experiment,  figure  d,  improved  on  even  this  phenome¬ 
nal  accuracy,  bringing  it  to  the  10-12  level.3  In  all  the  experiments 
described  so  far,  the  two  objects  move  along  similar  trajectories: 
straight  lines  in  the  penny-and-shoe  and  inclined  plane  experiments, 
and  circular  arcs  in  the  pendulum  version.  The  Eotvos-style  exper¬ 
iment  looks  for  differences  in  the  objects’  trajectories.  The  concept 
can  be  understood  by  imagining  the  following  simplified  version. 
Suppose,  as  in  figure  b,  we  roll  a  brass  cylinder  off  of  a  tabletop 
and  measure  where  it  hits  the  floor,  and  then  do  the  same  with  an 
aluminum  cylinder,  making  sure  that  both  of  them  go  over  the  edge 
with  precisely  the  same  velocity.  An  object  with  zero  gravitational 
mass  would  fly  off  straight  and  hit  the  wall,  while  an  object  with 
zero  inertial  mass  would  make  a  sudden  90-degree  turn  and  drop 
straight  to  the  floor.  If  the  aluminum  and  brass  cylinders  have  or¬ 
dinary,  but  slightly  unequal,  ratios  of  gravitational  to  inertial  mass, 
then  they  will  follow  trajectories  that  are  just  slightly  different.  In 
other  words,  if  inertial  and  gravitational  mass  are  not  exactly  pro¬ 
portional  to  each  other  for  all  substances,  then  objects  made  of 

3V.B.  Braginskii  and  V.I.  Panov,  Soviet  Physics  JETP  34,  463  (1972). 


c 


c 


d/A  more  realistic  drawing 
of  Braginskii  and  Panov’s  ex¬ 
periment.  The  whole  thing  was 
encased  in  a  tall  vacuum  tube, 
which  was  placed  in  a  sealed 
basement  whose  temperature 
was  controlled  to  within  0.02°C. 
The  total  mass  of  the  platinum 
and  aluminum  test  masses, 
plus  the  tungsten  wire  and  the 
balance  arms,  was  only  4.4  g. 
To  detect  tiny  motions,  a  laser 
beam  was  bounced  off  of  a  mirror 
attached  to  the  wire.  There  was 
so  little  friction  that  the  balance 
would  have  taken  on  the  order  of 
several  years  to  calm  down  com¬ 
pletely  after  being  put  in  place; 
to  stop  these  vibrations,  static 
electrical  forces  were  applied 
through  the  two  circular  plates  to 
provide  very  gentle  twists  on  the 
ellipsoidal  mass  between  them. 
After  Braginskii  and  Panov. 
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different  substances  will  have  different  trajectories  in  the  presence 
of  gravity. 

A  simplified  drawing  of  a  practical,  high-precision  experiment 
is  shown  in  figure  c.  Two  objects  made  of  different  substances  are 
balanced  on  the  ends  of  a  bar,  which  is  suspended  at  the  center  from 
a  thin  fiber.  The  whole  apparatus  moves  through  space  on  a  com¬ 
plicated,  looping  trajectory  arising  from  the  rotation  of  the  earth 
superimposed  on  the  earth’s  orbital  motion  around  the  sun.  Both 
the  earth’s  gravity  and  the  sun’s  gravity  act  on  the  two  objects.  If 
their  inertial  masses  are  not  exactly  in  proportion  to  their  gravi¬ 
tational  masses,  then  they  will  follow  slightly  different  trajectories 
through  space,  which  will  result  in  a  very  slight  twisting  of  the  fiber 
between  the  daytime,  when  the  sun’s  gravity  is  pulling  upward,  and 
the  night,  when  the  sun’s  gravity  is  downward.  Figure  d  shows  a 
more  realistic  picture  of  the  apparatus. 

This  type  of  experiment,  in  which  one  expects  a  null  result,  is 
a  tough  way  to  make  a  career  as  a  scientist.  If  your  measurement 
comes  out  as  expected,  but  with  better  accuracy  than  other  people 
had  previously  achieved,  your  result  is  publishable,  but  won’t  be 
considered  earthshattering.  On  the  other  hand,  if  you  build  the 
most  sensitive  experiment  ever,  and  the  result  comes  out  contrary 
to  expectations,  you’re  in  a  scary  situation.  You  could  be  right,  and 
earn  a  place  in  history,  but  if  the  result  turns  out  to  be  due  to  a 
defect  in  your  experiment,  then  you’ve  made  a  fool  of  yourself. 


a  /  Galileo  Galilei  (1 564-1 642). 


1.3  Galilean  Relativity 

I  defined  inertial  mass  conceptually  as  a  measure  of  how  hard  it 
is  to  change  an  object’s  state  of  motion,  the  implication  being  that  if 
you  don’t  interfere,  the  object’s  motion  won’t  change.  Most  people, 
however,  believe  that  objects  in  motion  have  a  natural  tendency  to 
slow  down.  Suppose  I  push  my  refrigerator  to  the  west  for  a  while  at 
0.1  m/s,  and  then  stop  pushing.  The  average  person  would  say  fridge 
just  naturally  stopped  moving,  but  let’s  imagine  how  someone  in 
China  would  describe  the  fridge  experiment  carried  out  in  my  house 
here  in  California.  Due  to  the  rotation  of  the  earth,  California  is 
moving  to  the  east  at  about  400  m/s.  A  point  in  China  at  the  same 
latitude  has  the  same  speed,  but  since  China  is  on  the  other  side 
of  the  planet,  China’s  east  is  my  west.  (If  you’re  finding  the  three- 
dimensional  visualization  difficult,  just  think  of  China  and  California 
as  two  freight  trains  that  go  past  each  other,  each  traveling  at  400 
m/s.)  If  I  insist  on  thinking  of  my  dirt  as  being  stationary,  then 
China  and  its  dirt  are  moving  at  800  m/s  to  my  west.  From  China’s 
point  of  view,  however,  it’s  California  that  is  moving  800  m/s  in 
the  opposite  direction  (my  east).  When  I’m  pushing  the  fridge  to 
the  west  at  0.1  m/s,  the  observer  in  China  describes  its  speed  as 
799.9  m/s.  Once  I  stop  pushing,  the  fridge  speeds  back  up  to  800 
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m/s.  From  my  point  of  view,  the  fridge  “naturally”  slowed  down 
when  I  stopped  pushing,  but  according  to  the  observer  in  China,  it 
“naturally”  sped  up! 

What’s  really  happening  here  is  that  there’s  a  tendency,  due 
to  friction,  for  the  fridge  to  stop  moving  relative  to  the  floor.  In 
general,  only  relative  motion  has  physical  significance  in  physics,  not 
absolute  motion.  It’s  not  even  possible  to  define  absolute  motion, 
since  there  is  no  special  reference  point  in  the  universe  that  everyone 
can  agree  is  at  rest.  Of  course  if  we  want  to  measure  motion,  we 
do  have  to  pick  some  arbitrary  reference  point  which  we  will  say 
is  standing  still,  and  we  can  then  define  x,  y,  and  2  coordinates 
extending  out  from  that  point,  which  we  can  define  as  having  x  =  0, 
y  =  0,  z  =  0.  Setting  up  such  a  system  is  known  as  choosing  a 
frame  of  reference.  The  local  dirt  is  a  natural  frame  of  reference  for 
describing  a  game  of  basketball,  but  if  the  game  was  taking  place  on 
the  deck  of  a  moving  ocean  liner,  we  would  probably  pick  a  frame  of 
reference  in  which  the  deck  was  at  rest,  and  the  land  was  moving. 

Galileo  was  the  first  scientist  to  reason  along  these  lines,  and 
we  now  use  the  term  Galilean  relativity  to  refer  to  a  somewhat 
modernized  version  of  his  principle.  Roughly  speaking,  the  principle 
of  Galilean  relativity  states  that  the  same  laws  of  physics  apply  in 
any  frame  of  reference  that  is  moving  in  a  straight  line  at  constant 
speed.  We  need  to  refine  this  statement,  however,  since  it  is  not 
necessarily  obvious  which  frames  of  reference  are  going  in  a  straight 
line  at  constant  speed.  A  person  in  a  pickup  truck  pulling  away 
from  a  stoplight  could  admit  that  the  car’s  velocity  is  changing, 
or  she  could  insist  that  the  truck  is  at  rest,  and  the  meter  on  the 
dashboard  is  going  up  because  the  asphalt  picked  that  moment  to 
start  moving  faster  and  faster  backward!  Frames  of  reference  are 
not  all  created  equal,  however,  and  the  accelerating  truck’s  frame 
of  reference  is  not  as  good  as  the  asphalt’s.  We  can  tell,  because 
a  bowling  ball  in  the  back  of  the  truck,  as  in  figure  c,  appears  to 
behave  strangely  in  the  driver’s  frame  of  reference:  in  her  rear¬ 
view  mirror,  she  sees  the  ball,  initially  at  rest,  start  moving  faster 
and  faster  toward  the  back  of  the  truck.  This  goofy  behavior  is 
evidence  that  there  is  something  wrong  with  her  frame  of  reference. 
A  person  on  the  sidewalk,  however,  sees  the  ball  as  standing  still.  In 
the  sidewalk’s  frame  of  reference,  the  truck  pulls  away  from  the  ball, 
and  this  makes  sense,  because  the  truck  is  burning  gas  and  using  up 
energy  to  change  its  state  of  motion. 

We  therefore  define  an  inertial  frame  of  reference  as  one  in  which 
we  never  see  objects  change  their  state  of  motion  without  any  appar¬ 
ent  reason.  The  sidewalk  is  a  pretty  good  inertial  frame,  and  a  car 
moving  relative  to  the  sidewalk  at  constant  speed  in  a  straight  line 
defines  a  pretty  good  inertial  frame,  but  a  car  that  is  accelerating 
or  turning  is  not  a  inertial  frame. 


b/The  earth  spins.  People 
in  Shanghai  say  they’re  at  rest 
and  people  in  Los  Angeles  are 
moving.  Angelenos  say  the  same 
about  the  Shanghainese. 
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c/Left:  In  a  frame  of  reference 
that  speeds  up  with  the  truck,  the 
bowling  ball  appears  to  change 
its  state  of  motion  for  no  reason. 
Right:  In  an  inertial  frame  of  ref¬ 
erence,  which  the  surface  of  the 
earth  approximately  is,  the  bowl¬ 
ing  ball  stands  still,  which  makes 
sense  because  there  is  nothing 
that  would  cause  it  to  change  its 
state  of  motion. 


The  principle  of  Galilean  relativity  states  that  inertial  frames 
exist,  and  that  the  same  laws  of  physics  apply  in  all  inertial  frames 
of  reference,  regardless  of  one  frame’s  straight-line,  constant-speed 
motion  relative  to  another. 

Another  way  of  putting  it  is  that  all  inertial  frames  are  created 
equal.  We  can  say  whether  one  inertial  frame  is  in  motion  or  at  rest 
relative  to  another,  but  there  is  no  privileged  “rest  frame.”  There 
is  no  experiment  that  comes  out  any  different  in  laboratories  in 
different  inertial  frames,  so  there  is  no  experiment  that  could  tell  us 
which  inertial  frame  is  really,  truly  at  rest. 

The  speed  of  sound  example  3 

>  The  speed  of  sound  in  air  is  only  340  m/s,  so  unless  you  live 
at  a  near-polar  latitude,  you’re  moving  at  greater  than  the  speed 
of  sound  right  now  due  to  the  Earth’s  rotation.  In  that  case,  why 
don’t  we  experience  exciting  phenomena  like  sonic  booms  all  the 
time?  >  It  might  seem  as  though  you’re  unprepared  to  deal  with 
this  question  right  now,  since  the  only  law  of  physics  you  know 
is  conservation  of  mass,  and  conservation  of  mass  doesn’t  tell 
you  anything  obviously  useful  about  the  speed  of  sound  or  sonic 
booms.  Galilean  relativity,  however,  is  a  blanket  statement  about 
all  the  laws  of  physics,  so  in  a  situation  like  this,  it  may  let  you 
predict  the  results  of  the  laws  of  physics  without  actually  knowing 
what  all  the  laws  are!  If  the  laws  of  physics  predict  a  certain  value 
for  the  speed  of  sound,  then  they  had  better  predict  the  speed 
of  the  sound  relative  to  the  air,  not  their  speed  relative  to  some 


4The  principle  of  Galilean  relativity  is  extended  on  page  190. 
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special  “rest  frame.”  Since  the  air  is  moving  along  with  the  rotation 
of  the  earth,  we  don’t  detect  any  special  phenomena.  To  get  a 
sonic  boom,  the  source  of  the  sound  would  have  to  be  moving 
relative  to  the  air. 

The  Foucault  pendulum  example  4 

Note  that  in  the  example  of  the  bowling  ball  in  the  truck,  I  didn’t 
claim  the  sidewalk  was  exactly  a  Galilean  frame  of  reference. 
This  is  because  the  sidewalk  is  moving  in  a  circle  due  to  the  rota¬ 
tion  of  the  Earth,  and  is  therefore  changing  the  direction  of  its  mo¬ 
tion  continuously  on  a  24-hour  cycle.  However,  the  curve  of  the 
motion  is  so  gentle  that  under  ordinary  conditions  we  don’t  notice 
that  the  local  dirt’s  frame  of  reference  isn’t  quite  inertial.  The  first 
demonstration  of  the  noninertial  nature  of  the  earth-fixed  frame  of 
reference  was  by  Foucault  using  a  very  massive  pendulum  (fig¬ 
ure  d)  whose  oscillations  would  persist  for  many  hours  without  be¬ 
coming  imperceptible.  Although  Foucault  did  his  demonstration  in 
Paris,  it’s  easier  to  imagine  what  would  happen  at  the  north  pole: 
the  pendulum  would  keep  swinging  in  the  same  plane,  but  the 
earth  would  spin  underneath  it  once  every  24  hours.  To  someone 
standing  in  the  snow,  it  would  appear  that  the  pendulum’s  plane 
of  motion  was  twisting.  The  effect  at  latitudes  less  than  90  de¬ 
grees  turns  out  to  be  slower,  but  otherwise  similar.  The  Foucault 
pendulum  was  the  first  definitive  experimental  proof  that  the  earth 
really  did  spin  on  its  axis,  although  scientists  had  been  convinced 
of  its  rotation  for  a  century  based  on  more  indirect  evidence  about 
the  structure  of  the  solar  system. 

Although  popular  belief  has  Galileo  being  prosecuted  by  the 
Catholic  Church  for  saying  the  earth  rotated  on  its  axis  and  also  or¬ 
bited  the  sun,  Foucault’s  pendulum  was  still  centuries  in  the  future, 
so  Galileo  had  no  hard  proof;  Galileo’s  insights  into  relative  versus 
absolute  motion  simply  made  it  more  plausible  that  the  world  could 
be  spinning  without  producing  dramatic  effects,  but  didn’t  disprove 
the  contrary  hypothesis  that  the  sun,  moon,  and  stars  went  around 
the  earth  every  24  hours.  Furthermore,  the  Church  was  much  more 
liberal  and  enlightened  than  most  people  believe.  It  didn’t  (and  still 
doesn’t)  require  a  literal  interpretation  of  the  Bible,  and  one  of  the 
Church  officials  involved  in  the  Galileo  affair  wrote  that  “the  Bible 
tells  us  how  to  go  to  heaven,  not  how  the  heavens  go.”  In  other 
words,  religion  and  science  should  be  separate.  The  actual  reason 
Galileo  got  in  trouble  is  shrouded  in  mystery,  since  Italy  in  the  age  of 
the  Medicis  was  a  secretive  place  where  unscrupulous  people  might 
settle  a  score  with  poison  or  a  false  accusation  of  heresy.  What  is  cer¬ 
tain  is  that  Galileo’s  satirical  style  of  scientific  writing  made  many 
enemies  among  the  powerful  Jesuit  scholars  who  were  his  intellec¬ 
tual  opponents  —  he  compared  one  to  a  snake  that  doesn’t  know 
its  own  back  is  broken.  It’s  also  possible  that  the  Church  was  far 
less  upset  by  his  astronomical  work  than  by  his  support  for  atomism 


Section  1.3 


d  /  Foucault  demonstrates 

his  pendulum  to  an  audience  at  a 
lecture  in  1851. 


e  /  Galileo’s  trial. 


Galilean  Relativity 
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f  /  Discussion  question  A. 


g  /  Discussion  question  B. 


h  /  Discussion  question  D. 


(discussed  further  in  the  next  section).  Some  theologians  perceived 
atomism  as  contradicting  transubstantiation,  the  Church’s  doctrine 
that  the  holy  bread  and  wine  were  literally  transformed  into  the 
flesh  and  blood  of  Christ  by  the  priest’s  blessing. 

self-check  C 

What  is  incorrect  about  the  following  supposed  counterexamples  to  the 
principle  of  inertia? 

(1 )  When  astronauts  blast  off  in  a  rocket,  their  huge  velocity  does  cause 
a  physical  effect  on  their  bodies  —  they  get  pressed  back  into  their 
seats,  the  flesh  on  their  faces  gets  distorted,  and  they  have  a  hard  time 
lifting  their  arms. 

(2)  When  you’re  driving  in  a  convertible  with  the  top  down,  the  wind  in 
your  face  is  an  observable  physical  effect  of  your  absolute  motion.  > 
Answer,  p.  961 

>  Solved  problem:  a  bug  on  a  wheel  page  71,  problem  12 

Discussion  Questions 

A  A  passenger  on  a  cruise  ship  finds,  while  the  ship  is  docked,  that 
he  can  leap  off  of  the  upper  deck  and  just  barely  make  it  into  the  pool 
on  the  lower  deck.  If  the  ship  leaves  dock  and  is  cruising  rapidly,  will  this 
adrenaline  junkie  still  be  able  to  make  it? 

B  You  are  a  passenger  in  the  open  basket  hanging  under  a  helium 
balloon.  The  balloon  is  being  carried  along  by  the  wind  at  a  constant 
velocity.  If  you  are  holding  a  flag  in  your  hand,  will  the  flag  wave?  If  so, 
which  way?  [Based  on  a  question  from  PSSC  Physics.] 

C  Aristotle  stated  that  all  objects  naturally  wanted  to  come  to  rest,  with 
the  unspoken  implication  that  “rest”  would  be  interpreted  relative  to  the 
surface  of  the  earth.  Suppose  we  could  transport  Aristotle  to  the  moon, 
put  him  in  a  space  suit,  and  kick  him  out  the  door  of  the  spaceship  and  into 
the  lunar  landscape.  What  would  he  expect  his  fate  to  be  in  this  situation? 
If  intelligent  creatures  inhabited  the  moon,  and  one  of  them  independently 
came  up  with  the  equivalent  of  Aristotelian  physics,  what  would  they  think 
about  objects  coming  to  rest? 

D  Sally  is  on  an  amusement  park  ride  which  begins  with  her  chair  being 
hoisted  straight  up  a  tower  at  a  constant  speed  of  60  miles/hour.  Despite 
stern  warnings  from  her  father  that  he’ll  take  her  home  the  next  time  she 
misbehaves,  she  decides  that  as  a  scientific  experiment  she  really  needs 
to  release  her  corndog  over  the  side  as  she’s  on  the  way  up.  She  does 
not  throw  it.  She  simply  sticks  it  out  of  the  car,  lets  it  go,  and  watches  it 
against  the  background  of  the  sky,  with  no  trees  or  buildings  as  reference 
points.  What  does  the  corndog’s  motion  look  like  as  observed  by  Sally? 
Does  its  speed  ever  appear  to  her  to  be  zero?  What  acceleration  does 
she  observe  it  to  have:  is  it  ever  positive?  negative?  zero?  What  would 
her  enraged  father  answer  if  asked  for  a  similar  description  of  its  motion 
as  it  appears  to  him,  standing  on  the  ground? 

1.3.1  Applications  of  calculus 

Let’s  see  how  this  relates  to  calculus.  If  an  object  is  moving  in 
one  dimension,  we  can  describe  its  position  with  a  function  x(t).  The 
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derivative  v  =  dxj  dt  is  called  the  velocity,  and  the  second  derivative 
a  =  dv/  dt  =  d2x/  dt2  is  the  acceleration.  Galilean  relativity  tells  us 
that  there  is  no  detectable  effect  due  to  an  object’s  absolute  velocity, 
since  in  some  other  frame  of  reference,  the  object’s  velocity  might 
be  zero.  However,  an  acceleration  does  have  physical  consequences. 


i  /  This  Air  Force  doctor  volun¬ 
teered  to  ride  a  rocket  sled  as  a 
medical  experiment.  The  obvious 
effects  on  his  head  and  face  are 
not  because  of  the  sled’s  speed 
but  because  of  its  rapid  changes 
in  speed:  increasing  in  (ii)  and 
(iii),  and  decreasing  in  (v)  and  (vi). 
In  (iv)  his  speed  is  greatest,  but 
because  his  speed  is  not  increas¬ 
ing  or  decreasing  very  much  at 
this  moment,  there  is  little  effect 
on  him.  (U.S.  Air  Force) 


Observers  in  different  inertial  frames  of  reference  will  disagree 
on  velocities,  but  agree  on  accelerations.  Let’s  keep  it  simple  by 
continuing  to  work  in  one  dimension.  One  frame  of  reference  uses  a 
coordinate  system  x\ ,  and  the  other  we  label  x^-  If  the  positive  x\ 
and  X2  axes  point  in  the  same  direction,  then  in  general  two  inertial 
frames  could  be  related  by  an  equation  of  the  form  X2  =  x\  +  b  +  ut, 
where  u  is  the  constant  velocity  of  one  frame  relative  to  the  other, 
and  the  constant  b  tells  us  how  far  apart  the  origins  of  the  two 
coordinate  systems  were  at  t  =  0.  The  velocities  are  different  in  the 
two  frames  of  reference: 


dX2 

dt 


dx’i 

=  lF  +  '“’ 


Suppose,  for  example,  frame  1  is  defined  from  the  sidewalk,  and 
frame  2  is  fixed  to  a  float  in  a  parade  that  is  moving  to  our  left  at 
a  velocity  u  =  1  m/s.  A  dog  that  is  moving  to  the  right  with  a 
velocity  v\  =  dx\/  dt  =  3  m/s  in  the  sidewalk’s  frame  will  appear 
to  be  moving  at  a  velocity  of  V2  =  dx,2/  dt  =  dx  \  /  dt  +  u  =  4  m/s  in 
the  float’s  frame. 


For  acceleration,  however,  we  have 


<#  X2  _  <f  X\ 

dt2  =  dt2  ’ 


Section  1.3  Galilean  Relativity 
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Self-check  D. 


since  the  derivative  of  the  constant  u  is  zero.  Thus  an  accelera¬ 
tion,  unlike  a  velocity,  can  have  a  definite  physical  significance  to  all 
observers  in  all  frames  of  reference.  If  this  wasn’t  true,  then  there 
would  be  no  particular  reason  to  define  a  quantity  called  acceleration 
in  the  first  place. 

self-check  D 

The  figure  shows  a  bottle  of  beer  sitting  on  a  table  in  the  dining  car  of  a 
train.  Does  the  tilting  of  the  surface  tell  us  about  the  train’s  velocity,  or 
its  acceleration?  What  would  a  person  in  the  train  say  about  the  bottle’s 
velocity?  What  about  a  person  standing  in  a  field  outside  and  looking  in 
through  the  window?  What  about  the  acceleration?  >  Answer,  p.  961 

1.4  A  Preview  of  Some  Modern  Physics 

“Mommy,  why  do  you  and  Daddy  have  to  go  to  work?”  “To  make 
money,  sweetie-pie.”  “Why  do  we  need  money?”  “To  buy  food.” 
“Why  does  food  cost  money?”  When  small  children  ask  a  chain 
of  “why”  questions  like  this,  it  usually  isn’t  too  long  before  their 
parents  end  up  saying  something  like,  “Because  that’s  just  the  way 
it  is,”  or,  more  honestly,  “I  don’t  know  the  answer.” 

The  same  happens  in  physics.  We  may  gradually  learn  to  ex¬ 
plain  things  more  and  more  deeply,  but  there’s  always  the  possibility 
that  a  certain  observed  fact,  such  as  conservation  of  mass,  will  never 
be  understood  on  any  deeper  level.  Science,  after  all,  uses  limited 
methods  to  achieve  limited  goals,  so  the  ultimate  reason  for  all  exis¬ 
tence  will  always  be  the  province  of  religion.  There  is,  however,  an 
appealing  explanation  for  conservation  of  mass,  which  is  atomism, 
the  theory  that  matter  is  made  of  tiny,  unchanging  particles.  The 
atomic  hypothesis  dates  back  to  ancient  Greece,  but  the  first  solid 
evidence  to  support  it  didn’t  come  until  around  the  eighteenth  cen¬ 
tury,  and  individual  atoms  were  never  detected  until  about  1900. 
The  atomic  theory  implies  not  only  conservation  of  mass,  but  a 
couple  of  other  things  as  well. 

First,  it  implies  that  the  total  mass  of  one  particular  element 
is  conserved.  For  instance,  lead  and  gold  are  both  elements,  and  if 
we  assume  that  lead  atoms  can’t  be  turned  into  gold  atoms,  then 
the  total  mass  of  lead  and  the  total  mass  of  gold  are  separately 
conserved.  It’s  as  though  there  was  not  just  a  law  against  pickpock¬ 
eting,  but  also  a  law  against  surreptitiously  moving  money  from 
one  of  the  victim’s  pockets  to  the  other.  It  turns  out,  however,  that 
although  chemical  reactions  never  change  one  type  of  atom  into  an¬ 
other,  transmutation  can  happen  in  nuclear  reactions,  such  as  the 
ones  that  created  most  of  the  elements  in  your  body  out  of  the  pri¬ 
mordial  hydrogen  and  helium  that  condensed  out  of  the  aftermath 
of  the  Big  Bang. 

Second,  atomism  implies  that  mass  is  quantized ,  meaning  that 
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only  certain  values  of  mass  are  possible  and  the  ones  in  between 
can’t  exist.  We  can  have  three  atoms  of  gold  or  four  atoms  of  gold, 
but  not  three  an  a  half.  Although  quantization  of  mass  is  a  natural 
consequence  of  any  theory  in  which  matter  is  made  up  of  tiny  parti¬ 
cles,  it  was  discovered  in  the  twentieth  century  that  other  quantities, 
such  as  energy,  are  quantized  as  well,  which  had  previously  not  been 
suspected. 

self-check  E 

Is  money  quantized?  >  Answer,  p.  961 

If  atomism  is  starting  to  make  conservation  of  mass  seem  in¬ 
evitable  to  you,  then  it  may  disturb  you  to  know  that  Einstein 
discovered  it  isn’t  really  conserved.  If  you  put  a  50-gram  iron  nail 
in  some  water,  seal  the  whole  thing  up,  and  let  it  sit  on  a  fantasti¬ 
cally  precise  balance  while  the  nail  rusts,  you’ll  find  that  the  system 
loses  about  6  x  10-12  kg  of  mass  by  the  time  the  nail  has  turned 
completely  to  rust.  This  has  to  do  with  Einstein’s  famous  equa¬ 
tion  E  =  me2.  Rusting  releases  heat  energy,  which  then  escapes 
out  into  the  room.  Einstein’s  equation  states  that  this  amount  of 
heat,  E,  is  equivalent  to  a  certain  amount  of  mass,  m.  The  c  in 
the  c2  is  the  speed  of  light,  which  is  a  large  number,  and  a  large 
amount  of  energy  is  therefore  equivalent  to  a  very  small  amount  of 
mass,  so  you  don’t  notice  nonconservation  of  mass  under  ordinary 
conditions.  What  is  really  conserved  is  not  the  mass,  m,  but  the 
mass-plus-energy,  E  +  me2.  The  point  of  this  discussion  is  not  to 
get  you  to  do  numerical  exercises  with  E  =  me2  (at  this  point  you 
don’t  even  know  what  units  are  used  to  measure  energy),  but  simply 
to  point  out  to  you  the  empirical  nature  of  the  laws  of  physics.  If  a 
previously  accepted  theory  is  contradicted  by  an  experiment,  then 
the  theory  needs  to  be  changed.  This  is  also  a  good  example  of 
something  called  the  correspondence  principle ,  which  is  a  historical 
observation  about  how  scientific  theories  change:  when  a  new  scien¬ 
tific  theory  replaces  an  old  one,  the  old  theory  is  always  contained 
within  the  new  one  as  an  approximation  that  works  within  a  certain 
restricted  range  of  situations.  Conservation  of  mass  is  an  extremely 
good  approximation  for  all  chemical  reactions,  since  chemical  reac¬ 
tions  never  release  or  consume  enough  energy  to  change  the  total 
mass  by  a  large  percentage.  Conservation  of  mass  would  not  have 
been  accepted  for  110  years  as  a  fundamental  principle  of  physics 
if  it  hadn’t  been  verified  over  and  over  again  by  a  huge  number  of 
accurate  experiments. 

This  chapter  is  summarized  on  page  989.  Notation  and  terminology 
are  tabulated  on  pages  984-985. 
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The  symbols  \/,  ^ ,  etc.  are  explained  on  page  72. 

1  Thermometers  normally  use  either  mercury  or  alcohol  as  their 
working  fluid.  If  the  level  of  the  fluid  rises  or  falls,  does  this  violate 
conservation  of  mass? 

2  The  ratios  of  the  masses  of  different  types  of  atoms  were 

determined  a  century  before  anyone  knew  any  actual  atomic  masses 
in  units  of  kg.  One  finds,  for  example,  that  when  ordinary  table 
salt,  NaCl,  is  melted,  the  chlorine  atoms  bubble  off  as  a  gas,  leaving 
liquid  sodium  metal.  Suppose  the  chlorine  escapes,  so  that  its  mass 
cannot  be  directly  determined  by  weighing.  Experiments  show  that 
when  1.00000  kg  of  NaCl  is  treated  in  this  way,  the  mass  of  the 
remaining  sodium  metal  is  0.39337  kg.  Based  on  this  information, 
determine  the  ratio  of  the  mass  of  a  chlorine  atom  to  that  of  a 
sodium  atom.  v  ■ 

3  An  atom  of  the  most  common  naturally  occurring  uranium 
isotope  breaks  up  spontaneously  into  a  thorium  atom  plus  a  helium 
atom.  The  masses  are  as  follows: 

uranium  3.95292849  x  1CT25  kg 
thorium  3.88638748  x  10-25  kg 
helium  6.646481  x  10~2'  kg 

Each  of  these  experimentally  determined  masses  is  uncertain  in  its 
last  decimal  place.  Is  mass  conserved  in  this  process  to  within  the 
accuracy  of  the  experimental  data?  How  would  you  interpret  this? 


Problem  6. 


4  If  two  spherical  water  droplets  of  radius  b  combine  to  make  a 
single  droplet,  what  is  its  radius?  (Assume  that  water  has  constant 
density.) 

5  Make  up  an  experiment  that  would  test  whether  mass  is  con¬ 
served  in  an  animal’s  metabolic  processes. 

6  The  figure  shows  a  hydraulic  jack.  What  is  the  relationship 
between  the  distance  traveled  by  the  plunger  and  the  distance  trav¬ 
eled  by  the  object  being  lifted,  in  terms  of  the  cross-sectional  areas? 

■ 

7  In  an  example  in  this  chapter,  I  argued  that  a  stream  of  water 

must  change  its  cross-sectional  area  as  it  rises  or  falls.  Suppose  that 
the  stream  of  water  is  confined  to  a  constant-diameter  pipe.  Which 
assumption  breaks  down  in  this  situation?  - 

8  A  river  with  a  certain  width  and  depth  splits  into  two  parts, 
each  of  which  has  the  same  width  and  depth  as  the  original  river. 
What  can  you  say  about  the  speed  of  the  current  after  the  split? 

■ 

9  The  diagram  shows  a  cross-section  of  a  wind  tunnel  of  the 
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kind  used,  for  example,  to  test  designs  of  airplanes.  Under  normal 
conditions  of  use,  the  density  of  the  air  remains  nearly  constant 
throughout  the  whole  wind  tunnel.  How  can  the  speed  of  the  air 
be  controlled  and  calculated?  (Diagram  by  NASA,  Glenn  Research 
Center.) 


View  from  above: 


Drive  Motor 


Test  Section  Model 


10  A  water  wave  is  in  a  tank  that  extends  horizontally  from 
x  =  0  to  x  =  a,  and  from  z  =  0  to  2  =  b.  We  assume  for  simplicity 
that  at  a  certain  moment  in  time  the  height  y  of  the  water’s  surface 
only  depends  on  x,  not  z,  so  that  we  can  effectively  ignore  the  z 
coordinate.  Under  these  assumptions,  the  total  volume  of  the  water 
in  the  tank  is 

r 

V  =  b  I  y(x)  dx. 

Jo 

Since  the  density  of  the  water  is  essentially  constant,  conservation 
of  mass  requires  that  V  is  always  the  same.  When  the  water  is  calm, 
we  have  y  =  h,  where  h  =  V/ab.  If  two  different  wave  patterns  move 
into  each  other,  we  might  imagine  that  they  would  add  in  the  sense 
that  y  total  —  h  =  (yi~h)  +  (2/2  —  h).  Show  that  this  type  of  addition 
is  consistent  with  conservation  of  mass.  ■ 

11  The  figure  shows  the  position  of  a  falling  ball  at  equal  time 

intervals,  depicted  in  a  certain  frame  of  reference.  On  a  similar  grid, 
show  how  the  ball’s  motion  would  appear  in  a  frame  of  reference  that 
was  moving  horizontally  at  a  speed  of  one  box  per  unit  time  relative 
to  the  first  frame.  ^ 

12  The  figure  shows  the  motion  of  a  point  on  the  rim  of  a  rolling 
wheel.  (The  shape  is  called  a  cycloid.)  Suppose  bug  A  is  riding  on 
the  rim  of  the  wheel  on  a  bicycle  that  is  rolling,  while  bug  B  is  on 
the  spinning  wheel  of  a  bike  that  is  sitting  upside  down  on  the  floor. 
Bug  A  is  moving  along  a  cycloid,  while  bug  B  is  moving  in  a  circle. 
Both  wheels  are  doing  the  same  number  of  revolutions  per  minute. 
Which  bug  has  a  harder  time  holding  on,  or  do  they  find  it  equally 


Problem  1 1 . 


Problem  12. 


Problems 
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difficult? 


>  Solution,  p.  972  - 


Key  to  symbols: 

easy  -  typical  ^  challenging  ■  difficult  ■  very  difficult 
V  An  answer  check  is  available  at  wwwdightandmatter.com. 
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Conservation  of  Energy 


Do  you  pronounce  it  Joule’s  to  rhyme  with  schools, 

Joule’s  to  rhyme  with  Bowls, 
or  Joule’s  to  rhyme  with  Scowls? 

Whatever  you  call  it,  by  Joule’s, 
or  Joule’s, 

or  Joule’s,  it’s  good! 

Advertising  slogan  of  the  Joule  brewery.  The  name,  and  the 
corresponding  unit  of  energy,  are  now  usually  pronounced  so 
as  to  rhyme  with  “school.  ” 

2.1  Energy 

2.1 .1  The  energy  concept 

You’d  probably  like  to  be  able  to  drive  your  car  and  light  your 
apartment  without  having  to  pay  money  for  gas  and  electricity,  and 
if  you  do  a  little  websurfing,  you  can  easily  find  people  who  say 
they  have  the  solution  to  your  problem.  This  kind  of  scam  has  been 
around  for  centuries.  It  used  to  be  known  as  a  perpetual  motion 
machine,  but  nowadays  the  con  artists’  preferred  phrase  is  “free 
energy.”1  A  typical  “free-energy”  machine  would  be  a  sealed  box 
that  heats  your  house  without  needing  to  be  plugged  into  a  wall 
socket  or  a  gas  pipe.  Heat  comes  out,  but  nothing  goes  in,  and  this 
can  go  on  indefinitely.  But  an  interesting  thing  happens  if  you  try 
to  check  on  the  advertised  performance  of  the  machine.  Typically, 
you’ll  find  out  that  either  the  device  is  still  in  development,  or  it’s 
back-ordered  because  so  many  people  have  already  taken  advantage 
of  this  Fantastic  Opportunity!  In  a  few  cases,  the  magic  box  exists, 
but  the  inventor  is  only  willing  to  demonstrate  very  small  levels  of 
heat  output  for  short  periods  of  time,  in  which  case  there’s  probably 
a  tiny  hearing-aid  battery  hidden  in  there  somewhere,  or  some  other 
trick. 

Since  nobody  has  ever  succeeded  in  building  a  device  that  creates 
heat  out  of  nothing,  we  might  also  wonder  whether  any  device  exists 


a  /James  Joule,  1818-1889. 
The  son  of  a  wealthy  brewer, 
Joule  was  tutored  as  a  young 
man  by  the  famous  scientist  John 
Dalton.  Fascinated  by  electricity, 
he  and  his  brother  experimented 
by  giving  electric  shocks  to  each 
other  and  to  the  family’s  servants. 
Joule  ran  the  brewery  as  an  adult, 
and  science  was  merely  a  serious 
hobby.  His  work  on  energy  can 
be  traced  to  his  attempt  to  build 
an  electric  motor  that  would 
replace  steam  engines.  His  ideas 
were  not  accepted  at  first,  partly 
because  they  contradicted  the 
widespread  belief  that  heat  was 
a  fluid,  and  partly  because  they 
depended  on  extremely  precise 
measurements,  which  had  not 
previously  been  common  in 
physics. 


1An  entertaining  account  of  this  form  of  quackery  is  given  in  Voodoo  Sci¬ 
ence:  The  Road  from  Foolishness  to  Fraud,  Robert  Park,  Oxford  Univer¬ 
sity  Press,  2000.  Until  reading  this  book,  I  hadn’t  realized  the  degree  to  which 
pseudoscience  had  penetrated  otherwise  respectable  scientific  organizations  like 
NASA. 
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b  /  Heat  energy  can  be  con¬ 
verted  to  light  energy.  Very  hot 
objects  glow  visibly,  and  even 
objects  that  aren’t  so  hot  give 
off  infrared  light,  a  color  of  light 
that  lies  beyond  the  red  end  of 
the  visible  rainbow.  This  photo 
was  made  with  a  special  camera 
that  records  infrared  light.  The 
man’s  warm  skin  emits  quite  a 
bit  of  infrared  light  energy,  while 
his  hair,  at  a  lower  temperature, 
emits  less. 


that  can  do  the  opposite,  turning  heat  into  nothing.  You  might  think 
that  a  refrigerator  was  such  a  device,  but  actually  your  refrigerator 
doesn’t  destroy  the  heat  in  the  food.  What  it  really  does  is  to  extract 
some  of  the  heat  and  bring  it  out  into  the  room.  That’s  why  it  has 
big  radiator  coils  on  the  back,  which  get  hot  when  it’s  in  operation. 

If  it’s  not  possible  to  destroy  or  create  heat  outright,  then  you 
might  start  to  suspect  that  heat  was  a  conserved  quantity.  This 
would  be  a  successful  rule  for  explaining  certain  processes,  such  as 
the  transfer  of  heat  between  a  cold  Martini  and  a  room-temperature 
olive:  if  the  olive  loses  a  little  heat,  then  the  drink  must  gain  the 
same  amount.  It  would  fail  in  general,  however.  Sunlight  can  heat 
your  skin,  for  example,  and  a  hot  lightbulb  filament  can  cool  off  by 
emitting  light.  Based  on  these  observations,  we  could  revise  our  pro¬ 
posed  conservation  law,  and  say  that  there  is  something  called  heat- 
pluslight,  which  is  conserved.  Even  this,  however,  needs  to  be  gen¬ 
eralized  in  order  to  explain  why  you  can  get  a  painful  burn  playing 
baseball  when  you  slide  into  a  base.  Now  we  could  call  it  heatplus- 
lightplusmotion.  The  word  is  getting  pretty  long,  and  we  haven’t 
even  finished  the  list. 

Rather  than  making  the  word  longer  and  longer,  physicists  have 
hijacked  the  word  “energy”  from  ordinary  usage,  and  give  it  a  new, 
specific  technical  meaning.  Just  as  the  Parisian  platinum-iridium 
kilogram  defines  a  specific  unit  of  mass,  we  need  to  pick  something 
that  defines  a  definite  unit  of  energy.  The  metric  unit  of  energy  is 
the  joule  (J),  and  we’ll  define  it  as  the  amount  of  energy  required 
to  heat  0.24  grams  of  water  from  20  to  21  degrees  Celsius.  (Don’t 
memorize  the  numbers.)2 

Temperature  of  a  mixture  example  1 

>  If  1 .0  kg  of  water  at  20°C  is  mixed  with  4.0  kg  of  water  at  30°C, 
what  is  the  temperature  of  the  mixture? 

>  Let’s  assume  as  an  approximation  that  each  degree  of  tem¬ 
perature  change  corresponds  to  the  same  amount  of  energy.  In 
other  words,  we  assume  AE=mcAT,  regardless  of  whether,  as 
in  the  definition  of  the  joule,  we  have  AT  =  21°C-20°C  or,  as  in 
the  present  example,  some  other  combination  of  initial  and  final 
temperatures.  To  be  consistent  with  the  definition  of  the  joule,  we 
must  have  c  =  (1  J)/(0.24  g)/(1°C)  =  4.2x103  J/kg-°C,  which  is 
referred  to  as  the  specific  heat  of  water. 


2Although  the  definition  refers  to  the  Celsius  scale  of  temperature,  it’s  not 
necessary  to  give  an  operational  definition  of  the  temperature  concept  in  general 
(which  turns  out  to  be  quite  a  tricky  thing  to  do  completely  rigorously);  we 
only  need  to  establish  two  specific  temperatures  that  can  be  reproduced  on  ther¬ 
mometers  that  have  been  calibrated  in  a  standard  way.  Heat  and  temperature 
are  discussed  in  more  detail  in  section  2.4,  and  in  chapter  5.  Conceptually,  heat 
is  a  measure  of  energy,  whereas  temperature  relates  to  how  concentrated  that 
energy  is. 
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Conservation  of  energy  tells  us  A E  =  0,  so 
m-i  cATj  +  m2cAT2  =  0 


or 

A  7j  m2 

AT2  m-\ 

=  -4.0 

If  h  has  to  change  four  times  as  much  as  T2,  and  the  two  fi¬ 
nal  temperatures  are  equal,  then  the  final  temperature  must  be 
28°C. 

Note  how  only  differences  in  temperature  and  energy  appeared 
in  the  preceding  example.  In  other  words,  we  don’t  have  to  make 
any  assumptions  about  whether  there  is  a  temperature  at  which  all 
an  object’s  heat  energy  is  removed.  Historically,  the  energy  and 
temperature  units  were  invented  before  it  was  shown  that  there  is 
such  a  temperature,  called  absolute  zero.  There  is  a  scale  of  tem¬ 
perature,  the  Kelvin  scale,  in  which  the  unit  of  temperature  is  the 
same  as  the  Celsius  degree,  but  the  zero  point  is  defined  as  absolute 
zero.  But  as  long  as  we  only  deal  with  temperature  differences,  it 
doesn’t  matter  whether  we  use  Kelvin  or  Celsius.  Likewise,  as  long 
as  we  deal  with  differences  in  heat  energy,  we  don’t  normally  have 
to  worry  about  the  total  amount  of  heat  energy  the  object  has.  In 
standard  physics  terminology,  “heat”  is  used  only  to  refer  to  differ¬ 
ences,  while  the  total  amount  is  called  the  object’s  “thermal  energy.” 
This  distinction  is  often  ignored  by  scientists  in  casual  speech,  and 
in  this  book  I’ll  usually  use  “heat”  for  either  quantity. 

We’re  defining  energy  by  adding  up  things  from  a  list,  which  we 
lengthen  as  needed:  heat,  light,  motion,  etc.  One  objection  to  this 
approach  is  aesthetic:  physicists  tend  to  regard  complication  as  a 
synonym  for  ugliness.  If  we  have  to  keep  on  adding  more  and  more 
forms  of  energy  to  our  laundry  list,  then  it’s  starting  to  sound  like 
energy  is  distressingly  complicated.  Luckily  it  turns  out  that  energy 
is  simpler  than  it  seems.  Many  forms  of  energy  that  are  apparently 
unrelated  turn  out  to  be  manifestations  of  a  small  number  of  forms 
at  the  atomic  level,  and  this  is  the  topic  of  section  2.4. 

Discussion  Questions 

A  The  ancient  Greek  philosopher  Aristotle  said  that  objects  “naturally” 
tended  to  slow  down,  unless  there  was  something  pushing  on  them  to 
keep  them  moving.  What  important  insight  was  he  missing? 

2.1.2  Logical  issues 

Another  possible  objection  is  that  the  open-ended  approach  to 
defining  energy  might  seem  like  a  kind  of  cheat,  since  we  keep  on 
inventing  new  forms  whenever  we  need  them.  If  a  certain  experi¬ 
ment  seems  to  violate  conservation  of  energy,  can’t  we  just  invent 
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c/As  in  figure  b,  an  infrared 
camera  distinguishes  hot  and 
cold  areas.  As  the  bike  skids  to 
a  stop  with  its  brakes  locked,  the 
kinetic  energy  of  the  bike  and 
rider  is  converted  into  heat  in 
both  the  floor  (top)  and  the  tire 
(bottom). 


a  new  form  of  invisible  “mystery  energy”  that  patches  things  up? 
This  would  be  like  balancing  your  checkbook  by  putting  in  a  fake 
transaction  that  makes  your  calculation  of  the  balance  agree  with 
your  bank’s.  If  we  could  fudge  this  way,  then  conservation  of  energy 
would  be  untestable  —  impossible  to  prove  or  disprove. 

Actually  all  scientific  theories  are  improvable.  A  theory  can 
never  be  proved,  because  the  experiments  can  only  cover  a  finite 
number  out  of  the  infinitely  many  situations  in  which  the  theory  is 
supposed  to  apply.  Even  a  million  experiments  won’t  suffice  to  prove 
it  in  the  same  sense  of  the  word  “proof”  that  is  used  in  mathematics. 
However,  even  one  experiment  that  contradicts  a  theory  is  sufficient 
to  show  that  the  theory  is  wrong.  A  theory  that  is  immune  to 
disproof  is  a  bad  theory,  because  there  is  no  way  to  test  it.  For 
instance,  if  I  say  that  23  is  the  maximum  number  of  angels  that 
can  dance  on  the  head  of  a  pin,  I  haven’t  made  a  properly  falsifiable 
scientific  theory,  since  there’s  no  method  by  which  anyone  could  even 
attempt  to  prove  me  wrong  based  on  observations  or  experiments. 

Conservation  of  energy  is  testable  because  new  forms  of  energy 
are  expected  to  show  regular  mathematical  behavior,  and  are  sup¬ 
posed  to  be  related  in  a  measurable  way  to  observable  phenomena. 
As  an  example,  let’s  see  how  to  extend  the  energy  concept  to  include 
motion. 

2.1.3  Kinetic  energy 

Energy  of  motion  is  called  kinetic  energy.  (The  root  of  the  word 
is  the  same  as  the  word  “cinema”  -  in  French,  kinetic  energy  is 
“energie  cinetique.”)  How  does  an  object’s  kinetic  energy  depend 
on  its  mass  and  velocity?  Joule  attempted  a  conceptually  simple 
experiment  on  his  honeymoon  in  the  French-Swiss  Alps  near  Mt. 
Chamonix,  in  which  he  measured  the  difference  in  temperature  be¬ 
tween  the  top  and  bottom  of  a  waterfall.  The  water  at  the  top  of 
the  falls  has  some  gravitational  energy,  which  isn’t  our  subject  right 
now,  but  as  it  drops,  that  gravitational  energy  is  converted  into  ki¬ 
netic  energy,  and  then  into  heat  energy  due  to  internal  friction  in 
the  churning  pool  at  the  bottom: 


gravitational  energy  — >  kinetic  energy  — >  heat  energy 


In  the  logical  framework  of  this  book’s  presentation  of  energy,  the 
significance  of  the  experiment  is  that  it  provides  a  way  to  find  out 
how  an  object’s  kinetic  energy  depends  on  its  mass  and  velocity.  The 
increase  in  heat  energy  should  equal  the  kinetic  energy  of  the  water 
just  before  impact,  so  in  principle  we  could  measure  the  water’s 
mass,  velocity,  and  kinetic  energy,  and  see  how  they  relate  to  one 
another.3 

3From  Joule’s  point  of  view,  the  point  of  the  experiment  was  different.  At 
that  time,  most  physicists  believed  that  heat  was  a  quantity  that  was  conserved 
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Although  the  story  is  picturesque  and  memorable,  most  books 
that  mention  the  experiment  fail  to  note  that  it  was  a  failure!  The 
problem  was  that  heat  wasn’t  the  only  form  of  energy  being  released. 
In  reality,  the  situation  was  more  like  this: 

gravitational  energy  —akinetic  energy 
— dreat  energy 
+  sound  energy 

+  energy  of  partial  evaporation. 


d/A  simplified  drawing  of 
Joule’s  paddlewheel  experiment. 


The  successful  version  of  the  experiment,  shown  in  figures  d  and 
f,  used  a  paddlewheel  spun  by  a  dropping  weight.  As  with  the 
waterfall  experiment,  this  one  involves  several  types  of  energy,  but 
the  difference  is  that  in  this  case,  they  can  all  be  determined  and 
taken  into  account.  (Joule  even  took  the  precaution  of  putting  a 
screen  between  himself  and  the  can  of  water,  so  that  the  infrared 
light  emitted  by  his  warm  body  wouldn’t  warm  it  up  at  all!)  The 
result4  is 


K  =  -mv1 
2 


[kinetic  energy]. 


Whenever  you  encounter  an  equation  like  this  for  the  first  time, 
you  should  get  in  the  habit  of  interpreting  it.  First  off,  we  can  tell 
that  by  making  the  mass  or  velocity  greater,  we’d  get  more  kinetic 
energy.  That  makes  sense.  Notice,  however,  that  we  have  mass  to 
the  first  power,  but  velocity  to  the  second.  Having  the  whole  thing 
proportional  to  mass  to  the  first  power  is  necessary  on  theoretical 
grounds,  since  energy  is  supposed  to  be  additive.  The  dependence 
on  v 2  couldn’t  have  been  predicted,  but  it  is  sensible.  For  instance, 
suppose  we  reverse  the  direction  of  motion.  This  would  reverse  the 
sign  of  v ,  because  in  one  dimension  we  use  positive  and  negative  signs 
to  indicate  the  direction  of  motion.  But  since  v 2  is  what  appears  in 
the  equation,  the  resulting  kinetic  energy  is  unchanged. 

separately  from  the  rest  of  the  things  to  which  we  now  refer  as  energy,  i.e. , 
mechanical  energy.  Separate  units  of  measurement  had  been  constructed  for 
heat  and  mechanical  of  energy,  but  Joule  was  trying  to  show  that  one  could 
convert  back  and  forth  between  them,  and  that  it  was  actually  their  sum  that 
was  conserved,  if  they  were  both  expressed  in  consistent  units.  His  main  result 
was  the  conversion  factor  that  would  allow  the  two  sets  of  units  to  be  reconciled. 
By  showing  that  the  conversion  factor  came  out  the  same  in  different  types 
of  experiments,  he  was  supporting  his  assertion  that  heat  was  not  separately 
conserved.  From  Joule’s  perspective  or  from  ours,  the  result  is  to  connect  the 
mysterious,  invisible  phenomenon  of  heat  with  forms  of  energy  that  are  visible 
properties  of  objects,  i.e.,  mechanical  energy. 

4If  you’ve  had  a  previous  course  in  physics,  you  may  have  seen  this  presented 
not  as  an  empirical  result  but  as  a  theoretical  one,  derived  from  Newton’s  laws, 
and  in  that  case  you  might  feel  you’re  being  cheated  here.  However,  I’m  go¬ 
ing  to  reverse  that  reasoning  and  derive  Newton’s  laws  from  the  conservation 
laws  in  chapter  3.  From  the  modern  perspective,  conservation  laws  are  more 
fundamental,  because  they  apply  in  cases  where  Newton’s  laws  don’t. 


e/The  heating  of  the  tire 
and  floor  in  figure  c  is  something 
that  the  average  person  might 
have  predicted  in  advance,  but 
there  are  other  situations  where 
it's  not  so  obvious.  When  a 
ball  slams  into  a  wall,  it  doesn’t 
rebound  with  the  same  amount  of 
kinetic  energy.  Was  some  energy 
destroyed?  No.  The  ball  and 
the  wall  heat  up.  These  infrared 
photos  show  a  squash  ball  at 
room  temperature  (top),  and  after 
it  has  been  played  with  for  several 
minutes  (bottom),  causing  it  to 
heat  up  detectably. 
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f/A  realistic  drawing  of  Joule’s  apparatus,  based  on  the  illustration 
in  his  original  paper.  The  paddlewheel  is  sealed  inside  the  can  in  the 
middle.  Joule  wound  up  the  two  13-kg  lead  weights  and  dropped  them 
1.6  meters,  repeating  this  20  times  to  produce  a  temperature  change 
of  only  about  half  a  degree  Fahrenheit  in  the  water  inside  the  sealed 
can.  He  claimed  in  his  paper  to  be  able  to  measure  temperatures  to  an 
accuracy  of  1/200  of  a  degree. 


What  about  the  factor  of  1/2  in  front?  It  comes  out  to  be 
exactly  1/2  by  the  design  of  the  metric  system.  If  we’d  been  using 
the  old-fashioned  British  engineering  system  of  units  (which  is  no 
longer  used  in  the  U.K.),  the  equation  would  have  been  K  =  (7.44  x 
10~2  Btu-  s2/slug-  ft2)mu2.  The  version  of  the  metric  system  called 
the  SI,5  in  which  everything  is  based  on  units  of  kilograms,  meters, 
and  seconds,  not  only  has  the  numerical  constant  equal  to  1/2,  but 
makes  it  unitless  as  well.  In  other  words,  we  can  think  of  the  joule 
as  simply  an  abbreviation,  1  J=1  kg-m2/s2.  More  familiar  examples 
of  this  type  of  abbreviation  are  1  minute=60  s,  and  the  metric  unit 
of  land  area,  1  hectare=10000  nr2. 

Ergs  and  joules  example  2 

t>  There  used  to  be  two  commonly  used  systems  of  metric  units, 
referred  to  as  mks  and  cgs.  The  mks  system,  now  called  the  SI, 
is  based  on  the  meter,  the  kilogram,  and  the  second.  The  cgs 
system,  which  is  now  obsolete,  was  based  on  the  centimeter,  the 
gram,  and  the  second.  In  the  cgs  system,  the  unit  of  energy  is 
not  the  joule  but  the  erg,  1  erg=1  g  •  cm2/s2.  How  many  ergs  are 
in  one  joule? 

>  The  simplest  approach  is  to  treat  the  units  as  if  they  were  alge- 


5Systeme  International 
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bra  symbols. 


1  J  =  1 


kg  •  m 2 


„  kg  •  m2  1000  g 

=  1  - s —  x  — — x 


1  kg 


100  cm 
1  m 


=  107^ 


=  107  erg 


2 


Cabin  air  in  a  jet  airplane  example  3 

>  A  jet  airplane  typically  cruises  at  a  velocity  of  270  m/s.  Outside 
air  is  continuously  pumped  into  the  cabin,  but  must  be  cooled  off 
first,  both  because  (1 )  it  heats  up  due  to  friction  as  it  enters  the  en¬ 
gines,  and  (2)  it  is  heated  as  a  side-effect  of  being  compressed  to 
cabin  pressure.  Calculate  the  increase  in  temperature  due  to  the 
first  effect.  The  specific  heat  of  dry  air  is  about  I.OxlO3  J/kg-°C. 

o  This  is  easiest  to  understand  in  the  frame  of  reference  of  the 
plane,  in  which  the  air  rushing  into  the  engine  is  stopped,  and  its 
kinetic  energy  converted  into  heat.6  Conservation  of  energy  tells 
us 


0  =  A  E 

=  AK  +  AE^eat- 

In  the  plane’s  frame  of  reference,  the  air’s  initial  velocity  is  v,= 270 
m/s,  and  its  final  velocity  is  zero,  so  the  change  in  its  kinetic  en¬ 
ergy  is  negative, 


A  K  =  Kf-  Kj 
=  0  —  (1/2  )mv? 

=  -(1/2  )mvi2. 

Assuming  that  the  specific  heat  of  air  is  roughly  independent  of 
temperature  (which  is  why  the  number  was  stated  with  the  word 
“about”),  we  can  substitute  into  0  =  AK  +  A Eheat,  giving 

1  P 

0  =  — - mvj  +  mcAT 

1  9 

-Vi2  =  CAT. 

Note  how  the  mass  cancels  out.  This  is  a  big  advantage  of  solving 
problems  algebraically  first,  and  waiting  until  the  end  to  plug  in 

6It’s  not  at  all  obvious  that  the  solution  would  work  out  in  the  earth’s  frame  of 
reference,  although  Galilean  relativity  states  that  it  doesn’t  matter  which  frame 
we  use.  Chapter  3  discusses  the  relationship  between  conservation  of  energy  and 
Galilean  relativity. 
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numbers.  With  a  purely  numerical  approach,  we  wouldn’t  even 
have  known  what  value  of  m  to  pick,  or  if  we’d  guessed  a  value 
like  1  kg,  we  wouldn’t  have  known  whether  our  answer  depended 
on  that  guess. 

Solving  for  AT,  and  writing  v  instead  of  v,  for  simplicity,  we  find 


«  40°C. 

The  passengers  would  be  boiled  alive  if  not  for  the  refrigeration. 
The  first  stage  of  cooling  happens  via  heat  exchangers  in  the 
engine  struts,  but  a  second  stage,  using  a  refrigerator  under  the 
floor  of  the  cabin,  is  also  necessary.  Running  this  refrigerator 
uses  up  energy,  cutting  into  the  fuel  efficiency  of  the  airplane, 
which  is  why  typically  only  50%  of  the  cabin’s  air  is  replaced  in 
each  pumping  cycle  of  2-3  minutes.  Although  the  airlines  prefer 
to  emphasize  that  this  is  a  much  faster  recirculation  rate  than  in 
the  ventilation  systems  of  most  buildings,  people  are  packed  more 
tightly  in  an  airplane. 

2.1.4  Power 

Power,  P,  is  defined  as  the  rate  of  change  of  energy,  d E /  dt. 
Power  thus  has  units  of  joules  per  second,  which  are  usually  abbre¬ 
viated  as  watts,  1  W=1  J/s.  Since  energy  is  conserved,  we  would 
have  dE/  dt  =  0  if  E  was  the  total  energy  of  a  closed  system,  and 
that’s  not  very  interesting.  What’s  usually  more  interesting  to  dis¬ 
cuss  is  either  the  power  flowing  in  or  out  of  an  open  system,  or  the 
rate  at  which  energy  is  being  transformed  from  one  form  into  an¬ 
other.  The  following  is  an  example  of  energy  flowing  into  an  open 
system. 

Heating  by  a  lightbulb  example  4 

t>  The  electric  company  bills  you  for  energy  in  units  of  kilowatt- 
hours  (kilowatts  multiplied  by  hours)  rather  than  in  SI  units  of 
joules.  How  many  joules  is  a  kilowatt-hour? 

>  1  kilowatt-hour  =  (1  kW)(1  hour)  =  (1 000  J/s)(3600  s)  =  3.6  MJ. 

Now  here’s  an  example  of  energy  being  transformed  from  one 
form  into  another. 

Human  wattage  example  5 

>  Food  contains  chemical  energy  (discussed  in  more  detail  in  sec¬ 
tion  2.4),  and  for  historical  reasons,  food  energy  is  normally  given 
in  non-SI  units  of  Calories.  One  Calorie  with  a  capital  “C”  equals 
1000  calories,  and  1  calorie  is  defined  as  4.18  J.  A  typical  person 
consumes  2000  Calories  of  food  in  a  day,  and  converts  nearly  all 
of  that  directly  to  body  heat.  Compare  the  person’s  heat  produc¬ 
tion  to  the  rate  of  energy  consumption  of  a  100-watt  lightbulb. 

>  Strictly  speaking,  we  can’t  really  compute  the  derivative  d E/  d t, 


80 


Chapter  2  Conservation  of  Energy 


since  we  don’t  know  how  the  person’s  metabolism  ebbs  and  flows 
over  the  course  of  a  day.  What  we  can  really  compute  is  AE/At, 
which  is  the  power  averaged  over  a  one-day  period. 

Converting  to  joules,  we  find  AE  =  8  x  106  J  for  the  amount  of 
energy  transformed  into  heat  within  our  bodies  in  one  day.  Con¬ 
verting  the  time  interval  likewise  into  SI  units,  At  =  9  x  104  s. 
Dividing,  we  find  that  our  power  is  90  J/s  =  90  W,  about  the  same 
as  a  lightbulb. 

2.1.5  Gravitational  energy 

Gravitational  energy,  to  which  I’ve  already  alluded,  is  different 
from  heat  and  kinetic  energy  in  an  important  way.  Heat  and  kinetic 
energy  are  properties  of  a  single  object,  whereas  gravitational  energy 
describes  an  interaction  between  two  objects.  When  the  skater  in 
figures  g  and  h  is  at  the  top,  his  distance  from  the  bulk  of  the  planet 
earth  is  greater.  Since  we  observe  his  kinetic  energy  decreasing  on 
the  way  up,  there  must  be  some  other  form  of  energy  that  is  increas¬ 
ing.  We  invent  a  new  form  of  energy,  called  gravitational  energy,  and 
written  U  or  Ug,  which  depends  on  the  distance  between  his  body 
and  the  planet.  Where  is  this  energy?  It’s  not  in  the  skater’s  body, 
and  it’s  not  inside  the  earth,  either,  since  it  takes  two  to  tango.  If 
either  object  didn’t  exist,  there  wouldn’t  be  any  interaction  or  any 
way  to  measure  a  distance,  so  it  wouldn’t  make  sense  to  talk  about 
a  distance-dependent  energy.  Just  as  marriage  is  a  relationship  be¬ 
tween  two  people,  gravitational  energy  is  a  relationship  between  two 
objects. 

There  is  no  precise  way  to  define  the  distance  between  the  skater 
and  the  earth,  since  both  are  objects  that  have  finite  size.  As  dis¬ 
cussed  in  more  detail  in  section  2.3,  gravity  is  one  of  the  fundamen¬ 
tal  forces  of  nature,  a  universal  attraction  between  any  two  particles 
that  have  mass.  Each  atom  in  the  skater’s  body  is  at  a  definite  dis¬ 
tance  from  each  atom  in  the  earth,  but  each  of  these  distances  is 
different.  An  atom  in  his  foot  is  only  a  few  centimeters  from  some 
of  the  atoms  in  the  plaster  side  of  the  pool,  but  most  of  the  earth’s 
atoms  are  thousands  of  kilometers  away  from  him.  In  theory,  we 
might  have  to  add  up  the  contribution  to  the  gravitational  energy 
for  every  interaction  between  an  atom  in  the  skater’s  body  and  an 
atom  in  the  earth. 

For  our  present  purposes,  however,  there  is  a  far  simpler  and 
more  practical  way  to  solve  problems.  In  any  region  of  the  earth’s 
surface,  there  is  a  direction  called  “down,”  which  we  can  establish 
by  dropping  a  rock  or  hanging  a  plumb  bob.  In  figure  h,  the  skater  is 
moving  up  and  down  in  one  dimension,  and  if  we  did  measurements 
of  his  kinetic  energy,  like  the  made-up  data  in  the  figure,  we  could 
infer  his  gravitational  energy.  As  long  as  we  stay  within  a  relatively 
small  range  of  heights,  we  find  that  an  object’s  gravitational  energy 
increases  at  a  steady  rate  with  height.  In  other  words,  the  strength 


g  /  A  skateboarder  rises  to 
the  edge  of  an  empty  pool  and 
then  falls  back  down. 


j/\Y  11=3000  J 


K=0 


K=2000 J 


K=3000 J 


h/The  sum  of  kinetic  plus 
gravitational  energy  is  constant. 
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of  gravity  doesn’t  change  much  if  you  only  move  up  or  down  a  few 
meters.  We  also  find  that  the  gravitational  energy  is  proportional 
to  the  mass  of  the  object  we’re  testing.  Writing  y  for  the  height, 
and  g  for  the  overall  constant  of  proportionality,  we  have 

Ug  =  mgy.  [gravitational  energy;  y=height;  only  ac¬ 
curate  within  a  small  range  of  heights] 

The  number  g,  with  units  of  joules  per  kilogram  per  meter,  is  called 
the  gravitational  field.  It  tells  us  the  strength  of  gravity  in  a  certain 
region  of  space.  Near  the  surface  of  our  planet,  it  has  a  value  of 
about  9.8  J/kg-m,  which  is  conveniently  close  to  10  J/kg-rn  for 
rough  calculations. 

Velocity  at  the  bottom  of  a  drop  example  6 

t>  If  the  skater  in  figure  g  drops  3  meters  from  rest,  what  is  his 
velocity  at  the  bottom  of  the  pool? 

>  Starting  from  conservation  of  energy,  we  have 

0  =  A  E 
= AK + AU 
=  Kf  —  Ki  +  Uf-  Ui 

1  2 

=  -mvf  +  mgyf  -  mgy y 
=  ^mvf2  +  mg  Ay, 
so 

v  =  \J —IgAy 

=  v/-(2)(10  J/kg-m)(  -3  m) 

=  8  m/s  (rounded  to  one  sig.  fig.) 


(because  K-,= 0) 
(Ay  <0) 


There  are  a  couple  of  important  things  to  note  about  this  ex¬ 
ample.  First,  we  were  able  to  massage  the  equation  so  that  it  only 
involved  Ay,  rather  than  y  itself.  In  other  words,  we  don’t  need  to 
worry  about  where  y  =  0  is;  any  coordinate  system  will  work,  as 
long  as  the  positive  y  axis  points  up,  not  down.  This  is  no  accident. 
Gravitational  energy  can  always  be  changed  by  adding  a  constant 
onto  it,  with  no  effect  on  the  final  result,  as  long  as  you’re  consistent 
within  a  given  problem. 

The  other  interesting  thing  is  that  the  mass  canceled  out:  even 
if  the  skater  gained  weight  or  strapped  lead  weights  to  himself,  his 
velocity  at  the  bottom  would  still  be  8  m/s.  This  isn’t  an  accident 
either.  This  is  the  same  conclusion  we  reached  in  section  1.2,  based 
on  the  equivalence  of  gravitational  and  inertial  mass.  The  kinetic 
energy  depends  on  the  inertial  mass,  while  gravitational  energy  is 
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related  to  gravitational  mass,  but  since  these  two  masses  are  equal, 
we  were  able  to  use  a  single  symbol,  m,  for  them,  and  cancel  them 
out. 

We  can  see  from  the  equation  v  =  \J—2gAy  that  a  falling  object’s 
velocity  isn’t  constant.  It  increases  as  the  object  drops  farther  and 
farther.  What  about  its  acceleration?  If  we  assume  that  air  friction 
is  negligible,  the  arguments  in  section  1.2  show  that  the  accelera¬ 
tion  can’t  depend  on  the  object’s  mass,  so  there  isn’t  much  else  the 
acceleration  can  depend  on  besides  g.  In  fact,  the  acceleration  of  a 
falling  object  equals  —g  (in  a  coordinate  system  where  the  positive 
y  axis  points  up),  as  we  can  easily  show  using  the  chain  rule: 

fdv\  _  /  du  \  /d K\  /dU\  Tdy\ 

=  (— )  (-1  )(mg)(v) 

\  rnv  ) 

=  ~9, 

where  I’ve  calculated  dv/  dK  as  l/(dA"/  du),  and  d K/  dU  =  —  1  can 
be  found  by  differentiating  K+U  =  (constant)  to  give  dK+dU  =  0. ' 

We  can  also  check  that  the  units  of  g ,  J/kg-rn,  are  equivalent  to 
the  units  of  acceleration, 

J  kg-m2/s2  m 

kg-rn  kg-rn  s2  ’ 

and  therefore  the  strength  of  the  gravitational  field  near  the  earth’s 
surface  can  just  as  well  be  stated  as  10  m/s2. 

Speed  after  a  given  time  example  7 

>  An  object  falls  from  rest.  How  fast  is  it  moving  after  two  sec¬ 
onds?  Assume  that  the  amount  of  energy  converted  to  heat  by 
air  friction  is  negligible. 

>  Under  the  stated  assumption,  we  have  a  =  -g,  which  can  be 
integrated  to  give  v  =  - gt  +  constant.  If  we  let  f  =  0  be  the 
beginning  of  the  fall,  then  the  constant  of  integration  is  zero,  so  at 
t  =  2  s  we  have  v  =  - gt  =-(10  m/s2)  x  (2  s)  =  20  m/s. 

The  Vomit  Comet  example  8 

>  The  U.S.  Air  Force  has  an  airplane,  affectionately  known  as  the 
Vomit  Comet,  in  which  astronaut  trainees  can  experience  sim¬ 
ulated  weightlessness.  Oversimplifying  a  little,  imagine  that  the 
plane  climbs  up  high,  and  then  drops  straight  down  like  a  rock. 
(It  actually  flies  along  a  parabola.)  Since  the  people  are  falling 
with  the  same  acceleration  as  the  plane,  the  sensation  is  just  like 
what  you’d  experience  if  you  went  out  of  the  earth’s  gravitational 

7There  is  a  mathematical  loophole  in  this  argument  that  would  allow  the 
object  to  hover  for  a  while  with  zero  velocity  and  zero  acceleration.  This  point 
is  discussed  on  page  950. 
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field.  If  the  plane  can  start  from  10  km  up,  what  is  the  maximum 
amount  of  time  for  which  the  dive  can  last? 

>  Based  on  data  about  acceleration  and  distance,  we  want  to 
find  time.  Acceleration  is  the  second  derivative  of  distance,  so  if 
we  integrate  the  acceleration  twice  with  respect  to  time,  we  can 
find  how  position  relates  to  time.  For  convenience,  let’s  pick  a 
coordinate  system  in  which  the  positive  y  axis  is  down,  so  a=g 
instead  of  -g. 

a  =  g 

v  =  gt  +  constant  (integrating) 

=  gt  (starts  from  rest) 

y  =  ^ gt 2  +  constant  (integrating  again) 

Choosing  our  coordinate  system  to  have  y  =  0  at  t  =  0,  we  can 
make  the  second  constant  of  integration  equal  zero  as  well,  so 


and  roll  from  A  to  B  by  different 
paths. 


=  V2000  s2 

=  40s  (to  one  sig.  fig.) 

Note  that  if  we  hadn’t  converted  the  altitude  to  units  of  meters, 
we  would  have  gotten  the  wrong  answer,  but  we  would  have  been 
alerted  to  the  problem  because  the  units  inside  the  square  root 
wouldn’t  have  come  out  to  be  s2.  In  general,  it’s  a  good  idea  to 
convert  all  your  data  into  SI  (meter-kilogram-second)  units  before 
you  do  anything  with  them. 

High  road,  low  road  example  9 

>  In  figure  i,  what  can  you  say  based  on  conservation  of  energy 
about  the  speeds  of  the  balls  when  the  reach  point  B?  What  does 
conservation  of  energy  tell  you  about  which  ball  will  get  there 
first?  Assume  friction  doesn’t  convert  any  mechanical  energy  to 
heat  or  sound  energy. 

>  Since  friction  is  assumed  to  be  negligible,  there  are  only  two 
forms  of  energy  involved:  kinetic  and  gravitational.  Since  both 
balls  start  from  rest,  and  both  lose  the  same  amount  of  gravi¬ 
tational  energy,  they  must  have  the  same  kinetic  energy  at  the 
end,  and  therefore  they’re  rolling  at  the  same  speed  when  they 
reach  B.  (A  subtle  point  is  that  the  balls  have  kinetic  energy  both 
because  they’re  moving  through  space  and  because  they’re  spin¬ 
ning  as  they  roll.  These  two  types  of  energy  must  be  in  fixed 
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proportion  to  one  another,  so  this  has  no  effect  on  the  conclu¬ 
sion.) 

Conservation  of  energy  does  not,  however,  tell  us  anything  obvi¬ 
ous  about  which  ball  gets  there  first.  This  is  a  general  problem 
with  applying  conservation  laws:  conservation  laws  don’t  refer  di¬ 
rectly  to  time,  since  they  are  statements  that  something  stays  the 
same  at  all  moments  in  time.  We  expect  on  intuitive  grounds  that 
the  ball  that  goes  by  the  lower  ramp  gets  to  B  first,  since  it  builds 
up  speed  early  on. 

Buoyancy  example  1 0 

>  A  cubical  box  with  mass  m  and  volume  V  =  b3  is  submerged 
in  a  fluid  of  density  p.  How  much  energy  is  required  to  raise  it 
through  a  height  Ay? 

>  As  the  box  moves  up,  it  invades  a  volume  V  =  b2Ay  previously 
occupied  by  some  of  the  fluid,  and  fluid  flows  into  an  equal  volume 
that  it  has  vacated  on  the  bottom.  Lowering  this  amount  of  fluid 
by  a  height  b  reduces  the  fluid’s  gravitational  energy  by  p  V'gb  = 
pgb3Ay,  so  the  net  change  in  energy  is 

AE  =  mg  Ay  -  pgb3Ay 
=  {m-  pV)gAy. 


In  other  words,  it’s  as  if  the  mass  of  the  box  had  been  reduced  by 
an  amount  equal  to  the  fluid  that  otherwise  would  have  occupied 
that  volume.  This  is  known  as  Archimedes’  principle,  and  it  is 
true  even  if  the  box  is  not  a  cube,  although  we’ll  defer  the  more 
general  proof  until  page  203  in  chapter  3.  If  the  box  is  less  dense 
than  the  fluid,  then  it  will  float. 

A  simple  machine  example  1 1 

>  If  the  father  and  son  on  the  seesaw  in  figure  k  start  from  rest, 
what  will  happen? 

o  Note  that  although  the  father  is  twice  as  massive,  he  is  at  half 
the  distance  from  the  fulcrum.  If  the  seesaw  was  going  to  start 
rotating,  it  would  have  to  be  losing  gravitational  energy  in  order  to 
gain  some  kinetic  energy.  However,  there  is  no  way  for  it  to  gain 
or  lose  gravitational  energy  by  rotating  in  either  direction.  The 
change  in  gravitational  energy  would  be 


A  U  =  AUi  +  A  U2 

=  g(m:Ayi  +  m2Ay2), 


but  Ay-i  and  Ay2  have  opposite  signs  and  are  in  the  proportion  of 
two  to  one,  since  the  son  moves  along  a  circular  arc  that  covers 
the  same  angle  as  the  father’s  but  has  half  the  radius.  Therefore 
A U  =  0,  and  there  is  no  way  for  the  seesaw  to  trade  gravitational 
energy  for  kinetic. 


j  /  How  much  energy  is  re¬ 
quired  to  raise  the  submerged 
box  through  a  height  Ay? 


k  /  A  seesaw. 


I /The  biceps  muscle  is  a 
reversed  lever. 
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o 

B 

m  /  Discussion  question  C. 


n  /  A  hydraulic  jack. 


o  /  The  surfaces  are  friction¬ 
less.  The  black  blocks  are  in 
equilibrium. 


The  seesaw  example  demonstrates  the  principle  of  the  lever, 
which  is  one  of  the  basic  mechanical  building  blocks  known  as  sim¬ 
ple  machines.  As  discussed  in  more  detail  in  chapters  3  and  4,  the 
principle  applies  even  when  the  interactions  involved  aren’t  gravita¬ 
tional. 

Note  that  although  a  lever  makes  it  easier  to  lift  a  heavy  weight, 
it  also  decreases  the  distance  traveled  by  the  load.  By  reversing  the 
lever,  we  can  make  the  load  travel  a  greater  distance,  at  the  expense 
of  increasing  the  amount  of  force  required.  The  human  muscular- 
skeletal  system  uses  reversed  levers  of  this  kind,  which  allows  us  to 
move  more  rapidly,  and  also  makes  our  bodies  more  compact,  at  the 
expense  of  brute  strength.  A  piano  uses  reversed  levers  so  that  a 
small  amount  of  motion  of  the  key  produces  a  longer  swing  of  the 
hammer.  Another  interesting  example  is  the  hydraulic  jack  shown 
in  figure  n.  The  analysis  in  terms  of  gravitational  energy  is  exactly 
the  same  as  for  the  seesaw,  except  that  the  relationship  between  Ayi 
and  Aj/2  is  now  determined  not  by  geometry  but  by  conservation 
of  mass:  since  water  is  highly  incompressible,  conservation  of  mass 
is  approximately  the  same  as  a  requirement  of  constant  volume, 
which  can  only  be  satisfied  if  the  distance  traveled  by  each  piston  is 
in  inverse  proportion  to  its  cross-sectional  area. 

Discussion  Questions 

A  Hydroelectric  power  (water  flowing  over  a  dam  to  spin  turbines) 
appears  to  be  completely  free.  Does  this  violate  conservation  of  energy? 
If  not,  then  what  is  the  ultimate  source  of  the  electrical  energy  produced 
by  a  hydroelectric  plant? 

B  You  throw  a  steel  ball  up  in  the  air.  How  can  you  prove  based  on 
conservation  of  energy  that  it  has  the  same  speed  when  it  falls  back  into 
your  hand?  What  if  you  threw  a  feather  up?  Is  energy  not  conserved  in 
this  case? 

C  Figure  m  shows  a  pendulum  that  is  released  at  A  and  caught  by  a 
peg  as  it  passes  through  the  vertical,  B.  To  what  height  will  the  bob  rise 
on  the  right? 

D  What  is  wrong  with  the  following  definitions  of  g? 

(a)  “g  is  gravity.” 

(b)  “g  is  the  speed  of  a  falling  object.” 

(c)  “g  is  how  hard  gravity  pulls  on  things.” 

2.1.6  Equilibrium  and  stability 

The  seesaw  in  figure  k  is  in  equilibrium,  meaning  that  if  it  starts 
out  being  at  rest,  it  will  stay  put.  This  is  known  as  a  neutral  equi¬ 
librium,  since  the  seesaw  has  no  preferred  position  to  which  it  will 
return  if  we  disturb  it.  If  we  move  it  to  a  different  position  and 
release  it,  it  will  stay  at  rest  there  as  well.  If  we  put  it  in  motion,  it 
will  simply  continue  in  motion  until  one  person’s  feet  hit  the  ground. 

Most  objects  around  you  are  in  stable  equilibria,  like  the  black 
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block  in  figure  o/3.  Even  if  the  block  is  moved  or  set  in  motion,  it 
will  oscillate  about  the  equilibrium  position.  The  pictures  are  like 
graphs  of  y  versus  x,  but  since  the  gravitational  energy  U  =  mgy 
is  proportional  to  y,  we  can  just  as  well  think  of  them  as  graphs 
of  U  versus  x.  The  block’s  stable  equilibrium  position  is  where 
the  function  U(x)  has  a  local  minimum.  The  book  you’re  reading 
right  now  is  in  equilibrium,  but  gravitational  energy  isn’t  the  only 
form  of  energy  involved.  To  move  it  upward,  we’d  have  to  supply 
gravitational  energy,  but  downward  motion  would  require  a  different 
kind  of  energy,  in  order  to  compress  the  table  more.  (As  we’ll  see 
in  section  2.4,  this  is  electrical  energy  due  to  interactions  between 
atoms  within  the  table.) 

A  differentiable  function’s  local  extrema  occur  where  its  deriva¬ 
tive  is  zero.  A  position  where  d U /  dx  is  zero  can  be  a  stable  (3), 
neutral  (2),  or  unstable  equilibrium,  (4).  An  unstable  equilibrium 
is  like  a  pencil  balanced  on  its  tip.  Although  it  could  theoretically 
remain  balanced  there  forever,  in  reality  it  will  topple  due  to  any 
tiny  perturbation,  such  as  an  air  current  or  a  vibration  from  a  pass¬ 
ing  truck.  This  is  a  technical,  mathematical  definition  of  instability, 
which  is  more  restrictive  than  the  way  the  word  is  used  in  ordinary 
speech.  Most  people  would  describe  a  domino  standing  upright  as 
being  unstable,  but  in  technical  usage  it  would  be  considered  stable, 
because  a  certain  finite  amount  of  energy  is  required  to  tip  it  over, 
and  perturbations  smaller  than  that  would  only  cause  it  to  oscillate 
around  its  equilibrium  position. 

The  domino  is  also  an  interesting  example  because  it  has  two 
local  minima,  one  in  which  it  is  upright,  and  another  in  which  it  is 
lying  flat.  A  local  minimum  that  is  not  the  global  minimum,  as  in 
figure  o/5,  is  referred  to  as  a  metastable  equilibrium. 


p  /  Example  12. 


A  neutral  equilibrium  example  12 

Figure  p  shows  a  special-purpose  one-block  funicular  railroad 
near  Hill  and  Fourth  Streets  in  Los  Angeles,  California,  used  for 
getting  passengers  up  and  down  a  very  steep  hill.  It  has  two  cars 
attached  to  a  single  loop  of  cable,  arranged  so  that  while  one  car 
goes  up,  the  other  comes  down.  They  pass  each  other  in  the 
middle.  Since  one  car’s  gravitational  energy  is  increasing  while 
the  other’s  is  decreasing,  the  system  is  in  neutral  equilibrium.  If 
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there  were  no  frictional  heating,  exactly  zero  energy  would  be  re¬ 
quired  in  order  to  operate  the  system.  A  similar  counterweighting 
principle  is  used  in  aerial  tramways  in  mountain  resorts,  and  in 
elevators  (with  a  solid  weight,  rather  than  a  second  car,  as  coun¬ 
terweight). 
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Water  in  a  U-shaped  tube  example  13 

o  The  U-shaped  tube  in  figure  q  has  cross-sectional  area  A,  and 
the  density  of  the  water  inside  is  p.  Find  the  gravitational  energy 
as  a  function  of  the  quantity  y  shown  in  the  figure,  and  show  that 
there  is  an  equilibrium  at  y= 0. 

o  The  question  is  a  little  ambiguous,  since  gravitational  energy 
is  only  well  defined  up  to  an  additive  constant.  To  fix  this  con¬ 
stant,  let’s  define  U  to  be  zero  when  y= 0.  The  difference  between 
U(y)  and  1/(0)  is  the  energy  that  would  be  required  to  lift  a  wa¬ 
ter  column  of  height  y  out  of  the  right  side,  and  place  it  above 
the  dashed  line,  on  the  left  side,  raising  it  through  a  height  y. 
This  water  column  has  height  y  and  cross-sectional  area  A,  so 
its  volume  is  Ay,  its  mass  is  p Ay,  and  the  energy  required  is 
mgy=(pAy)gy=pgAy2.  We  then  have  U{y)  =  U( 0)  +  p gAy2  = 
99  Ay2. 

To  find  equilibria,  we  look  for  places  where  the  derivative  dU/dy  = 
2p gAy  equals  0.  As  we’d  expect  intuitively,  the  only  equilibrium 
occurs  at  y=0.  The  second  derivative  test  shows  that  this  is  a 
local  minimum  (not  a  maximum  or  a  point  of  inflection),  so  this  is 
a  stable  equilibrium. 

2.1.7  Predicting  the  direction  of  motion 

Kinetic  energy  doesn’t  depend  on  the  direction  of  motion.  Some¬ 
times  this  is  helpful,  as  in  the  high  road-low  road  example  (p.  84, 
example  9),  where  we  were  able  to  predict  that  the  balls  would  have 
the  same  final  speeds,  even  though  they  followed  different  paths  and 
were  moving  in  different  directions  at  the  end.  In  general,  however, 
the  two  conservation  laws  we’ve  encountered  so  far  aren’t  enough  to 
predict  an  object’s  path  through  space,  for  which  we  need  conserva¬ 
tion  of  momentum  (chapter  3),  and  the  mathematical  technique  of 
vectors.  Before  we  develop  those  ideas  in  their  full  generality,  how¬ 
ever,  it  will  be  helpful  to  do  a  couple  of  simple  examples,  including 
one  that  we’ll  get  a  lot  of  mileage  out  of  in  section  2.3. 

Suppose  we  observe  an  air  hockey  puck  gliding  frictionlessly  to 
the  right  at  a  velocity  v,  and  we  want  to  predict  its  future  motion. 
Since  there  is  no  friction,  no  kinetic  energy  is  converted  to  heat. 
The  only  form  of  energy  involved  is  kinetic  energy,  so  conservation 
of  energy,  A E  =  0,  becomes  simply  A K  =  0.  There’s  no  particular 
reason  for  the  puck  to  do  anything  but  continue  moving  to  the  right 
at  constant  speed,  but  it  would  be  equally  consistent  with  conserva¬ 
tion  of  energy  if  it  spontaneously  decided  to  reverse  its  direction  of 
motion,  changing  its  velocity  to  —v.  Either  way,  we’d  have  A K  =  0. 
There  is,  however,  a  way  to  tell  which  motion  is  physical  and  which 
is  unphysical.  Suppose  we  consider  the  whole  thing  again  in  the 
frame  of  reference  that  is  initially  moving  right  along  with  the  puck. 
In  this  frame,  the  puck  starts  out  with  K  =  0.  What  we  origi¬ 
nally  described  as  a  reversal  of  its  velocity  from  v  to  —  v  is,  in  this 


q  /  Water  in  a  U-shaped  tube. 
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r  /  A  car  drives  over  a  cliff. 


new  frame  of  reference,  a  change  from  zero  velocity  to  —  2v,  which 
would  violate  conservation  of  energy.  In  other  words,  the  physically 
possible  motion  conserves  energy  in  all  frames  of  reference,  but  the 
unphysical  motion  only  conserves  energy  in  one  special  frame  of 
reference. 

For  our  second  example,  we  consider  a  car  driving  off  the  edge 
of  a  cliff  (r).  For  simplicity,  we  assume  that  air  friction  is  negligible, 
so  only  kinetic  and  gravitational  energy  are  involved.  Does  the  car 
follow  trajectory  1,  familiar  from  Road  Runner  cartoons,  trajectory 
2,  a  parabola,  or  3,  a  diagonal  line?  All  three  could  be  consistent 
with  conservation  of  energy,  in  the  ground’s  frame  of  reference.  For 
instance,  the  car  would  have  constant  gravitational  energy  along 
the  initial  horizontal  segment  of  trajectory  1,  so  during  that  time 
it  would  have  to  maintain  constant  kinetic  energy  as  well.  Only  a 
parabola,  however,  is  consistent  with  conservation  of  energy  com¬ 
bined  with  Galilean  relativity.  Consider  the  frame  of  reference  that 
is  moving  horizontally  at  the  same  speed  as  that  with  which  the  car 
went  over  the  edge.  In  this  frame  of  reference,  the  cliff  slides  out 
from  under  the  initially  motionless  car.  The  car  can’t  just  hover 
for  a  while,  so  trajectory  1  is  out.  Repeating  the  same  math  as  in 
example  8  on  p.  83,  we  have 


x*  =  0  ,  y*  =  ^ gt 2 

in  this  frame  of  reference,  where  the  stars  indicate  coordinates  mea¬ 
sured  in  the  moving  frame  of  reference.  These  coordinates  are  re¬ 
lated  to  the  ground- fixed  coordinates  (x,  y)  by  the  equations 


x  =  x*  +  vt  and  y  =  y* , 


where  v  is  the  velocity  of  one  frame  with  respect  to  the  other.  We 
therefore  have 

1  2 

X  =  Vt  ,  y=  -gt  , 

in  our  original  frame  of  reference.  Eliminating  t,  we  can  see  that 
this  has  the  form  of  a  parabola: 


9  2 

V=2^X- 


self-check  A 

What  would  the  car’s  motion  be  like  in  the  *  frame  of  reference  if  it 
followed  trajectory  3?  >  Answer,  p.  961 
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2.2  Numerical  Techniques 

Engineering  majors  are  a  majority  of  the  students  in  the  kind  of 
physics  course  for  which  this  book  is  designed,  so  most  likely  you 
fall  into  that  category.  Although  you  surely  recognize  that  physics 
is  an  important  part  of  your  training,  if  you’ve  had  any  exposure 
to  how  engineers  really  work,  you’re  probably  skeptical  about  the 
flavor  of  problem-solving  taught  in  most  science  courses.  You  real¬ 
ize  that  not  very  many  practical  engineering  calculations  fall  into 
the  narrow  range  of  problems  for  which  an  exact  solution  can  be 
calculated  with  a  piece  of  paper  and  a  sharp  pencil.  Real-life  prob¬ 
lems  are  usually  complicated,  and  typically  they  need  to  be  solved 
by  number-crunching  on  a  computer,  although  we  can  often  gain 
insight  by  working  simple  approximations  that  have  algebraic  solu¬ 
tions.  Not  only  is  numerical  problem-solving  more  useful  in  real  life, 
it’s  also  educational;  as  a  beginning  physics  student,  I  really  only 
felt  like  I  understood  projectile  motion  after  I  had  worked  it  both 
ways,  using  algebra  and  then  a  computer  program.  (This  was  back 
in  the  days  when  64  kilobytes  of  memory  was  considered  a  lot.) 

In  this  section,  we’ll  start  by  seeing  how  to  apply  numerical 
techniques  to  some  simple  problems  for  which  we  know  the  answer  in 
“closed  form,”  i.e.,  a  single  algebraic  expression  without  any  calculus 
or  infinite  sums.  After  that,  we’ll  solve  a  problem  that  would  have 
made  you  world-famous  if  you  could  have  done  it  in  the  seventeenth 
century  using  paper  and  a  quill  pen!  Before  you  continue,  you  should 
read  Appendix  1  on  page  948  that  introduces  you  to  the  Python 
programming  language. 

First  let’s  solve  the  trivial  problem  of  finding  how  much  time  it 
takes  an  object  moving  at  speed  v  to  travel  a  straight-line  distance 
dist.  This  closed-form  answer  is,  of  course,  dist/v,  but  the  point 
is  to  introduce  the  techniques  we  can  use  to  solve  other  problems  of 
this  type.  The  basic  idea  is  to  divide  the  distance  up  into  n  equal 
parts,  and  add  up  the  times  required  to  traverse  all  the  parts.  The 
following  Python  function  does  the  job.  Note  that  you  shouldn’t 
type  in  the  line  numbers  on  the  left,  and  you  don’t  need  to  type 
in  the  comments,  either.  I’ve  omitted  the  prompts  »>  and  ...  in 
order  to  save  space. 

1  import  math 

2  def  timel (dist , v,n) : 

3  x=0 

4  dx  =  dist/n 

5  t=0 

6  for  i  in  range (n): 

7  x  =  x+dx 

8  dt=dx/v 

9  t=t+dt 

10  return  t 
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#  Initialize  the  position. 

#  Divide  dist  into  n  equal  parts. 

#  Initialize  the  time. 

#  Change  x. 

#  time=distance/speed 
#  Keep  track  of  elapsed  time. 


How  long  does  it  take  to  move  1  meter  at  a  constant  speed  of  1  m/s? 
If  we  do  this, 

»>  print (timel (1 . 0 , 1 . 0 , 10) )  #  dist,  v,  n 

0.99999999999999989 

Python  produces  the  expected  answer  by  dividing  the  distance  into 
ten  equal  0.1-meter  segments,  and  adding  up  the  ten  0.1-second 
times  required  to  traverse  each  one.  Since  the  object  moves  at  con¬ 
stant  speed,  it  doesn’t  even  matter  whether  we  set  n  to  10,  1,  or  a 
million: 

»>  print  (timel  (1 . 0 , 1 . 0 , 1) )  #  dist,  v,  n 

1.0 


Now  let’s  do  an  example  where  the  answer  isn’t  obvious  to  people 
who  don’t  know  calculus:  how  long  does  it  take  an  object  to  fall 
through  a  height  h,  starting  from  rest?  We  know  from  example  8 
on  page  83  that  the  exact  answer,  found  using  calculus,  is  y /2h/g. 
Let’s  see  if  we  can  reproduce  that  answer  numerically.  The  main 
difference  between  this  program  and  the  previous  one  is  that  now 
the  velocity  isn’t  constant,  so  we  need  to  update  it  as  we  go  along. 
Conservation  of  energy  gives  mgh  =  (1/2 )mv2 +mgy  for  the  velocity 
v  at  height  y,  so  v  =  —  y/2g(h  —  y ).  (We  choose  the  negative  root 
because  the  object  is  moving  down,  and  our  coordinate  system  has 
the  positive  y  axis  pointing  up.) 
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import  math 
def  time2(h,n) : 

g=9.8  # 

y=h  # 

v=0  # 

dy  =  -h/n  # 

t=0  # 

for  i  in  range(n): 

y  =  y+dy  # 

v  =  -math. sqrt (2*g* (h-y) ) 
dt=dy/v  # 

t=t+dt  # 

return  t 


gravitational  field 
Initialize  the  height. 
Initialize  the  velocity. 
Divide  h  into  n  equal  parts. 
Initialize  the  time. 

Change  y.  (Note  dy<0.) 

#  from  cons,  of  energy 
dy  and  v  are  <0,  so  dt  is  >0 
Keep  track  of  elapsed  time. 


For  h=1.0  m,  the  closed-form  result  is  \J2  ■  1.0  m/9.8  m/s2  =  0.45  s. 
With  the  drop  split  up  into  only  10  equal  height  intervals,  the  nu¬ 
merical  technique  provides  a  pretty  lousy  approximation: 


»>  print (time2 (1 . 0 , 10) )  #  h,  n 

0.35864270709233342 
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But  by  increasing  n  to  ten  thousand,  we  get  an  answer  that’s  as 
close  as  we  need,  given  the  limited  accuracy  of  the  raw  data: 

»>  print  (time2(l .  0 , 10000) )  #  h,  n 

0.44846664060793945 


A  subtle  point  is  that  we  changed  y  in  line  9,  and  then  on  line 
10  we  calculated  v,  which  depends  on  y.  Since  y  is  only  changing 
by  a  ten-thousandth  of  a  meter  with  each  step,  you  might  think 
this  wouldn’t  make  much  of  a  difference,  and  you’d  be  almost  right, 
except  for  one  small  problem:  if  we  swapped  lines  9  and  10,  then 
the  very  first  time  through  the  loop,  we’d  have  v=0,  which  would 
produce  a  division- by-zero  error  when  we  calculated  dt!  Actually 
what  would  make  the  most  sense  would  be  to  calculate  the  velocity 
at  height  y  and  the  velocity  at  height  y+dy  (recalling  that  dy  is 
negative),  average  them  together,  and  use  that  value  of  y  to  calculate 
the  best  estimate  of  the  velocity  between  those  two  points.  Since 
the  acceleration  is  constant  in  the  present  example,  this  modification 
results  in  a  program  that  gives  an  exact  result  even  for  n=l: 

1 
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»>  print  (time3(l .  0 , 1) )  #  h,  n 

0.45175395145262565 


import  math 
def  time3(h,n) : 
g=9.8 
y=h 
v=0 

dy  =  -h/n 
t=0 

for  i  in  range (n): 
y_old  =  y 
y  =  y+dy 

v_old  =  math.sqrt(2*g*(h-y_old)) 
v  =  math. sqrt (2*g* (h-y) ) 
v_avg  =  -(v_old+v)/2. 
dt=dy/v_avg 
t=t+dt 
return  t 


Now  we’re  ready  to  attack  a  problem  that  challenged  the  best 
minds  of  Europe  back  in  the  days  when  there  were  no  computers. 
In  1696,  the  mathematician  Johann  Bernoulli  posed  the  following 
famous  question.  Starting  from  rest,  an  object  slides  frictionlessly 
over  a  curve  joining  the  point  (a,  b)  to  the  point  (0,0).  Of  all  the 
possible  shapes  that  such  a  curve  could  have,  which  one  gets  the 
object  to  its  destination  in  the  least  possible  time,  and  how  much 
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y  (m) 


a  /  Approximations  to  the 

brachistochrone  curve  using 
a  third-order  polynomial  (solid 
line),  and  a  seventh-order  poly¬ 
nomial  (dashed).  The  latter 
only  improves  the  time  by  four 
milliseconds. 


time  does  it  take?  The  optimal  curve  is  called  the  brachistochrone , 
from  the  Greek  “short  time.”  The  solution  to  the  brachistochrone 
problem  evaded  Bernoulli  himself,  as  well  as  Leibniz,  who  had  been 
one  of  the  inventors  of  calculus.  The  English  physicist  Isaac  Newton, 
however,  stayed  up  late  one  night  after  a  day’s  work  running  the 
royal  mint,  and,  according  to  legend,  produced  an  algebraic  solution 
at  four  in  the  morning.  He  then  published  it  anonymously,  but 
Bernoulli  is  said  to  have  remarked  that  when  he  read  it,  he  knew 
instantly  from  the  style  that  it  was  Newton  —  he  could  “tell  the 
lion  from  the  mark  of  his  claw.” 

Rather  than  attempting  an  exact  algebraic  solution,  as  Newton 
did,  we’ll  produce  a  numerical  result  for  the  shape  of  the  curve 
and  the  minimum  time,  in  the  special  case  of  a=1.0  m  and  b= 1.0  m. 
Intuitively,  we  want  to  start  with  a  fairly  steep  drop,  since  any  speed 
we  can  build  up  at  the  start  will  help  us  throughout  the  rest  of  the 
motion.  On  the  other  hand,  it’s  possible  to  go  too  far  with  this  idea: 
if  we  drop  straight  down  for  the  whole  vertical  distance,  and  then 
do  a  right-angle  turn  to  cover  the  horizontal  distance,  the  resulting 
time  of  0.68  s  is  quite  a  bit  longer  than  the  optimal  result,  the  reason 
being  that  the  path  is  unnecessarily  long.  There  are  infinitely  many 
possible  curves  for  which  we  could  calculate  the  time,  but  let’s  look 
at  third-order  polynomials, 

y  =  c\x  +  C2X2  +  C3X3, 

where  we  require  C3  =  (b  —  c±a  —  02a2)  /  a3  in  order  to  make  the  curve 
pass  through  the  point  (a,  b).  The  Python  program,  below,  is  not 
much  different  from  what  we’ve  done  before.  The  function  only  asks 
for  ci  and  C2,  and  calculates  C3  internally  at  line  4.  Since  the  motion 
is  two-dimensional,  we  have  to  calculate  the  distance  between  one 
point  and  the  next  using  the  Pythagorean  theorem,  at  line  16. 


1  import  math 

2  def  timeb(a,b, cl , c2,n) : 

3  g=9.8 

4  c3  =  (b-cl*a-c2*a**2) / (a**3) 

5  x=a 

6  y=b 

7  dx  =  -a/n 

8  t=0 

9  for  i  in  range (n) : 

10  y_old  =  y 

11  x  =  x+dx 

12  y  =  cl*x+c2*x**2+c3*x**3 

13  dy  =  y-y_old 

14  v_old  =  math . sqrt (2*g* (b-y_old) ) 

15  v  =  math. sqrt (2*g*(b-y)) 

16  v_avg  =  (v_old+v)/2. 
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#  Pythagorean  thm. 


17  ds  =  math . sqrt (dx**2+dy**2) 

18  dt=ds/v_avg 

19  t=t+dt 

20  return  t 


As  a  first  guess,  we  could  try  a  straight  diagonal  line,  y  =  x, 
which  corresponds  to  setting  ci  =  1,  and  all  the  other  coefficients  to 
zero.  The  result  is  a  fairly  long  time: 


»>  a=l. 

»>  b=l. 

»>  n=10000 
»>  cl=l. 

»>  c2=0 . 

»>  print(timeb(a,b,cl,c2,n)) 
0.63887656499994161 


What  we  really  need  is  a  curve  that’s  very  steep  on  the  right,  and 
flatter  on  the  left,  so  it  would  actually  make  more  sense  to  try  y  =  .x3: 


»>  cl=0. 

»>  c2=0 . 

»>  print(timeb(a,b,cl,c2,n)) 
0.59458339947087069 


This  is  a  significant  improvement,  and  turns  out  to  be  only  a  hun¬ 
dredth  of  a  second  off  of  the  shortest  possible  time!  It’s  possible, 
although  not  very  educational  or  entertaining,  to  find  better  ap¬ 
proximations  to  the  brachistochrone  curve  by  fiddling  around  with 
the  coefficients  of  the  polynomial  by  hand.  The  real  point  of  this 
discussion  was  to  give  an  example  of  a  nontrivial  problem  that  can 
be  attacked  successfully  with  numerical  techniques.  I  found  the  first 
approximation  shown  in  figure  a, 


y  =  (0.62)®  +  (— 0.93)x2  +  (1.31)®3 


by  using  the  program  listed  in  appendix  2  on  page  951  to  search 
automatically  for  the  optimal  curve.  The  seventh-order  approxima¬ 
tion  shown  in  the  figure  came  from  a  straightforward  extension  of 
the  same  program. 
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2.3  Gravitational  Phenomena 


a /An  ellipse  is  circle  that  has 
been  distorted  by  shrinking  and 
stretching  along  perpendicular 
axes. 


b/An  ellipse  can  be  con¬ 
structed  by  tying  a  string  to  two 
pins  and  drawing  like  this  with  a 
pencil  stretching  the  string  taut. 
Each  pin  constitutes  one  focus  of 
the  ellipse. 


c/lf  the  time  interval  taken 
by  the  planet  to  move  from  P  to  Q 
is  equal  to  the  time  interval  from 
R  to  S,  then  according  to  Kepler’s 
equal-area  law,  the  two  shaded 
areas  are  equal.  The  planet 
is  moving  faster  during  time 
interval  RS  than  it  was  during 
PQ,  because  gravitational  energy 
has  been  transformed  into  kinetic 
energy. 


Cruise  your  radio  dial  today  and  try  to  find  any  popular  song  that 
would  have  been  imaginable  without  Louis  Armstrong.  By  introduc¬ 
ing  solo  improvisation  into  jazz,  Armstrong  took  apart  the  jigsaw 
puzzle  of  popular  music  and  fit  the  pieces  back  together  in  a  dif¬ 
ferent  way.  In  the  same  way,  Newton  reassembled  our  view  of  the 
universe.  Consider  the  titles  of  some  recent  physics  books  written 
for  the  general  reader:  The  God  Particle,  Dreams  of  a  Fi¬ 
nal  Theory.  When  the  subatomic  particle  called  the  neutrino  was 
recently  proven  for  the  first  time  to  have  mass,  specialists  in  cos¬ 
mology  began  discussing  seriously  what  effect  this  would  have  on 
calculations  of  the  evolution  of  the  universe  from  the  Big  Bang  to 
its  present  state.  Without  the  English  physicist  Isaac  Newton,  such 
attempts  at  universal  understanding  would  not  merely  have  seemed 
ambitious,  they  simply  would  not  have  occurred  to  anyone. 

This  section  is  about  Newton’s  theory  of  gravity,  which  he  used 
to  explain  the  motion  of  the  planets  as  they  orbited  the  sun.  Newton 
tosses  off  a  general  treatment  of  motion  in  the  first  20  pages  of  his 
Mathematical  Principles  of  Natural  Philosophy,  and  then 
spends  the  next  130  discussing  the  motion  of  the  planets.  Clearly 
he  saw  this  as  the  crucial  scientific  focus  of  his  work.  Why?  Because 
in  it  he  showed  that  the  same  laws  of  nature  applied  to  the  heavens 
as  to  the  earth,  and  that  the  gravitational  interaction  that  made 
an  apple  fall  was  the  same  as  the  as  the  one  that  kept  the  earth’s 
motion  from  carrying  it  away  from  the  sun. 

2.3.1  Kepler’s  laws 

Newton  wouldn’t  have  been  able  to  figure  out  why  the  planets 
move  the  way  they  do  if  it  hadn’t  been  for  the  astronomer  Tycho 
Brahe  (1546-1601)  and  his  protege  Johannes  Kepler  (1571-1630), 
who  together  came  up  with  the  first  simple  and  accurate  description 
of  how  the  planets  actually  do  move.  The  difficulty  of  their  task  is 
suggested  by  the  figure  below,  which  shows  how  the  relatively  simple 
orbital  motions  of  the  earth  and  Mars  combine  so  that  as  seen  from 
earth  Mars  appears  to  be  staggering  in  loops  like  a  drunken  sailor. 

Brahe,  the  last  of  the  great  naked-eye  astronomers,  collected  ex¬ 
tensive  data  on  the  motions  of  the  planets  over  a  period  of  many 
years,  taking  the  giant  step  from  the  previous  observations’  accuracy 
of  about  10  minutes  of  arc  (10/60  of  a  degree)  to  an  unprecedented 
1  minute.  The  quality  of  his  work  is  all  the  more  remarkable  consid¬ 
ering  that  his  observatory  consisted  of  four  giant  brass  protractors 
mounted  upright  in  his  castle  in  Denmark.  Four  different  observers 
would  simultaneously  measure  the  position  of  a  planet  in  order  to 
check  for  mistakes  and  reduce  random  errors. 

With  Brahe’s  death,  it  fell  to  his  former  assistant  Kepler  to  try 
to  make  some  sense  out  of  the  volumes  of  data.  After  900  pages  of 
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d  /  As  the  earth  and  Mars  revolve 
around  the  sun  at  different  rates, 
the  combined  effect  of  their  mo¬ 
tions  makes  Mars  appear  to  trace 
a  strange,  looped  path  across  the 
background  of  the  distant  stars. 


calculations  and  many  false  starts  and  dead-end  ideas,  Kepler  finally 
synthesized  the  data  into  the  following  three  laws: 

Kepler’s  elliptical  orbit  law:  The  planets  orbit  the  sun  in 
elliptical  orbits  with  the  sun  at  one  focus. 

Kepler’s  equal-area  law:  The  line  connecting  a  planet  to  the 
sun  sweeps  out  equal  areas  in  equal  amounts  of  time. 

Kepler’s  law  of  periods:  The  time  required  for  a  planet  to  orbit 
the  sun,  called  its  period,  T,  is  proportional  to  the  long  axis  of  the 
ellipse  raised  to  the  3/2  power.  The  constant  of  proportionality 
is  the  same  for  all  the  planets. 

Although  the  planets’  orbits  are  ellipses  rather  than  circles,  most 
are  very  close  to  being  circular.  The  earth’s  orbit,  for  instance,  is 
only  flattened  by  1.7%  relative  to  a  circle.  In  the  special  case  of  a 
planet  in  a  circular  orbit,  the  two  foci  (plural  of  “focus”)  coincide 
at  the  center  of  the  circle,  and  Kepler’s  elliptical  orbit  law  thus  says 
that  the  circle  is  centered  on  the  sun.  The  equal-area  law  implies 
that  a  planet  in  a  circular  orbit  moves  around  the  sun  with  constant 
speed.  For  a  circular  orbit,  the  law  of  periods  then  amounts  to  a 
statement  that  the  time  for  one  orbit  is  proportional  to  ?’3/2,  where 
r  is  the  radius.  If  all  the  planets  were  moving  in  their  orbits  at  the 
same  speed,  then  the  time  for  one  orbit  would  simply  depend  on 
the  circumference  of  the  circle,  so  it  would  only  be  proportional  to 
r  to  the  first  power.  The  more  drastic  dependence  on  r3/2  means 
that  the  outer  planets  must  be  moving  more  slowly  than  the  inner 
planets. 

Our  main  focus  in  this  section  will  be  to  use  the  law  of  periods 
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e/A  cannon  fires  cannon¬ 
balls  at  different  velocities,  from 
the  top  of  an  imaginary  mountain 
that  rises  above  the  earth’s  atmo¬ 
sphere.  This  is  almost  the  same 
as  a  figure  Newton  included  in  his 
Mathematical  Principles. 


to  deduce  the  general  equation  for  gravitational  energy.  The  equal- 
area  law  turns  out  to  be  a  statement  on  conservation  of  angular 
momentum,  which  is  discussed  in  chapter  4.  We’ll  demonstrate 
the  elliptical  orbit  law  numerically  in  chapter  3,  and  analytically  in 
chapter  4. 

2.3.2  Circular  orbits 

Kepler’s  laws  say  that  planets  move  along  elliptical  paths  (with 
circles  as  a  special  case),  which  would  seem  to  contradict  the  proof 
on  page  90  that  objects  moving  under  the  influence  of  gravity  have 
parabolic  trajectories.  Kepler  was  right.  The  parabolic  path  was 
really  only  an  approximation,  based  on  the  assumption  that  the 
gravitational  field  is  constant,  and  that  vertical  lines  are  all  parallel. 
In  figure  e,  trajectory  1  is  an  ellipse,  but  it  gets  chopped  off  when 
the  cannonball  hits  the  earth,  and  the  small  piece  of  it  that  is  above 
ground  is  nearly  indistinguishable  from  a  parabola.  Our  goal  is 
to  connect  the  previous  calculation  of  parabolic  trajectories,  y  = 
(g/ 2v2)x2 ,  with  Kepler’s  data  for  planets  orbiting  the  sun  in  nearly 
circular  orbits.  Let’s  start  by  thinking  in  terms  of  an  orbit  that 
circles  the  earth,  like  orbit  2  in  figure  e.  It’s  more  natural  now 
to  choose  a  coordinate  system  with  its  origin  at  the  center  of  the 
earth,  so  the  parabolic  approximation  becomes  y  =  r  —  (g/2v2)x2 , 
where  r  is  the  distance  from  the  center  of  the  earth.  For  small 
values  of  x,  i.e.,  when  the  cannonball  hasn’t  traveled  very  far  from 
the  muzzle  of  the  gun,  the  parabola  is  still  a  good  approximation 
to  the  actual  circular  orbit,  defined  by  the  Pythagorean  theorem, 
r 2  =  x 2  +  y2,  or  y  =  ryjl  —  x2/r2.  For  small  values  of  x,  we  can  use 
the  approximation  \J  1  +  e  ~  l  +  e/2  to  find  y  ss  r— (l/2r)x2.  Setting 
this  equal  to  the  equation  of  the  parabola,  we  have  g/2v2  =  (l/2r), 
or 

v  =  \fffr  [condition  for  a  circular  orbit]. 

Low-earth  orbit  example  14 

To  get  a  feel  for  what  this  all  means,  let’s  calculate  the  velocity 
required  for  a  satellite  in  a  circular  low-earth  orbit.  Real  low-earth- 
orbit  satellites  are  only  a  few  hundred  km  up,  so  for  purposes  of 
rough  estimation  we  can  take  r  to  be  the  radius  of  the  earth,  and 
g  is  not  much  less  than  its  value  on  the  earth’s  surface,  10  m/s2. 
Taking  numerical  data  from  Appendix  5,  we  have 

v  =  VW 

=  ^(10  m/s2)(6.4  x  103  km) 

=  ^(10  m/s2)(6.4  x  106  m) 

=  yj 6.4  x  107  m2/s2 
=  8000  m/s 

(about  twenty  times  the  speed  of  sound). 
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In  one  second,  the  satellite  moves  8000  m  horizontally.  Dur¬ 
ing  this  time,  it  drops  the  same  distance  any  other  object  would: 
about  5  m.  But  a  drop  of  5  m  over  a  horizontal  distance  of  8000 
m  is  just  enough  to  keep  it  at  the  same  altitude  above  the  earth’s 
curved  surface. 

2.3.3  The  sun’s  gravitational  field 

We  can  now  use  the  circular  orbit  condition  v  =  sjgr,  combined 
with  Kepler’s  law  of  periods,  T  oc  r3/2  for  circular  orbits,  to  deter¬ 
mine  how  the  sun’s  gravitational  field  falls  off  with  distance.8  From 
there,  it  will  be  just  a  hop,  skip,  and  a  jump  to  get  to  a  universal 
description  of  gravitational  interactions. 

The  velocity  of  a  planet  in  a  circular  orbit  is  proportional  to 
r/T,  so 


r/T  oc  ^/gr 
r / r3/2  oc  y/gr 
go c  1  /r2 

If  gravity  behaves  systematically,  then  we  can  expect  the  same  to 
be  true  for  the  gravitational  field  created  by  any  object,  not  just  the 
sun. 

There  is  a  subtle  point  here,  which  is  that  so  far,  r  has  just  meant 
the  radius  of  a  circular  orbit,  but  what  we  have  come  up  with  smells 
more  like  an  equation  that  tells  us  the  strength  of  the  gravitational 
field  made  by  some  object  (the  sun)  if  we  know  how  far  we  are  from 
the  object.  In  other  words,  we  could  reinterpret  r  as  the  distance 
from  the  sun. 

2.3.4  Gravitational  energy  in  general 

We  now  want  to  find  an  equation  for  the  gravitational  energy  of 
any  two  masses  that  attract  each  other  from  a  distance  r.  We  assume 
that  r  is  large  enough  compared  to  the  distance  between  the  objects 
so  that  we  don’t  really  have  to  worry  about  whether  r  is  measured 
from  center  to  center  or  in  some  other  way.  This  would  be  a  good 
approximation  for  describing  the  solar  system,  for  example,  since 
the  sun  and  planets  are  small  compared  to  the  distances  between 
them  —  that’s  why  you  see  Venus  (the  “evening  star”)  with  your 
bare  eyes  as  a  dot,  not  a  disk. 

The  equation  we  seek  is  going  to  give  the  gravitational  energy, 
U,  as  a  function  of  mi,  m2,  and  r.  We  already  know  from  expe¬ 
rience  with  gravity  near  the  earth’s  surface  that  U  is  proportional 

8There  is  a  hidden  assumption  here,  which  is  that  the  sun  doesn’t  move. 
Actually  the  sun  wobbles  a  little  because  of  the  planets’  gravitational  interactions 
with  it,  but  the  wobble  is  small  due  to  the  sun’s  large  mass,  so  it’s  a  pretty  good 
approximation  to  assume  the  sun  is  stationary.  Chapter  3  provides  the  tools  to 
analyze  this  sort  of  thing  completely  correctly  —  see  p.  142. 
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to  the  mass  of  the  object  that  interacts  with  the  earth  gravitation¬ 
ally,  so  it  makes  sense  to  assume  the  relationship  is  symmetric:  U  is 
presumably  proportional  to  the  product  mim^.  We  can  no  longer 
assume  AU  oc  A r,  as  in  the  earth’s-surface  equation  AU  =  mgAy, 
since  we  are  trying  to  construct  an  equation  that  would  be  valid 
for  all  values  of  r,  and  g  depends  on  r.  We  can,  however,  con¬ 
sider  an  infinitesimally  small  change  in  distance  dr,  for  which  we’ll 
have  d U  =  777251  dr,  where  g\  is  the  gravitational  field  created  by 
m\.  (We  could  just  as  well  have  written  this  as  d U  =  m\g2  dr,  since 
we’re  not  assuming  either  mass  is  “special”  or  “active.”)  Integrating 
this  equation,  we  have 


Sw= 

u  = 

U  oc 
U  oc 


m25i  dr 


777.2  J  gi  dr 
77717772  [  ~ T  dr 


77717772 


where  we’re  free  to  take  the  constant  of  integration  to  be  equal  to 
zero,  since  gravitational  energy  is  never  a  well-defined  quantity  in 
absolute  terms.  Writing  G  for  the  constant  of  proportionality,  we 
have  the  following  fundamental  description  of  gravitational  interac¬ 
tions: 


r 


f  /  The  gravitational  energy 
U  =  -Gm^m2/r  graphed  as  a 
function  of  r. 


U  = - -  [gravitational  energy  of  two  masses 

r 

separated  by  a  distance  r] 

We’ll  refer  to  this  as  Newton’s  law  of  gravity,  although  in  reality 
he  stated  it  in  an  entirely  different  form,  which  turns  out  to  be 
mathematically  equivalent  to  this  one. 

Let’s  interpret  his  result.  First,  don’t  get  hung  up  on  the  fact 
that  it’s  negative,  since  it’s  only  differences  in  gravitational  energy 
that  have  physical  significance.  The  graph  in  figure  f  could  be  shifted 
up  or  down  without  having  any  physical  effect.  The  slope  of  this 
graph  relates  to  the  strength  of  the  gravitational  field.  For  instance, 
suppose  figure  f  is  a  graph  of  the  gravitational  energy  of  an  asteroid 
interacting  with  the  sun.  If  the  asteroid  drops  straight  toward  the 
sun,  from  A  to  B,  the  decrease  in  gravitational  energy  is  very  small, 
so  it  won’t  speed  up  very  much  during  that  motion.  Points  C  and 
D,  however,  are  in  a  region  where  the  graph’s  slope  is  much  greater. 
As  the  asteroid  moves  from  C  to  D,  it  loses  a  lot  of  gravitational 
energy,  and  therefore  speeds  up  considerably.  This  is  due  to  the 
stronger  gravitational  field. 
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Determining  G  example  15 

The  constant  G  is  not  easy  to  determine,  and  Newton  went  to  his 
grave  without  knowing  an  accurate  value  for  it.  If  we  knew  the 
mass  of  the  earth,  then  we  could  easily  determine  G  from  exper¬ 
iments  with  terrestrial  gravity,  but  the  only  way  to  determine  the 
mass  of  the  earth  accurately  in  units  of  kilograms  is  by  finding  G 
and  reasoning  the  other  way  around!  (If  you  estimate  the  average 
density  of  the  earth,  you  can  make  at  least  a  rough  estimate  of 
G.)  Figures  g  and  h  show  how  G  was  first  measured  by  Flenry 
Cavendish  in  the  nineteenth  century.The  rotating  arm  is  released 
from  rest,  and  the  kinetic  energy  of  the  two  moving  balls  is  mea¬ 
sured  when  they  pass  position  C.  Conservation  of  energy  gives 

GMm  GMm  GMm  GMm 

—2 - 2 - =  —2 - 2 - 1-  2 K, 

r BA  r bd  ^CA  rCD 

where  M  is  the  mass  of  one  of  the  large  balls,  m  is  the  mass  of 
one  of  the  small  ones,  and  the  factors  of  two,  which  will  cancel, 
occur  because  every  energy  is  mirrored  on  the  opposite  side  of 
the  apparatus.  (As  discussed  on  page  1 02,  it  turns  out  that  we  get 
the  right  result  by  measuring  all  the  distances  from  the  center  of 
one  sphere  to  the  center  of  the  other.)  This  can  easily  be  solved 
for  G.  The  best  modern  value  of  G,  from  later  versions  of  the 
same  experiment,  is  6.67  x  1CH1  J  •  m/kg2. 

Escape  velocity  example  16 

>  The  Pioneer  1 0  space  probe  was  launched  in  1 972,  and  contin¬ 
ued  sending  back  signals  for  30  years.  In  the  year  2001 ,  not  long 
before  contact  with  the  probe  was  lost,  it  was  about  1 .2  x  1013  m 
from  the  sun,  and  was  moving  almost  directly  away  from  the  sun 
at  a  velocity  of  1 .21  xIO4  m.  The  mass  of  the  sun  is  1 ,99x1030  kg. 
Will  Pioneer  1 0  escape  permanently,  or  will  it  fall  back  into  the  so¬ 
lar  system? 

o  We  want  to  know  whether  there  will  be  a  point  where  the  probe 
will  turn  around.  If  so,  then  it  will  have  zero  kinetic  energy  at  the 
turnaround  point: 


K;  +  Ui  =  Uf 

1  o  GMm  GMm 

-mv - = - 

2  n  rf 

1  o  GM  GM 

—  V - = - 

2  r j  rf  ’ 

where  M  is  the  mass  of  the  sun,  m  is  the  (irrelevant)  mass  of 
the  probe,  and  rf  is  the  distance  from  the  sun  of  the  hypothetical 
turnaround  point.  Plugging  in  numbers  on  the  left,  we  get  a  posi¬ 
tive  result.  There  can  therefore  be  no  solution,  since  the  right  side 
is  negative.  There  won’t  be  any  turnaround  point,  and  Pioneer  10 
is  never  coming  back. 


g  /  Cavendish’s  original  draw¬ 
ing  of  the  apparatus  for  his 
experiment,  discussed  in  exam¬ 
ple  15.  The  room  was  sealed 
to  exclude  air  currents,  and  the 
motion  was  observed  through 
telescopes  sticking  through  holes 
in  the  walls. 


h/A  simplified  drawing  of  the 
Cavendish  experiment,  viewed 
from  above.  The  rod  with  the  two 
small  masses  on  the  ends  hangs 
from  a  thin  fiber,  and  is  free  to 
rotate. 


Pluto 


i/The  Pioneer  10  space  probe’s 
trajectory  from  1974  to  1992, 
with  circles  marking  its  position 
at  one-year  intervals.  After  its 
1974  slingshot  maneuver  around 
Jupiter,  the  probe’s  motion  was 
determined  almost  exclusively  by 
the  sun’s  gravity. 
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The  minimum  velocity  required  for  this  to  happen  is  called  es¬ 
cape  velocity.  For  speeds  above  escape  velocity,  the  orbits  are 
open-ended  hyperbolas,  rather  than  repeating  elliptical  orbits.  In 
figure  i,  Pioneer’s  hyperbolic  trajectory  becomes  almost  indistin¬ 
guishable  from  a  line  at  large  distances  from  the  sun.  The  motion 
slows  perceptibly  in  the  first  few  years  after  1974,  but  later  the 
speed  becomes  nearly  constant,  as  shown  by  the  nearly  constant 
spacing  of  the  dots. 

The  gravitational  held 

We  got  the  energy  equation  U  =  —Grrixm^/r  by  integrating 
g  oc  1/r2  and  then  inserting  a  constant  of  proportionality  to  make 
the  proportionality  into  an  equation.  The  opposite  of  an  integral  is 
a  derivative,  so  we  can  now  go  backwards  and  insert  a  constant  of 
proportionality  in  g  oc  1/r2  that  will  be  consistent  with  the  energy 
equation: 


dU  =  rri2gi  dr 

1  d  U 

9i  =  —  X“ 
m2  dr 

1  d  /  Grn  1  m2  \ 
m2  dr  \  r  / 

=  -Gmi  Tr  (?) 

Grn  1 
—  r2 

This  kind  of  inverse-square  law  occurs  all  the  time  in  nature.  For 
instance,  if  you  go  twice  as  far  away  from  a  lightbulb,  you  receive 
1/4  as  much  light  from  it,  because  as  the  light  spreads  out,  it  is 
like  an  expanding  sphere,  and  a  sphere  with  twice  the  radius  has 
four  times  the  surface  area.  It’s  like  spreading  the  same  amount  of 
peanut  butter  on  four  pieces  of  bread  instead  of  one  —  we  have  to 
spread  it  thinner. 

Discussion  Questions 

A  A  bowling  ball  interacts  gravitationally  with  the  earth.  Would  it 
make  sense  for  the  gravitational  energy  to  be  inversely  proportional  to  the 
distance  between  their  surfaces  rather  than  their  centers? 


2.3.5  The  shell  theorem 

Newton’s  great  insight  was  that  gravity  near  the  earth’s  surface 
was  the  same  kind  of  interaction  as  the  one  that  kept  the  planets 
from  flying  away  from  the  sun.  He  told  his  niece  that  the  idea 
came  to  him  when  he  saw  an  apple  fall  from  a  tree,  which  made  him 
wonder  whether  the  earth  might  be  affecting  the  apple  and  the  moon 
in  the  same  way.  Up  until  now,  we’ve  generally  been  dealing  with 
gravitational  interactions  between  objects  that  are  small  compared 
to  the  distances  between  them,  but  that  assumption  doesn’t  apply  to 
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the  apple.  A  kilogram  of  dirt  a  few  feet  under  his  garden  in  England 
would  interact  much  more  strongly  with  the  apple  than  a  kilogram 
of  molten  rock  deep  under  Australia,  thousands  of  miles  away.  Also, 
we  know  that  the  earth  has  some  parts  that  are  more  dense,  and 
some  parts  that  are  less  dense.  The  solid  crust,  on  which  we  live, 
is  considerably  less  dense  than  the  molten  rock  on  which  it  floats. 
By  all  rights,  the  computation  of  the  total  gravitational  energy  of 
the  apple  should  be  a  horrendous  mess.  Surprisingly,  it  turns  out  to 
be  fairly  simple  in  the  end.  First,  we  note  that  although  the  earth 
doesn’t  have  the  same  density  throughout,  it  does  have  spherical 
symmetry:  if  we  imagine  dividing  it  up  into  thin  concentric  shells, 
the  density  of  each  shell  is  uniform. 

Second,  it  turns  out  that  a  uniform  spherical  shell  interacts  with 
external  masses  as  if  all  its  mass  were  concentrated  at  its  center. 


The  shell  theorem:  The  gravitational  energy  of  a  uniform  spher¬ 
ical  shell  of  mass  M  interacting  with  a  pointlike  mass  m  outside  it 
equals  — GMm/s ,  where  s  is  the  center-to-center  distance.  If  mass 
m  is  inside  the  shell,  then  the  energy  is  constant,  i.e.,  the  shell’s 
interior  gravitational  field  is  zero. 


Proof:  Let  b  be  the  radius  of  the  shell,  h  its  thickness,  and  p 
its  density.  Its  volume  is  then  V= (area)  (thickness)  =Airb2h,  and 
its  mass  is  M  =  pV  =  dirpb2h.  The  strategy  is  to  divide  the 
shell  up  into  rings  as  shown  in  figure  j,  with  each  ring  extending 
from  8  to  9  +  d9.  Since  the  ring  is  infinitesimally  skinny,  its  en¬ 
tire  mass  lies  at  the  same  distance,  r,  from  mass  m.  The  width  of 
such  a  ring  is  found  by  the  definition  of  radian  measure  to  be  w  = 
bdd,  and  its  mass  is  d M  =  (p)  (circumference)  (thickness)  (width) = 
(p)(2TrbsmO)(h)(bdd)=2irpb2hsm9  d9.  The  gravitational  energy  of 
the  ring  interacting  with  mass  m  is  therefore 


dU 


Gm  d  M 


r 


—2irGpb2hm 


sin  9  d 9 
r 


Integrating  both  sides,  we  find  the  total  gravitational  energy  of  the 
shell: 


U  =  —2irGpb2hm 


sin  9  d 9 


The  integral  has  a  mixture  of  the  variables  r  and  9,  which  are  related 
by  the  law  of  cosines, 


r 2  =  b2  +  s2  —  2 bs  cos  9, 


and  to  evaluate  the  integral,  we  need  to  get  everything  in  terms  of  ei¬ 
ther  r  and  dr  or  9  and  d 9.  The  relationship  between  the  differentials 
is  found  by  differentiating  the  law  of  cosines, 


2 r  dr  =  2 bs  sin  9  d 9, 


w 


s 


j/A  spherical  shell  of  mass 
M  interacts  with  a  pointlike  mass 
m. 
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and  since  sin#d0  occurs  in  the  integral,  the  easiest  path  is  to  sub¬ 
stitute  for  it,  and  get  everything  in  terms  of  r  and  dr: 

2tt Gpbhm  fs+b  , 

U  = - /  dr 

®  J  s—b 

4:7rGpb2hm 

s 

GMm 

s 

This  was  all  under  the  assumption  that  mass  m  was  on  the  outside 
of  the  shell.  To  complete  the  proof,  we  consider  the  case  where  it’s 
inside.  In  this  case,  the  only  change  is  that  the  limits  of  integration 
are  different: 


k  /  The  gravitational  energy 
of  a  mass  m  at  a  distance  s  from 
the  center  of  a  hollow  spherical 
shell  of  mass. 


2tt Gpbhm  fb+s  , 

U  = - /  dr 

®  Jb—s 

=  —AttG  pbhm 
GMm 

=  V~ 

The  two  results  are  equal  at  the  surface  of  the  sphere,  s  =  b, 
so  the  constant-energy  part  joins  continuously  onto  the  1/s  part, 
and  the  effect  is  to  chop  off  the  steepest  part  of  the  graph  that  we 
would  have  had  if  the  whole  mass  M  had  been  concentrated  at  its 
center.  Dropping  a  mass  m  from  A  to  B  in  figure  k  releases  the 
same  amount  of  energy  as  if  mass  M  had  been  concentrated  at  its 
center,  but  there  is  no  release  of  gravitational  energy  at  all  when 
moving  between  two  interior  points  like  C  and  D.  In  other  words, 
the  internal  gravitational  field  is  zero.  Moving  from  C  to  D  brings 
mass  m  farther  away  from  the  nearby  side  of  the  shell,  but  closer 
to  the  far  side,  and  the  cancellation  between  these  two  effects  turns 
out  to  be  perfect.  Although  the  gravitational  field  has  to  be  zero 
at  the  center  due  to  symmetry,  it’s  much  more  surprising  that  it 
cancels  out  perfectly  in  the  whole  interior  region;  this  is  a  special 
mathematical  characteristic  of  a  l/r  interaction  like  gravity. 

Newton’s  apple  example  17 

Over  a  period  of  27.3  days,  the  moon  travels  the  circumference 
of  its  orbit,  so  using  data  from  Appendix  5,  we  can  calculate  its 
speed,  and  solve  the  circular  orbit  condition  to  determine  the 
strength  of  the  earth’s  gravitational  field  at  the  moon’s  distance 
from  the  earth,  g  =  v2jr  =  2.72  x  1CD3  m/s2,  which  is  3600 
times  smaller  than  the  gravitational  field  at  the  earth’s  surface. 
The  center-to-center  distance  from  the  moon  to  the  earth  is  60 
times  greater  than  the  radius  of  the  earth.  The  earth  is,  to  a  very 
good  approximation,  a  sphere  made  up  of  concentric  shells,  each 
with  uniform  density,  so  the  shell  theorem  tells  us  that  its  external 
gravitational  field  is  the  same  as  if  all  its  mass  was  concentrated 
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at  its  center.  We  already  know  that  a  gravitational  energy  that 
varies  as  -1  /r  is  equivalent  to  a  gravitational  field  proportional  to 
1  /r2,  so  it  makes  sense  that  a  distance  that  is  greater  by  a  fac¬ 
tor  of  60  corresponds  to  a  gravitational  field  that  is  60  x  60=3600 
times  weaker.  Note  that  the  calculation  didn’t  require  knowledge 
of  the  earth’s  mass  or  the  gravitational  constant,  which  Newton 
didn’t  know. 


In  1665,  shortly  after  Newton  graduated  from  Cambridge,  the 
Great  Plague  forced  the  college  to  close  for  two  years,  and  New¬ 
ton  returned  to  the  family  farm  and  worked  intensely  on  scientific 
problems.  During  this  productive  period,  he  carried  out  this  calcu¬ 
lation,  but  it  came  out  wrong,  causing  him  to  doubt  his  new  theory 
of  gravity.  The  problem  was  that  during  the  plague  years,  he  was 
unable  to  use  the  university’s  library,  so  he  had  to  use  a  figure  for 
the  radius  of  the  moon’s  orbit  that  he  had  memorized,  and  he  for¬ 
got  that  the  memorized  value  was  in  units  of  nautical  miles  rather 
than  statute  miles.  Once  he  realized  his  mistake,  he  found  that 
the  calculation  came  out  just  right,  and  became  confident  that  his 
theory  was  right  after  all.  9 

Weighing  the  earth  example  18 

>  Once  Cavendish  had  found  G  =  6.67  x  1CH1  J  •  m/kg2  (p.  101 , 
example  15),  it  became  possible  to  determine  the  mass  of  the 
earth.  By  the  shell  theorem,  the  gravitational  energy  of  a  mass 
mat  a  distance  r  from  the  center  of  the  earth  is  U  =  -GMm/r, 
where  M  is  the  mass  of  the  earth.  The  gravitational  field  is  related 
to  this  by  mg  dr  =  d  U,  or  g  =  (1  /m)dU/dr  =  GM/r2.  Solving  for 
M,  we  have 


M  =  gr2/G 

(9.8  m/s2)(6.4  x  106  m)2 
6.67  x  10-11  J  •  m/kg2 


=  6.0  x  1024 


m2  •  kg^ 


J  •  s2 


=  6.0  x  1024  kg 


Gravity  inside  the  earth  example  19 

>  The  earth  is  somewhat  more  dense  at  greater  depths,  but  as  an 
approximation  let’s  assume  it  has  a  constant  density  throughout. 
How  does  its  internal  gravitational  field  vary  with  the  distance  r 
from  the  center? 

>  Let’s  write  b  for  the  radius  of  the  earth.  The  shell  theorem  tell  us 
that  at  a  given  location  r,  we  only  need  to  consider  the  mass  M<r 

9  Some  historians  are  suspicious  that  the  story  of  the  apple  and  the  mistake 
in  conversions  may  have  been  fabricated  by  Newton  later  in  life.  The  conversion 
incident  may  have  been  a  way  of  explaining  his  long  delay  in  publishing  his  work, 
which  led  to  a  conflict  with  Leibniz  over  priority  in  the  invention  of  calculus. 
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I  /  The  actual  trajectory  of 
the  Apollo  1 1  spacecraft,  A,  and 
the  straight-line  trajectory,  B, 
assumed  in  the  example. 


that  is  deeper  than  r.  Under  the  assumption  of  constant  density, 
this  mass  is  related  to  the  total  mass  of  the  earth  by 

M<r  _  r3 
~W  ~  t>3’ 

and  by  the  same  reasoning  as  in  example  18, 

GM<r 
9 =-f2~’ 

so 

GMr 


In  other  words,  the  gravitational  field  interpolates  linearly  between 
zero  at  r  =  0  and  its  ordinary  surface  value  at  r  =  b. 

The  following  example  applies  the  numerical  techniques  of  sec¬ 
tion  2.2. 

From  the  earth  to  the  moon  example  20 

The  Apollo  11  mission  landed  the  first  humans  on  the  moon  in 
1969.  In  this  example,  we’ll  estimate  the  time  it  took  to  get  to 
the  moon,  and  compare  our  estimate  with  the  actual  time,  which 
was  73.0708  hours  from  the  engine  burn  that  took  the  ship  out  of 
earth  orbit  to  the  engine  burn  that  inserted  it  into  lunar  orbit.  Dur¬ 
ing  this  time,  the  ship  was  coasting  with  the  engines  off,  except 
for  a  small  course-correction  burn,  which  we  neglect.  More  im¬ 
portantly,  we  do  the  calculation  for  a  straight-line  trajectory  rather 
than  the  real  S-shaped  one,  so  the  result  can  only  be  expected 
to  agree  roughly  with  what  really  happened.  The  following  data 
come  from  the  original  press  kit,  which  NASA  has  scanned  and 
posted  on  the  Web: 

initial  altitude  3.363  x  105  m 
initial  velocity  1 .083  x  104  m/s 

The  endpoint  of  the  the  straight-line  trajectory  is  a  free-fall  im¬ 
pact  on  the  lunar  surface,  which  is  also  unrealistic  (luckily  for  the 
astronauts). 

The  ship’s  energy  is 

GMem  GMmrn  1  ? 

E  = - - - —  +  ^mv2, 


but  since  everything  is  proportional  to  the  mass  of  the  ship,  m,  we 
can  divide  it  out 

E  GMe  GMm  1  ? 

—  = - - - —  +  —  v  , 

m  r  rm  -  r  2 

and  the  energy  variables  in  the  program  with  names  like  e,  k,  and 
u  are  actually  energies  per  unit  mass.  The  program  is  a  straight¬ 
forward  modification  of  the  function  time3  on  page  93. 
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1 
2 

3 

4 

5 

6 

7 

8 
9 

10 
11 
12 

13 

14 

15 

16 

17 

18 

19 

20 
21 
22 
23 

»>  re=6.378e6  #  radius  of  earth 
»>  rm=1.74e6  #  radius  of  moon 

»>  ri=re+3 . 363e5  #  re+initial  altitude 

»>  rf=3.8e8-rm  #  earth-moon  distance  minus  rm 
»>  vi=1.083e4  #  initial  velocity 

»>  print (tmoon(vi ,ri ,rf , 1000) /3600 . )  #  convert  seconds  to  hours 
59.654047441976552 

This  is  pretty  decent  agreement,  considering  the  wildly  inaccurate 
trajectory  assumed.  It’s  interesting  to  see  how  much  the  duration 
of  the  trip  changes  if  we  increase  the  initial  velocity  by  only  ten 
percent: 

»>  vi=1.2e4 

»>  print (tmoon(vi ,ri ,rf , 1000) /3600 . ) 

18.177752636111677 

The  most  important  reason  for  using  the  lower  speed  was  that 
if  something  had  gone  wrong,  the  ship  would  have  been  able  to 
whip  around  the  moon  and  take  a  “free  return”  trajectory  back  to 
the  earth,  without  having  to  do  any  further  burns.  At  a  higher 
speed,  the  ship  would  have  had  so  much  kinetic  energy  that  in 
the  absence  of  any  further  engine  burns,  it  would  have  escaped 
from  the  earth-moon  system.  The  Apollo  13  mission  had  to  take  a 
free  return  trajectory  after  an  explosion  crippled  the  spacecraft. 


import  math 

def  tmoon(vi ,ri ,rf ,n)  : 


#  gravitational  constant 

#  mass  of  earth 

#  mass  of  moon 

#  earth-moon  distance 


bigg=6 . 67e-ll 
me=5 . 97e24 
mm=7 . 35e22 
rm=3 . 84e8 
r=ri 
v=vi 

dr  =  (rf-ri)/n 

e=-bigg*me/ri-bigg*mm/ (rm-ri) + . 5*vi**2 
t=0 

for  i  in  range (n): 

u_old  =  -bigg*me/r-bigg*mm/ (rm-r) 
k_old  =  e  -  u_old 
v_old  =  math.sqrt(2.*k_old) 
r  =  r+dr 

u  =  -bigg*me/r-bigg*mm/ (rm-r) 
k  =  e  -  u 

v  =  math. sqrt (2 . *k) 
v_avg  =  .5*(v_old+v) 
dt=dr/v_avg 
t=t+dt 
return  t 
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2.3.6  *  Evidence  for  repulsive  gravity 

Until  recently,  physicists  thought  they  understood  gravity  fairly 
well.  Einstein  had  modified  Newton’s  theory,  but  certain  character- 
istrics  of  gravitational  forces  were  firmly  established.  For  one  thing, 
they  were  always  attractive.  If  gravity  always  attracts,  then  it  is 
logical  to  ask  why  the  universe  doesn’t  collapse.  Newton  had  an¬ 
swered  this  question  by  saying  that  if  the  universe  was  infinite  in 
all  directions,  then  it  would  have  no  geometric  center  toward  which 
it  would  collapse;  the  forces  on  any  particular  star  or  planet  ex¬ 
erted  by  distant  parts  of  the  universe  would  tend  to  cancel  out  by 
symmetry.  More  careful  calculations,  however,  show  that  Newton’s 
universe  would  have  a  tendency  to  collapse  on  smaller  scales:  any 
part  of  the  universe  that  happened  to  be  slightly  more  dense  than 
average  would  contract  further,  and  this  contraction  would  result 
in  stronger  gravitational  forces,  which  would  cause  even  more  rapid 
contraction,  and  so  on. 

When  Einstein  overhauled  gravity,  the  same  problem  reared  its 
ugly  head.  Like  Newton,  Einstein  was  predisposed  to  believe  in  a 
universe  that  was  static,  so  he  added  a  special  repulsive  term  to  his 
equations,  intended  to  prevent  a  collapse.  This  term  was  not  associ¬ 
ated  with  any  interaction  of  mass  with  mass,  but  represented  merely 
an  overall  tendency  for  space  itself  to  expand  unless  restrained  by 
the  matter  that  inhabited  it.  It  turns  out  that  Einstein’s  solution, 
like  Newton’s,  is  unstable.  Furthermore,  it  was  soon  discovered 
observationally  that  the  universe  was  expanding,  and  this  was  in¬ 
terpreted  by  creating  the  Big  Bang  model,  in  which  the  universe’s 
current  expansion  is  the  aftermath  of  a  fantastically  hot  explosion.10 
An  expanding  universe,  unlike  a  static  one,  was  capable  of  being  ex¬ 
plained  with  Einstein’s  equations,  without  any  repulsion  term.  The 
universe’s  expansion  would  simply  slow  down  over  time  due  to  the 
attractive  gravitational  forces.  After  these  developments,  Einstein 
said  woefully  that  adding  the  repulsive  term,  known  as  the  cosmo¬ 
logical  constant,  had  been  the  greatest  blunder  of  his  life. 


m  /  The  WMAP  probe’s  map 
of  the  cosmic  microwave  back¬ 
ground  is  like  a  “baby  picture”  of 
the  universe. 


1(,Subsection  6.1.5  presents  some  evidence  for  the  Big  Bang  theory. 
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This  was  the  state  of  things  until  1999,  when  evidence  began  to 
turn  up  that  the  universe’s  expansion  has  been  speeding  up  rather 
than  slowing  down!  The  first  evidence  came  from  using  a  telescope 
as  a  sort  of  time  machine:  light  from  a  distant  galaxy  may  have 
taken  billions  of  years  to  reach  us,  so  we  are  seeing  it  as  it  was  far 
in  the  past.  Looking  back  in  time,  astronomers  saw  the  universe 
expanding  at  speeds  that  were  lower,  rather  than  higher.  At  first 
they  were  mortified,  since  this  was  exactly  the  opposite  of  what 
had  been  expected.  The  statistical  quality  of  the  data  was  also  not 
good  enough  to  constitute  ironclad  proof,  and  there  were  worries 
about  systematic  errors.  The  case  for  an  accelerating  expansion  has 
however  been  supported  by  high-precision  mapping  of  the  dim,  sky¬ 
wide  afterglow  of  the  Big  Bang,  known  as  the  cosmic  microwave 
background. 

So  now  Einstein’s  “greatest  blunder”  has  been  resurrected.  Since 
we  don’t  actually  know  whether  or  not  this  self-repulsion  of  space 
has  a  constant  strength,  the  term  “cosmological  constant ”  has  lost 
currency.  Nowadays  physicists  usually  refer  to  the  phenomenon  as 
“dark  energy.”  Picking  an  impressive-sounding  name  for  it  should 
not  obscure  the  fact  that  we  know  absolutely  nothing  about  the 
nature  of  the  effect  or  why  it  exists. 

2.4  Atomic  Phenomena 

Variety  is  the  spice  of  life,  not  of  science.  So  far  this  chapter  has 
focused  on  heat  energy,  kinetic  energy,  and  gravitational  energy, 
but  it  might  seem  that  in  addition  to  these  there  is  a  bewildering 
array  of  other  forms  of  energy.  Gasoline,  chocolate  bars,  batteries, 
melting  water  —  in  each  case  there  seems  to  be  a  whole  new  type 
of  energy.  The  physicist’s  psyche  rebels  against  the  prospect  of  a 
long  laundry  list  of  types  of  energy,  each  of  which  would  require 
its  own  equations,  concepts,  notation,  and  terminology.  The  point 
at  which  we’ve  arrived  in  the  study  of  energy  is  analogous  to  the 
period  in  the  1960’s  when  a  half  a  dozen  new  subatomic  particles 
were  being  discovered  every  year  in  particle  accelerators.  It  was  an 
embarrassment.  Physicists  began  to  speak  of  the  “particle  zoo,” 
and  it  seemed  that  the  subatomic  world  was  distressingly  complex. 
The  particle  zoo  was  simplified  by  the  realization  that  most  of  the 
new  particles  being  whipped  up  were  simply  clusters  of  a  previously 
unsuspected  set  of  fundamental  particles  (which  were  whimsically 
dubbed  quarks,  a  made-up  word  from  a  line  of  poetry  by  James 
Joyce,  “Three  quarks  for  Master  Mark.”)  The  energy  zoo  can  also 
be  simplified,  and  it’s  the  purpose  of  this  section  to  demonstrate  the 
hidden  similarities  between  forms  of  energy  as  seemingly  different 
as  heat  and  motion. 
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a  /  A  vivid  demonstration  that 
heat  is  a  form  of  motion.  A  small 
amount  of  boiling  water  is  poured 
into  the  empty  can,  which  rapidly 
fills  up  with  hot  steam.  The  can 
is  then  sealed  tightly,  and  soon 
crumples. 


b  /  Random  motion  of  atoms 
in  a  gas,  a  liquid,  and  a  solid. 


2.4.1  Heat  is  kinetic  energy. 

What  is  heat  really?  Is  it  an  invisible  fluid  that  your  bare  feet 
soak  up  from  a  hot  sidewalk?  Can  one  ever  remove  all  the  heat  from 
an  object?  Is  there  a  maximum  to  the  temperature  scale? 

The  theory  of  heat  as  a  fluid  seemed  to  explain  why  colder  ob¬ 
jects  absorbed  heat  from  hotter  ones,  but  once  it  became  clear  that 
heat  was  a  form  of  energy,  it  began  to  seem  unlikely  that  a  material 
substance  could  transform  itself  into  and  out  of  all  those  other  forms 
of  energy  like  motion  or  light.  For  instance,  a  compost  pile  gets  hot, 
and  we  describe  this  as  a  case  where,  through  the  action  of  bacteria, 
chemical  energy  stored  in  the  plant  cuttings  is  transformed  into  heat 
energy.  The  heating  occurs  even  if  there  is  no  nearby  warmer  object 
that  could  have  been  leaking  “heat  fluid”  into  the  pile. 

An  alternative  interpretation  of  heat  was  suggested  by  the  theory 
that  matter  is  made  of  atoms.  Since  gases  are  thousands  of  times  less 
dense  than  solids  or  liquids,  the  atoms  (or  clusters  of  atoms  called 
molecules)  in  a  gas  must  be  far  apart.  In  that  case,  what  is  keeping 
all  the  air  molecules  from  settling  into  a  thin  film  on  the  floor  of  the 
room  in  which  you  are  reading  this  book?  The  simplest  explanation 
is  that  they  are  moving  very  rapidly,  continually  ricocheting  off  of 
the  floor,  walls,  and  ceiling.  Though  bizarre,  the  cloud-of-bullets 
image  of  a  gas  did  give  a  natural  explanation  for  the  surprising 
ability  of  something  as  tenuous  as  a  gas  to  exert  huge  forces. 

The  experiment  shown  in  figure  a,  for  instance,  can  be  explained 
as  follows.  The  high  temperature  of  the  steam  is  interpreted  as  a 
high  average  speed  of  random  motions  of  its  molecules.  Before  the 
lid  was  put  on  the  can,  the  rapidly  moving  steam  molecules  pushed 
their  way  out  of  the  can,  forcing  the  slower  air  molecules  out  of  the 
way.  As  the  steam  inside  the  can  thinned  out,  a  stable  situation 
was  soon  achieved,  in  which  the  force  from  the  less  dense  steam 
molecules  moving  at  high  speed  balanced  against  the  force  from  the 
more  dense  but  slower  air  molecules  outside.  The  cap  was  put  on, 
and  after  a  while  the  steam  inside  the  can  began  to  cool  off.  The 
force  from  the  cooler,  thin  steam  no  longer  matched  the  force  from 
the  cool,  dense  air  outside,  and  the  imbalance  of  forces  crushed  the 
can. 

This  type  of  observation  leads  naturally  to  the  conclusion  that 
hotter  matter  differs  from  colder  in  that  its  atoms’  random  motion  is 
more  rapid.  In  a  liquid,  the  motion  could  be  visualized  as  people  in  a 
milling  crowd  shoving  past  each  other  more  quickly.  In  a  solid,  where 
the  atoms  are  packed  together,  the  motion  is  a  random  vibration  of 
each  atom  as  it  knocks  against  its  neighbors. 

We  thus  achieve  a  great  simplification  in  the  theory  of  heat.  Heat 
is  simply  a  form  of  kinetic  energy,  the  total  kinetic  energy  of  random 
motion  of  all  the  atoms  in  an  object.  With  this  new  understanding, 
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it  becomes  possible  to  answer  at  one  stroke  the  questions  posed 
at  the  beginning  of  the  section.  Yes,  it  is  at  least  theoretically 
possible  to  remove  all  the  heat  from  an  object.  The  coldest  possible 
temperature,  known  as  absolute  zero,  is  that  at  which  all  the  atoms 
have  zero  velocity,  so  that  their  kinetic  energies,  K  =  (1/2 )mv2,  are 
all  zero.  No,  there  is  no  maximum  amount  of  heat  that  a  certain 
quantity  of  matter  can  have,  and  no  maximum  to  the  temperature 
scale,  since  arbitrarily  large  values  of  v  can  create  arbitrarily  large 
amounts  of  kinetic  energy  per  atom. 

The  kinetic  theory  of  heat  also  provides  a  simple  explanation  of 
the  true  nature  of  temperature.  Temperature  is  a  measure  of  the 
amount  of  energy  per  molecule,  whereas  heat  is  the  total  amount  of 
energy  possessed  by  all  the  molecules  in  an  object. 

There  is  an  entire  branch  of  physics,  called  thermodynamics, 
that  deals  with  heat  and  temperature  and  forms  the  basis  for  tech¬ 
nologies  such  as  refrigeration.  Thermodynamics  is  discussed  in  more 
detail  in  chapter  5,  and  I’ve  provided  here  only  a  brief  overview  of 
the  thermodynamic  concepts  that  relate  directly  to  energy. 

2.4.2  All  energy  comes  from  particles  moving  or  interacting. 

If  I  stretch  the  spring  in  figure  c  and  then  release  it,  it  snaps  taut 
again.  The  creation  of  some  kinetic  energy  shows  that  there  must 
have  been  some  other  form  of  energy  that  was  destroyed.  What  was 
it? 

We  could  just  invent  a  new  type  of  energy  called  “spring  energy,” 
study  its  behavior,  and  call  it  quits,  but  that  would  be  ugly.  Are 
we  going  to  have  to  invent  a  new  forms  of  energy  like  this,  over 
and  over?  No:  the  title  of  this  book  doesn’t  lie,  and  physics  really 
is  fundamentally  simple.  As  shown  in  figure  d,  when  we  bend  or 
stretch  an  object,  we’re  really  changing  the  distances  between  the 
atoms,  resulting  in  a  change  in  electrical  energy.  Electrical  energy 
isn’t  really  our  topic  right  now  —  that’s  what  most  of  the  second 
half  of  this  book  is  about  —  but  conceptually  it’s  very  similar  to 
gravitational  energy.  Like  gravitational  energy,  it  depends  on  1/r, 
although  there  are  some  interesting  new  phenomena,  such  as  the 
existence  of  both  attraction  and  repulsion,  which  doesn’t  occur  with 
gravity  because  gravitational  mass  can’t  be  negative.  The  real  point 
is  that  all  the  apparently  dissimilar  forms  of  energy  in  figure  d  turn 
out  to  be  due  to  electrical  interactions  among  atoms.  Even  if  we 
wish  to  include  nuclear  reactions  (figure  e)  in  the  picture,  there  still 
turn  out  to  be  only  four  fundamental  types  of  energy: 

kinetic  energy  (including  heat) 
gravitational  energy 
electrical  and  magnetic  energy 
nuclear  energy 

Astute  students  have  often  asked  me  how  light  fits  into  this  pic- 


c  /  The  spring’s  energy  is  re¬ 
ally  due  to  electrical  interactions 
among  atoms. 


d  /  All  these  energy  transfor¬ 
mations  turn  out  at  the  atomic 
level  to  be  due  to  changes  in  the 
distances  between  atoms  that 
interact  electrically. 
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e/This  figure  looks  similar 
to  the  previous  ones,  but  the 
scale  is  a  million  times  smaller. 
The  little  balls  are  the  neutrons 
and  protons  that  make  up  the 
tiny  nucleus  at  the  center  of  a 
uranium  atom.  When  the  nucleus 
splits  (fissions),  the  source  of  the 
kinetic  energy  is  partly  electrical 
and  partly  nuclear. 


ture.  This  is  a  very  good  question,  and  in  fact  it  could  be  argued 
that  it  is  the  basic  question  that  led  to  Einstein’s  theory  of  relativ¬ 
ity  as  well  as  the  modern  quantum  picture  of  nature.  Since  these 
are  topics  for  the  second  half  of  the  book,  we’ll  have  to  be  content 
with  half  an  answer  at  this  point.  For  now,  we  may  think  of  light 
energy  as  a  form  of  kinetic  energy,  but  one  calculated  not  according 
to  (1/2 )mv2  but  by  some  other  equation.  (We  know  that  (1/2 )mv2 
would  not  make  sense,  because  light  has  no  mass,  and  furthermore, 
high-energy  beams  of  light  do  not  differ  in  speed  from  low-energy 
ones.) 

Temperature  during  boiling  example  21 

>  If  you  stick  a  thermometer  in  a  pan  of  water,  and  watch  the 
temperature  as  you  bring  the  water  to  a  boil,  you’ll  notice  an  in¬ 
teresting  fact.  The  temperature  goes  up  until  the  boiling  point  is 
reached,  but  then  stays  at  1 00°C  during  the  whole  time  the  water 
is  being  boiled  off.  The  temperature  of  the  steam  is  also  100°C. 
Why  does  the  temperature  “stick”  like  this?  What’s  happening  to 
all  the  energy  that  the  stove’s  burner  is  putting  into  the  pan? 

>  As  shown  in  figure  d,  boiling  requires  an  increase  in  electrical 
energy,  because  the  atoms  coming  out  as  gas  are  moving  away 
from  the  other  atoms,  which  attract  them  electrically.  It  is  only  this 
electrical  energy  that  is  increasing,  not  the  atoms’  kinetic  energy, 
which  is  what  the  thermometer  can  measure. 

Diffusion  example  22 

t>  A  drop  of  food  coloring  in  a  cup  of  water  will  gradually  spread 
out,  even  if  you  don’t  do  any  mixing  with  a  spoon.  This  is  called 
diffusion.  Why  would  this  happen,  and  what  effect  would  temper¬ 
ature  have?  What  would  happen  with  solids  or  gases? 

>  Figure  b  shows  that  the  atoms  in  a  liquid  mingle  because  of 
their  random  thermal  motion.  Diffusion  is  slow  (typically  on  the 
order  of  a  centimeter  a  minute),  despite  the  high  speeds  of  the 
atoms  (typically  hundreds  of  miles  per  hour).  This  is  due  to  the 
randomness  of  the  motion:  a  particular  atom  will  take  a  long  time 
to  travel  any  significant  distance,  because  it  doesn’t  travel  in  a 
straight  line. 

Based  on  this  picture,  we  expect  that  the  speed  of  diffusion  should 
increase  as  a  function  of  temperature,  and  experiments  show  that 
this  is  true. 

Diffusion  also  occurs  in  gases,  which  is  why  you  can  smell  things 
even  when  the  air  is  still.  The  speeds  are  much  faster,  because 
the  typical  distance  between  collisions  is  much  longer  than  in  a 
liquid. 

We  can  see  from  figure  b  that  diffusion  won’t  occur  in  solids,  be¬ 
cause  each  atom  vibrates  around  an  equilibrium  position. 
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Discussion  Questions 


A  I’m  not  making  this  up.  XS  Energy  Drink  has  ads  that  read  like  this: 
All  the  “Energy” ...  Without  the  Sugar!  Only  8  Calories!”  Comment  on 
this. 


2.5  Oscillations 

Let’s  revisit  the  example  of  the  stretched  spring  from  the  previ¬ 
ous  section.  We  know  that  its  energy  is  a  form  of  electrical  energy 
of  interacting  atoms,  which  is  nice  conceptually  but  doesn’t  help 
us  to  solve  problems,  since  we  don’t  know  how  the  energy,  U,  de¬ 
pends  on  the  length  of  the  spring.  All  we  know  is  that  there’s  an 
equilibrium  (figure  a/1),  which  is  a  local  minimum  of  the  function 
JJ .  An  extremely  important  problem  which  arises  in  this  connec¬ 
tion  is  how  to  calculate  oscillatory  motion  around  an  equilibrium, 
as  in  a/4-13.  Even  if  we  did  special  experiments  to  find  out  how  the 
spring’s  energy  worked,  it  might  seem  like  we’d  have  to  go  through 
just  as  much  work  to  deal  with  any  other  kind  of  oscillation,  such 
as  a  sapling  swinging  back  and  forth  in  the  breeze. 


5 

6 

7 

8 


10 

11 


12 

13 


Surprisingly,  it’s  possible  to  analyze  this  type  of  oscillation  in  a 
very  general  and  elegant  manner,  as  long  as  the  analysis  is  limited 
to  small  oscillations.  We’ll  talk  about  the  mass  on  the  spring  for 
concreteness,  but  there  will  be  nothing  in  the  discussion  at  all  that 
is  restricted  to  that  particular  physical  system.  First,  let’s  choose 
a  coordinate  system  in  which  x  =  0  corresponds  to  the  position  of 
the  mass  where  the  spring  is  in  equilibrium,  and  since  interaction 
energies  like  U  are  only  well  defined  up  to  an  additive  constant,  we’ll 
simply  define  it  to  be  zero  at  equilibrium: 


a /The  spring  has  a  minimum- 
energy  length,  1,  and  energy  is 
required  in  order  to  compress  or 
stretch  it,  2  and  3.  A  mass  at¬ 
tached  to  the  spring  will  oscillate 
around  the  equilibrium,  4-13. 


17(0)  =  0 


Since  x  =  0  is  an  equilibrium,  U(x)  must  have  a  local  minimum 
there,  and  a  differentiable  function  (which  we  assume  U  is)  has  a 
zero  derivative  at  a  local  minimum: 


dU 

dx 


(0)  =  0 


There  are  still  infinitely  many  functions  that  could  satisfy  these 
criteria,  including  the  three  shown  in  figure  b,  which  are  x2/2, 
x2/2(l  +  x2),  and  (e3x  +  e~3x  —  2) / 18.  Note,  however,  how  all  three 
functions  are  virtually  identical  right  near  the  minimum.  That’s 
because  they  all  have  the  same  curvature.  More  specifically,  each 
function  has  its  second  derivative  equal  to  1  at  x  =  0,  and  the  sec¬ 
ond  derivative  is  a  measure  of  curvature.  We  write  k  for  the  second 
derivative  of  the  energy  at  an  equilibrium  point, 


d2U 

dx2 


(0)  =  k. 
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b/ Three  functions  with  the 
same  curvature  at  x=0. 


■M 


c/The  amplitude  would  usu¬ 
ally  be  defined  as  the  distance 
from  equilibrium  to  one  extreme 
of  the  motion,  i.e. ,  half  the  total 
travel. 


Physically,  k  is  a  measure  of  stiffness.  For  example,  the  heavy-duty 
springs  in  a  car’s  shock  absorbers  would  have  a  high  value  of  k.  It 
is  often  referred  to  as  the  spring  constant,  but  we’re  only  using  a 
spring  as  an  example  here.  As  shown  in  figure  b,  any  two  functions 
that  have  f7(0)  =  0,  dll /  d:r;  =  0,  and  d?U /  dx 2  =  k ,  with  the  same 
value  of  k.  are  virtually  indistinguishable  for  small  values  of  x,  so  if 
we  want  to  analyze  small  oscillations,  it  doesn’t  even  matter  which 
function  we  assume.  For  simplicity,  we’ll  just  use  U(x)  =  (1/2 )kx2 
from  now  on. 

Now  we’re  ready  to  analyze  the  mass-on-a-spring  system,  while 
keeping  in  mind  that  it’s  really  only  a  representative  example  of 
a  whole  class  of  similar  oscillating  systems.  We  expect  that  the 
motion  is  going  to  repeat  itself  over  and  over  again,  and  since  we’re 
not  going  to  include  frictional  heating  in  our  model,  that  repetition 
should  go  on  forever  without  dying  out.  The  most  interesting  thing 
to  know  about  the  motion  would  be  the  period,  T,  which  is  the 
amount  of  time  required  for  one  complete  cycle  of  the  motion.  We 
might  expect  that  the  period  would  depend  on  the  spring  constant, 
k ,  the  mass,  m,  and  and  the  amplitude,  A,  defined  in  figure  c.11 

In  examples  like  the  brachistochrone  and  the  Apollo  11  mission, 
it  was  generally  necessary  to  use  numerical  techniques  to  determine 
the  amount  of  time  required  for  a  certain  motion.  Once  again,  let’s 
dust  off  the  time3  function  from  page  93  and  modify  it  for  our 
purposes.  For  flexibility,  we’ll  define  the  function  U (. x )  as  a  separate 
Python  function.  We  really  want  to  calculate  the  time  required  for 
the  mass  to  come  back  to  its  starting  point,  but  that  would  be 
awkward  to  set  up,  since  our  function  works  by  dividing  up  the 
distance  to  be  traveled  into  tiny  segments.  By  symmetry,  the  time 
required  to  go  from  one  end  to  the  other  equals  the  time  required 
to  come  back  to  the  start,  so  we’ll  just  calculate  the  time  for  half  a 
cycle  and  then  double  it  when  we  return  the  result  at  the  end  of  the 
function.  The  test  at  lines  16-19  is  necessary  because  otherwise  at 
the  very  end  of  the  motion  we  can  end  up  trying  to  take  the  square 
root  of  a  negative  number  due  to  rounding  errors. 


nMany  kinds  of  oscillations  are  possible,  so  there  is  no  standard  definition  of 
the  amplitude.  For  a  pendulum,  the  natural  definition  would  be  in  terms  of  an 
angle.  For  a  radio  transmitter,  we’d  use  some  kind  of  electrical  units. 
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»>  print  (osc  (1 1 1 100000) ) 
warning,  K=  -1 . 43707268307e-12  <0 
6.2831854132667919 

The  first  thing  to  notice  is  that  with  this  particular  set  of  inputs 
(m=l  kg,  k  =  1  J/nr2,  and  A  =  1  m),  the  program  has  done  an  ex¬ 
cellent  job  of  computing  27t  =  6.2831853  ....  This  is  Mother  Nature 
giving  us  a  strong  hint  that  the  problem  has  an  algebraic  solution, 
not  just  a  numerical  one.  The  next  interesting  thing  happens  when 
we  change  the  amplitude  from  1  m  to  2  m: 

»>  print  (osc  (1 .,  1 .  ,2 .,  100000) ) 
warning,  K=  -5 . 7482907323e-12  <0 
6.2831854132667919 


import  math 
def  u(k,x) : 

return  ,5*k*x**2 

def  osc (m,k, a,n) : 
x=a 
v=0 

dx  =  -2.*a/n 
t=0 

e  =  u(k,x)+ . 5*m*v**2 
for  i  in  range (n): 
x_old  =  x 
v_old  =  v 
x  =  x+dx 

kinetic  =  e-u(k,x) 
if  kinetic<0.  : 

v=0 . 

print  "warning,  K=" , kinetic, "<0" 
else  : 

v  =  -math. sqrt (2 . *kinetic/m) 
v_avg  =  (v+v_old)/2. 
dt=dx/v_avg 
t=t+dt 
return  2 . *t 


Even  though  the  mass  had  to  travel  double  the  distance  in  each 
direction,  the  period  is  the  same  to  within  the  numerical  accuracy 
of  the  calculation! 

With  these  hints,  it  seems  like  we  should  start  looking  for  an 
algebraic  solution.  For  guidance,  here’s  a  graph  of  x  as  a  function 
of  t,  as  calculated  by  the  osc  function  with  n=10. 
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This  looks  like  a  cosine  function,  so  let’s  see  if  a  x  =  Hcos  (urt+d) 
is  a  solution  to  the  conservation  of  energy  equation  —  it’s  not  un¬ 
common  to  try  to  “reverse-engineer”  the  cryptic  results  of  a  numer¬ 
ical  calculation  like  this.  The  symbol  i j  =  2tt/T  (Greek  omega), 
called  angular  frequency,  is  a  standard  symbol  for  the  number  of 
radians  per  second  of  oscillation.  Except  for  the  factor  of  2ir ,  it  is 
identical  to  the  ordinary  frequency  /  =  1/T,  which  has  units  of  s-1 
or  Hz  (Hertz).  The  phase  angle  5  is  to  allow  for  the  possibility  that 
t  =  0  doesn’t  coincide  with  the  beginning  of  the  motion.  The  energy 
is 


d  /  Example  23.  The  rod  piv¬ 
ots  on  the  hinge  at  the  bottom. 


1  9  1  9 

=  -m  [—Hu;  sin  (t ot  +  <5)]  +  -k  [Hcos  (cut  +  (5)] 
=  -A2  [mu;2  sin  2 (cut  +  6)  +  k  cos  2 (cot  +  5)] 


According  to  conservation  of  energy,  this  has  to  be  a  constant.  Using 
the  identity  sin2  +  cos2  =  1 ,  we  can  see  that  it  will  be  a  constant  if 
we  have  mu2  =  k,  or  u>  =  \J k/m ,  i.e.,  T  =  2tt \frnjk.  Note  that  the 
period  is  independent  of  amplitude. 

A  spring  and  a  lever  example  23 

>  What  is  the  period  of  small  oscillations  of  the  system  shown  in 
the  figure?  Neglect  the  mass  of  the  lever  and  the  spring.  Assume 
that  the  spring  is  so  stiff  that  gravity  is  not  an  important  effect. 
The  spring  is  relaxed  when  the  lever  is  vertical. 

>  This  is  a  little  tricky,  because  the  spring  constant  k,  although 
it  is  relevant,  is  not  the  k  we  should  be  putting  into  the  equation 
T  =  Zn^fmjk.  The  k  that  goes  in  there  has  to  be  the  second 
derivative  of  U  with  respect  to  the  position,  x,  of  the  mass  that’s 
moving.  The  energy  U  stored  in  the  spring  depends  on  how  far 
the  tip  of  the  lever  is  from  the  center.  This  distance  equals  ( L/b)x , 
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so  the  energy  in  the  spring  is 


kL2  2 
2  b2 


and  the  k  we  have  to  put  in  T  =  2n^m/k  is 

cP  U  _  kL2 
dx2  "  W 

The  result  is 


T  =  2n 


mb 2 


kL 2 
2nb  [m 


The  leverage  of  the  lever  makes  it  as  if  the  spring  was  stronger, 
and  decreases  the  period  of  the  oscillations  by  a  factor  of  b/L. 

Water  in  a  U-shaped  tube  example  24 

>  What  is  the  period  of  oscillation  of  the  water  in  figure  e? 

>  In  example  13  on  p.  89,  we  found  U(y)  =  p gAy2,  so  the  “spring 
constant,”  which  really  isn’t  a  spring  constant  here  at  all,  is 

d2U 
d  y2 

=  2pgA. 

This  is  an  interesting  example,  because  k  can  be  calculated  with¬ 
out  any  approximations,  but  the  kinetic  energy  requires  an  ap¬ 
proximation,  because  we  don’t  know  the  details  of  the  pattern 
of  flow  of  the  water.  It  could  be  very  complicated.  There  will 
be  a  tendency  for  the  water  near  the  walls  to  flow  more  slowly 
due  to  friction,  and  there  may  also  be  swirling,  turbulent  motion. 
However,  if  we  make  the  approximation  that  all  the  water  moves 
with  the  same  velocity  as  the  surface,  dy/df,  then  the  mass-on- 
a-spring  analysis  applies.  Letting  L  be  the  total  length  of  the  filled 
part  of  the  tube,  the  mass  is  p LA,  and  we  have 

T  =  2n  yfmjk 


e  /  Water  in  a  U-shaped  tube. 


This  chapter  is  summarized  on  page  990.  Notation  and  terminology 
are  tabulated  on  pages  984-985. 


Section  2.5 
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The  symbols  \/,  ^ ,  etc.  are  explained  on  page  125. 

1  Experiments  show  that  the  power  consumed  by  a  boat’s  engine 
is  approximately  proportional  to  the  third  power  of  its  speed.  (We 
assume  that  it  is  moving  at  constant  speed.) 

(a)  When  a  boat  is  cruising  at  constant  speed,  what  type  of  energy 
transformation  do  you  think  is  being  performed? 

(b)  If  you  upgrade  to  a  motor  with  double  the  power,  by  what  factor 
is  your  boat’s  maximum  cruising  speed  increased? 

>  Solution,  p.  972 

2  Object  A  has  a  kinetic  energy  of  13.4  J.  Object  B  has  a  mass 
that  is  greater  by  a  factor  of  3.77,  but  is  moving  more  slowly  by  a 
factor  of  2.34.  What  is  object  B’s  kinetic  energy? 

>  Solution,  p.  972 

3  My  1.25  kW  microwave  oven  takes  126  seconds  to  bring  250 

g  of  water  from  room  temperature  to  a  boil.  What  percentage  of 
the  power  is  being  wasted?  Where  might  the  rest  of  the  energy  be 
going?  >  Solution,  p.  972  ■ 

4  The  multiflash  photograph  shows  a  collision  between  two  pool 
balls.  The  ball  that  was  initially  at  rest  shows  up  as  a  dark  image 
in  its  initial  position,  because  its  image  was  exposed  several  times 
before  it  was  struck  and  began  moving.  By  making  measurements 
on  the  figure,  determine  numerically  whether  or  not  energy  appears 
to  have  been  conserved  in  the  collision.  What  systematic  effects 
would  limit  the  accuracy  of  your  test?  [From  an  example  in  PSSC 
Physics.] 


1 


Problem  4. 
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5  A  grasshopper  with  a  mass  of  110  mg  falls  from  rest  from  a 
height  of  310  cm.  On  the  way  down,  it  dissipates  1.1  mJ  of  heat  due 
to  air  resistance.  At  what  speed,  in  m/s,  does  it  hit  the  ground? 

>  Solution,  p.  973  - 

6  A  ball  rolls  up  a  ramp,  turns  around,  and  comes  back  down. 
When  does  it  have  the  greatest  gravitational  energy?  The  greatest 
kinetic  energy?  [Based  on  a  problem  by  Serway  and  Faughn.] 

7  (a)  You  release  a  magnet  on  a  tabletop  near  a  big  piece  of  iron, 
and  the  magnet  leaps  across  the  table  to  the  iron.  Does  the  magnetic 
energy  increase,  or  decrease?  Explain,  (b)  Suppose  instead  that  you 
have  two  repelling  magnets.  You  give  them  an  initial  push  towards 
each  other,  so  they  decelerate  while  approaching  each  other.  Does 
the  magnetic  energy  increase,  or  decrease?  Explain. 

8  Estimate  the  kinetic  energy  of  an  Olympic  sprinter. 

9  You  are  driving  your  car,  and  you  hit  a  brick  wall  head  on, 

at  full  speed.  The  car  has  a  mass  of  1500  kg.  The  kinetic  energy 
released  is  a  measure  of  how  much  destruction  will  be  done  to  the  car 
and  to  your  body.  Calculate  the  energy  released  if  you  are  traveling 
at  (a)  40  mi/hr,  and  again  (b)  if  you’re  going  80  mi/hr.  What  is 
counterintuitive  about  this,  and  what  implication  does  this  have  for 
driving  at  high  speeds?  v 

10  A  closed  system  can  be  a  bad  thing  -  for  an  astronaut 

sealed  inside  a  space  suit,  getting  rid  of  body  heat  can  be  difficult. 
Suppose  a  60-kg  astronaut  is  performing  vigorous  physical  activity, 
expending  200  W  of  power.  If  none  of  the  heat  can  escape  from  her 
space  suit,  how  long  will  it  take  before  her  body  temperature  rises 
by  6°C  (11°F),  an  amount  sufficient  to  kill  her?  Assume  that  the 
amount  of  heat  required  to  raise  her  body  temperature  by  1°C  is 
the  same  as  it  would  be  for  an  equal  mass  of  water.  Express  your 
answer  in  units  of  minutes.  V  ^ 

11  The  following  table  gives  the  amount  of  energy  required  in 
order  to  heat,  melt,  or  boil  a  gram  of  water. 


heat 

1 

g 

of  ice  by 

1°C 

2.05  J 

melt 

1 

g 

of  ice 

333  J 

heat 

1 

g 

of  water 

by  1°C 

4.19  J 

boil 

1 

g  ' 

of  water 

2500  J 

heat 

1 

g 

of  steam 

by  1°C 

2.01  J 

(a)  How  much  energy  is  required  in  order  to  convert  1.00  g  of  ice  at 

-20  °C  into  steam  at  137  °C?  V 

(b)  What  is  the  minimum  amount  of  hot  water  that  could  melt  1.00 

g  of  ice?  v  ■ 

12  Anya  climbs  to  the  top  of  a  tree,  while  Ivan  climbs  half-way 
to  the  top.  They  both  drop  pennies  to  the  ground.  Compare  the 
kinetic  energies  and  velocities  of  the  pennies  on  impact,  using  ratios. 


Problem  16. 


m2 


m1 


Problem  17. 


Problem  18. 


Problem  19. 


13  Anya  and  Ivan  lean  over  a  balcony  side  by  side.  Anya  throws 
a  penny  downward  with  an  initial  speed  of  5  m/s.  Ivan  throws 
a  penny  upward  with  the  same  speed.  Both  pennies  end  up  on 
the  ground  below.  Compare  their  kinetic  energies  and  velocities  on 
impact. 

14  (a)  A  circular  hoop  of  mass  m  and  radius  r  spins  like  a  wheel 
while  its  center  remains  at  rest.  Let  cj  (Greek  letter  omega)  be  the 
number  of  radians  it  covers  per  unit  time,  i.e. ,  u  =  2n/T,  where 
the  period,  T,  is  the  time  for  one  revolution.  Show  that  its  kinetic 
energy  equals  (1/2 )muj2r2. 

(b)  Show  that  the  answer  to  part  a  has  the  right  units.  (Note 
that  radians  aren’t  really  units,  since  the  definition  of  a  radian  is  a 
unitless  ratio  of  two  lengths.) 

(c)  If  such  a  hoop  rolls  with  its  center  moving  at  velocity  v,  its 
kinetic  energy  equals  (1/2 )mv2,  plus  the  amount  of  kinetic  energy 
found  in  part  a.  Show  that  a  hoop  rolls  down  an  inclined  plane  with 
half  the  acceleration  that  a  frictionless  sliding  block  would  have. 

u 

15  On  page  83,  I  used  the  chain  rule  to  prove  that  the  acceler¬ 
ation  of  a  free- falling  object  is  given  by  a  =  —g.  In  this  problem, 
you’ll  use  a  different  technique  to  prove  the  same  thing.  Assume 
that  the  acceleration  is  a  constant,  a,  and  then  integrate  to  find 
v  and  y,  including  appropriate  constants  of  integration.  Plug  your 
expressions  for  v  and  y  into  the  equation  for  the  total  energy,  and 
show  that  a  =  —g  is  the  only  value  that  results  in  constant  energy. 

I 

16  The  figure  shows  two  unequal  masses,  m\  and  m2,  connected 
by  a  string  running  over  a  pulley.  Find  the  acceleration. 

>  Hint,  p.  958  V  - 

17  What  ratio  of  masses  will  balance  the  pulley  system  shown 

in  the  figure?  >  Hint,  p.  958  ■ 

18  (a)  For  the  apparatus  shown  in  the  figure,  find  the  equilib¬ 
rium  angle  9  in  terms  of  the  two  masses.  V 

(b)  Interpret  your  result  in  the  case  of  M  3>  m  (M  much  greater 
than  m).  Does  it  make  sense  physically? 

(c)  For  what  combinations  of  masses  would  your  result  give  non¬ 
sense?  Interpret  this  physically.  >  Hint,  p.  958  Hi 

19  In  the  system  shown  in  the  figure,  the  pulleys  on  the  left  and 

right  are  fixed,  but  the  pulley  in  the  center  can  move  to  the  left  or 
right.  The  two  hanging  masses  are  identical,  and  the  pulleys  and 
ropes  are  all  massless.  Find  the  upward  acceleration  of  the  mass  on 
the  left,  in  terms  of  g  only.  >  Hint,  p.  958  V  it 

20  Two  atoms  will  interact  through  electrical  forces  between 
their  protons  and  electrons.  One  fairly  good  approximation  to  the 
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electrical  energy  is  the  Lennard-Jones  formula. 


U (r)  =  k 


where  r  is  the  center-to-center  distance  between  the  atoms  and  k  is 
a  positive  constant.  Show  that  (a)  there  is  an  equilibrium  point  at 
r  =  a, 

(b)  the  equilibrium  is  stable,  and 

(c)  the  energy  required  to  bring  the  atoms  from  their  equilibrium 

separation  to  infinity  is  k.  >  Hint,  p.  958  - 


21  The  International  Space  Station  orbits  at  an  altitude  of  about 
360  to  400  km.  What  is  the  gravitational  field  of  the  earth  at  this 
altitude?  V 


22  (a)  A  geosynchronous  orbit  is  one  in  which  the  satellite  orbits 

above  the  equator,  and  has  an  orbital  period  of  24  hours,  so  that  it 
is  always  above  the  same  point  on  the  spinning  earth.  Calculate  the 
altitude  of  such  a  satellite.  V 

(b)  What  is  the  gravitational  field  experienced  by  the  satellite?  Give 
your  answer  as  a  percentage  in  relation  to  the  gravitational  field  at 
the  earth’s  surface.  >  Hint,  p.  958  V 


23  Astronomers  calculating  orbits  of  planets  often  work  in  a 

nonmetric  system  of  units,  in  which  the  unit  of  time  is  the  year, 
the  unit  of  mass  is  the  sun’s  mass,  and  the  unit  of  distance  is  the 
astronomical  unit  (A.U.),  defined  as  half  the  long  axis  of  the  earth’s 
orbit.  In  these  units,  find  an  exact  expression  for  the  gravitational 
constant,  G.  V  U 

24  The  star  Lalande  21185  was  found  in  1996  to  have  two  planets 

in  roughly  circular  orbits,  with  periods  of  6  and  30  years.  What  is 
the  ratio  of  the  two  planets’  orbital  radii?  C  ^ 

25  A  projectile  is  moving  directly  away  from  a  planet  of  mass 

M  at  exactly  escape  velocity,  (a)  Find  r,  the  distance  from  the 
projectile  to  the  center  of  the  planet,  as  a  function  of  time,  t,  and 
also  find  v(t).  V 

(b)  Check  the  units  of  your  answer. 

(c)  Does  v  show  the  correct  behavior  as  t  approaches  infinity? 

>  Hint,  p.  958  - 

26  The  purpose  of  this  problem  is  to  estimate  the  height  of  the 
tides.  The  main  reason  for  the  tides  is  the  moon’s  gravity,  and  we’ll 
neglect  the  effect  of  the  sun.  Also,  real  tides  are  heavily  influenced 
by  landforms  that  channel  the  flow  of  water,  but  we’ll  think  of  the 
earth  as  if  it  was  completely  covered  with  oceans.  Under  these 
assumptions,  the  ocean  surface  should  be  a  surface  of  constant  U/m. 
That  is,  a  thimbleful  of  water,  m,  should  not  be  able  to  gain  or  lose 
any  gravitational  energy  by  moving  from  one  point  on  the  ocean 
surface  to  another.  If  only  the  spherical  earth’s  gravity  was  present, 
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then  we’d  have  U/m  =  —GMe/r,  and  a  surface  of  constant  U/m 
would  be  a  surface  of  constant  r,  i.e.,  the  ocean’s  surface  would  be 
spherical.  Taking  into  account  the  moon’s  gravity,  the  main  effect  is 
to  shift  the  center  of  the  sphere,  but  the  sphere  also  becomes  slightly 
distorted  into  an  approximately  ellipsoidal  shape.  (The  shift  of  the 
center  is  not  physically  related  to  the  tides,  since  the  solid  part  of 
the  earth  tends  to  be  centered  within  the  oceans;  really,  this  effect 
has  to  do  with  the  motion  of  the  whole  earth  through  space,  and 
the  way  that  it  wobbles  due  to  the  moon’s  gravity.)  Determine  the 
amount  by  which  the  long  axis  of  the  ellipsoid  exceeds  the  short 
axis.  >  Hint,  p.  958  ■ 

27  You  are  considering  going  on  a  space  voyage  to  Mars,  in  which 

your  route  would  be  half  an  ellipse,  tangent  to  the  Earth’s  orbit  at 
one  end  and  tangent  to  Mars’  orbit  at  the  other.  Your  spacecraft’s 
engines  will  only  be  used  at  the  beginning  and  end,  not  during  the 
voyage.  How  long  would  the  outward  leg  of  your  trip  last?  (The 
orbits  of  Earth  and  Mars  are  nearly  circular,  and  Mars’s  is  bigger 
by  a  factor  of  1.52.)  V  ^ 

28  When  you  buy  a  helium-filled  balloon,  the  seller  has  to  inflate 
it  from  a  large  metal  cylinder  of  the  compressed  gas.  The  helium 
inside  the  cylinder  has  energy,  as  can  be  demonstrated  for  example 
by  releasing  a  little  of  it  into  the  air:  you  hear  a  hissing  sound, 
and  that  sound  energy  must  have  come  from  somewhere.  The  total 
amount  of  energy  in  the  cylinder  is  very  large,  and  if  the  valve  is 
inadvertently  damaged  or  broken  off,  the  cylinder  can  behave  like  a 
bomb  or  a  rocket. 

Suppose  the  company  that  puts  the  gas  in  the  cylinders  prepares 
cylinder  A  with  half  the  normal  amount  of  pure  helium,  and  cylinder 
B  with  the  normal  amount.  Cylinder  B  has  twice  as  much  energy, 
and  yet  the  temperatures  of  both  cylinders  are  the  same.  Explain,  at 
the  atomic  level,  what  form  of  energy  is  involved,  and  why  cylinder 
B  has  twice  as  much.  ■ 

29  Explain  in  terms  of  conservation  of  energy  why  sweating 
cools  your  body,  even  though  the  sweat  is  at  the  same  temperature 
as  your  body.  Describe  the  forms  of  energy  involved  in  this  energy 
transformation.  Why  don’t  you  get  the  same  cooling  effect  if  you 
wipe  the  sweat  off  with  a  towel?  Hint:  The  sweat  is  evaporating. 

30  [This  problem  is  now  problem  3-73.]  1 

31  All  stars,  including  our  sun,  show  variations  in  their  light 
output  to  some  degree.  Some  stars  vary  their  brightness  by  a  factor 
of  two  or  even  more,  but  our  sun  has  remained  relatively  steady  dur¬ 
ing  the  hundred  years  or  so  that  accurate  data  have  been  collected. 
Nevertheless,  it  is  possible  that  climate  variations  such  as  ice  ages 
are  related  to  long-term  irregularities  in  the  sun’s  light  output.  If 
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the  sun  was  to  increase  its  light  output  even  slightly,  it  could  melt 
enough  Antarctic  ice  to  flood  all  the  world’s  coastal  cities.  The  total 
sunlight  that  falls  on  Antarctica  amounts  to  about  1  x  1016  watts. 
In  the  absence  of  natural  or  human-caused  climate  change,  this  heat 
input  to  the  poles  is  balanced  by  the  loss  of  heat  via  winds,  ocean 
currents,  and  emission  of  infrared  light,  so  that  there  is  no  net  melt¬ 
ing  or  freezing  of  ice  at  the  poles  from  year  to  year.  Suppose  that 
the  sun  changes  its  light  output  by  some  small  percentage,  but  there 
is  no  change  in  the  rate  of  heat  loss  by  the  polar  caps.  Estimate  the 
percentage  by  which  the  sun’s  light  output  would  have  to  increase 
in  order  to  melt  enough  ice  to  raise  the  level  of  the  oceans  by  10  me¬ 
ters  over  a  period  of  10  years.  (This  would  be  enough  to  flood  New 
York,  London,  and  many  other  cities.)  Melting  1  kg  of  ice  requires 
3  x  103  J. 

32  The  figure  shows  the  oscillation  of  a  microphone  in  response 

to  the  author  whistling  the  musical  note  “A.”  The  horizontal  axis, 
representing  time,  has  a  scale  of  1.0  ms  per  square.  Find  the  period 
T,  the  frequency  /,  and  the  angular  frequency  ui.  'J 

33  (a)  A  mass  nn  is  hung  from  a  spring  whose  spring  constant  is 

k.  Write  down  an  expression  for  the  total  interaction  energy  of  the 
system,  U,  and  find  its  equilibrium  position.  >  Hint,  p.  958 

(b)  Explain  how  you  could  use  your  result  from  part  a  to  determine 
an  unknown  spring  constant.  - 

34  A  certain  mass,  when  hung  from  a  certain  spring,  causes 

the  spring  to  stretch  by  an  amount  h  compared  to  its  equilibrium 
length.  If  the  mass  is  displaced  vertically  from  this  equilibrium,  it 
will  oscillate  up  and  down  with  a  period  Tosc.  Give  a  numerical 
comparison  between  Tosc  and  Tfau,  the  time  required  for  the  mass 
to  fall  from  rest  through  a  height  h,  when  it  isn’t  attached  to  the 
spring.  (You  will  need  the  result  of  problem  33).  v  ^ 

35  Find  the  period  of  vertical  oscillations  of  the  mass  m.  The 
spring,  pulley,  and  ropes  have  negligible  mass. 

o  Hint,  p.  958  V  - 

36  The  equilibrium  length  of  each  spring  in  the  figure  is  b,  so 
when  the  mass  m  is  at  the  center,  neither  spring  exerts  any  force 
on  it.  When  the  mass  is  displaced  to  the  side,  the  springs  stretch; 
their  spring  constants  are  both  k. 

(a)  Find  the  energy,  U ,  stored  in  the  springs,  as  a  function  of  y,  the 

distance  of  the  mass  up  or  down  from  the  center.  'J 

(b)  Show  that  the  period  of  small  up-down  oscillations  is  infinite. 


Problem  32. 


1_ 


Problem  35. 


< - b — - b - ► 

Problem  36. 


Problem  37. 
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37  Two  springs  with  spring  constants  k\  and  k^  are  put  to¬ 
gether  end-to-end.  Let  x\  be  the  amount  by  which  the  first  spring 
is  stretched  relative  to  its  equilibrium  length,  and  similarly  for  x'2-  If 
the  combined  double  spring  is  stretched  by  an  amount  b  relative  to 
its  equilibrium  length,  then  b  =  x\  +  X2-  Find  the  spring  constant, 
K .  of  the  combined  spring  in  terms  of  k±  and  k^- 

>  Hint,  p.  959  >  Answer,  p.  969  V 


38  A  mass  m  on  a  spring  oscillates  around  an  equilibrium  at 
x  =  0.  Any  function  U(x)  with  an  equilibrium  at  x  =  0  can  be 
approximated  as  U(x)  =  (l/2)kx2,  and  if  the  energy  is  symmet¬ 
ric  with  respect  to  positive  and  negative  values  of  x ,  then  the  next 
level  of  improvement  in  such  an  approximation  would  be  U(x)  = 
(l/2)£ix2  +  6x4.  The  general  idea  here  is  that  any  smooth  function 
can  be  approximated  locally  by  a  polynomial,  and  if  you  want  a  bet¬ 
ter  approximation,  you  can  use  a  polynomial  with  more  terms  in  it. 
When  you  ask  your  calculator  to  calculate  a  function  like  sin  or  ex, 
it’s  using  a  polynomial  approximation  with  10  or  12  terms.  Physi¬ 
cally,  a  spring  with  a  positive  value  of  b  gets  stiffer  when  stretched 
strongly  than  an  “ideal”  spring  with  6  =  0.  A  spring  with  a  nega¬ 
tive  6  is  like  a  person  who  cracks  under  stress  —  when  you  stretch 
it  too  much,  it  becomes  more  elastic  than  an  ideal  spring  would. 
We  should  not  expect  any  spring  to  give  totally  ideal  behavior  no 
matter  no  matter  how  much  it  is  stretched.  For  example,  there  has 
to  be  some  point  at  which  it  breaks. 

Do  a  numerical  simulation  of  the  oscillation  of  a  mass  on  a  spring 
whose  energy  has  a  nonvanishing  6.  Is  the  period  still  independent 
of  amplitude?  Is  the  amplitude-independent  equation  for  the  period 
still  approximately  valid  for  small  enough  amplitudes?  Does  the 
addition  of  a  positive  x 4  term  tend  to  increase  the  period,  or  decrease 
it?  Include  a  printout  of  your  program  and  its  output  with  your 
homework  paper.  ■ 

39  An  idealized  pendulum  consists  of  a  pointlike  mass  m  on  the 
end  of  a  massless,  rigid  rod  of  length  L.  Its  amplitude,  9,  is  the  angle 
the  rod  makes  with  the  vertical  when  the  pendulum  is  at  the  end 
of  its  swing.  Write  a  numerical  simulation  to  determine  the  period 
of  the  pendulum  for  any  combination  of  m,  L,  and  9.  Examine  the 
effect  of  changing  each  variable  while  manipulating  the  others.  S 

40  A  ball  falls  from  a  height  h.  Without  air  resistance,  the 
time  it  takes  to  reach  the  floor  is  y/2 h/g.  A  numerical  version 
of  this  calculation  was  given  in  program  time2  on  page  92.  Now 
suppose  that  air  resistance  is  not  negligible.  For  a  smooth  sphere  of 
radius  r,  moving  at  speed  v  through  air  of  density  p,  the  amount  of 
energy  d Q  dissipated  as  heat  as  the  ball  falls  through  a  height  d y  is 
given  (ignoring  signs)  by  dQ  =  (it /4:)pv2r2  dy.  Modify  the  program 
to  incorporate  this  effect,  and  find  the  resulting  change  in  the  fall 
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time  in  the  case  of  a  21  g  ball  of  radius  1.0  cm,  falling  from  a  height 
of  1.0  m.  The  density  of  air  at  sea  level  is  about  1.2  kg/rn3.  Turn  in 
a  printout  of  both  your  program  and  its  output.  Answer:  0.34  ms. 

■ 

41  The  factorial  of  an  integer  n,  written  n\,  is  defined  as  the 
product  of  all  the  positive  integers  less  than  or  equal  to  n.  For 
example,  3!  =  1x2x3  =  6.  Write  a  Python  program  to  compute  the 
factorial  of  a  number.  Test  it  with  a  small  number  whose  factorial 
you  can  check  by  hand.  Then  use  it  to  compute  30!.  (Python 
computes  integer  results  with  unlimited  precision,  so  you  won’t  get 
any  problems  with  rounding  or  overflows.)  Turn  in  a  printout  of 
your  program  and  its  output,  including  the  test. 

42  Estimate  the  kinetic  energy  of  a  buzzing  fly’s  wing.  (You  may 
wish  to  review  subsection  0.2.3  on  order-of-magnitude  estimates.) 

■ 

43  A  blade  of  grass  moves  upward  as  it  grows.  Estimate  its 

kinetic  energy.  (You  may  wish  to  review  subsection  0.2.3  on  order- 
of-magnitude  estimates.)  S 


Key  to  symbols: 

easy  -  typical  S  challenging  )i  difficult  ■  very  difficult 
V  An  answer  check  is  available  at  www.lightandmatter.com. 
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Exercises 

Exercise  2A:  Reasoning  with  Ratios  and  Powers 

Equipment: 

ping-pong  balls  and  paddles 
two-meter  sticks 

You  have  probably  bounced  a  ping  pong  ball  straight  up  and  down  in  the  air.  The  time  between 
hits  is  related  to  the  height  to  which  you  hit  the  ball.  If  you  take  twice  as  much  time  between 
hits,  how  many  times  higher  do  you  think  you  will  have  to  hit  the  ball?  Write  down  your 
hypothesis: _ 

Your  instructor  will  first  beat  out  a  tempo  of  240  beats  per  minute  (four  beats  per  second), 
which  you  should  try  to  match  with  the  ping-pong  ball.  Measure  the  height  to  which  the  ball 
rises: _ 

Now  try  it  at  120  beats  per  minute: _ 

Compare  your  hypothesis  and  your  results  with  the  rest  of  the  class. 

Exercise  2B:  The  Shell  Theorem 

This  exercise  is  an  approximate  numerical  test  of  the  shell  theorem.  There  are  seven  masses 
A-G,  each  being  one  kilogram.  Masses  A-E,  each  one  meter  from  the  center,  form  a  shape  like 
two  Egyptian  pyramids  joined  at  their  bases;  this  is  a  rough  approximation  to  a  six-kilogram 
spherical  shell  of  mass.  Mass  G  is  five  meters  from  the  center  of  the  main  group.  The  class  will 
divide  into  six  groups  and  split  up  the  work  required  in  order  to  calculate  the  total  gravitational 
energy  of  mass  G. 

A 


C 


B 


1.  Each  group  should  write  its  results  on  the  board  in  units  of  picojoules,  retaining  six  significant 
figures  of  precision. 

2.  The  class  will  add  the  results  and  compare  with  the  result  that  would  be  obtained  with  the 
shell  theorem. 
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Exercise  2C:  Vibrations 

Equipment: 

•  air  track  and  carts  of  two  different  masses 

•  springs 

•  weights 


spring 


Place  the  cart  on  the  air  track  and  attach  springs  so  that  it  can  vibrate. 

1.  Test  whether  the  period  of  vibration  depends  on  amplitude.  Try  at  least  one  moderate 
amplitude,  for  which  the  springs  do  not  go  slack,  at  least  one  amplitude  that  is  large  enough  so 
that  they  do  go  slack,  and  one  amplitude  that’s  the  very  smallest  you  can  possibly  observe. 

2.  Try  a  cart  with  a  different  mass.  Does  the  period  change  by  the  expected  factor,  based  on 
the  equation  T  =  27Ty/m//c? 

3.  In  homework  problem  33  on  page  123,  you  showed  that  a  spring’s  spring  constant  can  be 
determined  by  hanging  a  weight  from  it.  Use  this  technique  to  find  the  spring  constant  of  each 
of  the  two  springs.  The  equivalent  spring  constant  of  these  two  springs,  attached  to  the  cart  in 
this  way,  can  be  found  by  adding  their  spring  constants. 

4.  Test  the  equation  T  =  27t \J m/k  numerically. 
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Chapter  3 

Conservation  of  Momentum 


I  think,  therefore  I  am. 

I  hope  that  posterity  will  judge  me  kindly,  not  only  as  to  the  things 
which  I  have  explained,  but  also  to  those  which  I  have  intentionally 
omitted  so  as  to  leave  to  others  the  pleasure  of  discovery. 

Rene  Descartes 
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a  /  Systems  consisting  of  material 
particles  that  interact  through  an 
energy  U(r).  Top:  The  galaxy 
Ml 00.  Here  the  “particles”  are 
stars.  Middle:  The  pool  balls 
don’t  interact  until  they  come  to¬ 
gether  and  become  compressed; 
the  energy  U(r)  has  a  sharp 
upturn  when  the  center-to-center 
distance  r  gets  small  enough 
for  the  balls  to  be  in  contact. 
Bottom:  A  uranium  nucleus 

undergoing  fission.  The  energy 
U(r)  has  a  repulsive  contribution 
from  the  electrical  interactions 
of  the  protons,  plus  an  attractive 
one  due  to  the  strong  nuclear 
interaction.  (Ml 00:  Hubble  Space 
Telescope  image.) 


3.1  Momentum  In  One  Dimension 

3.1.1  Mechanical  momentum 

In  the  martial  arts  movie  Crouching  Tiger,  Hidden  Dragon ,  those 
who  had  received  mystical  enlightenment  are  able  to  violate  the  laws 
of  physics.  Some  of  the  violations  are  obvious,  such  as  their  ability 
to  fly,  but  others  are  a  little  more  subtle.  The  rebellious  young 
heroine/antiheroine  Jen  Yu  gets  into  an  argument  while  sitting  at  a 
table  in  a  restaurant.  A  young  tough,  Iron  Arm  Lu,  comes  running 
toward  her  at  full  speed,  and  she  puts  up  one  arm  and  effortlessly 
makes  him  bounce  back,  without  even  getting  out  of  her  seat  or 
bracing  herself  against  anything.  She  does  all  this  between  bites. 

Although  kinetic  energy  doesn’t  depend  on  the  direction  of  mo¬ 
tion,  we’ve  already  seen  on  page  89  how  conservation  of  energy  com¬ 
bined  with  Galilean  relativity  allows  us  to  make  some  predictions 
about  the  direction  of  motion.  One  of  the  examples  was  a  demon¬ 
stration  that  it  isn’t  possible  for  a  hockey  puck  to  spontaneously  re¬ 
verse  its  direction  of  motion.  In  the  scene  from  the  movie,  however, 
the  woman’s  assailant  isn’t  just  gliding  through  space.  He’s  inter¬ 
acting  with  her,  so  the  previous  argument  doesn’t  apply  here,  and 
we  need  to  generalize  it  to  more  than  one  object.  We  consider  the 
case  of  a  physical  system  composed  of  pointlike  material  particles, 
in  which  every  particle  interacts  with  every  other  particle  through 
an  energy  U(r)  that  depends  only  on  the  distance  r  between  them. 
This  still  allows  for  a  fairly  general  mechanical  system ,  by  which  I 
mean  roughly  a  system  made  of  matter,  not  light.  The  characters 
in  the  movie  are  made  of  protons,  neutrons,  and  electrons,  so  they 
would  constitute  such  a  system  if  the  interactions  among  all  these 
particles  were  of  the  form  U(r).1  We  might  even  be  able  to  get 
away  with  thinking  of  each  person  as  one  big  particle,  if  it’s  a  good 
approximation  to  say  that  every  part  of  each  person’s  whole  body 
moves  in  the  same  direction  at  the  same  speed. 

The  basic  insight  can  be  extracted  from  the  special  case  where 
there  are  only  two  particles  interacting,  and  they  only  move  in  one 
dimension,  as  in  the  example  shown  in  figure  b.  Conservation  of 
energy  says 

Ku  +  K2i  +  Ui  =  Klf  +  K2  f  +  Uf. 

For  simplicity,  let’s  assume  that  the  interactions  start  after  the  time 
we’re  calling  initial,  and  end  before  the  instant  we  choose  as  final. 
This  is  true  in  figure  b,  for  example.  Then  Ui  =  Uf,  and  we  can 
subtract  the  interaction  energies  from  both  sides,  giving. 

K\t  +  K2i  =  Rif  +  K2f 
\rnivli  +  ^m2  v22i  =  xvfj  +  ^m2v%f. 

1Electrical  and  magnetic  interactions  don’t  quite  behave  like  this,  which  is  a 
point  we’ll  take  up  later  in  the  book. 
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As  in  the  one-particle  argument  on  page  89,  the  trick  is  to  require 
conservation  of  energy  not  just  in  one  particular  frame  of  reference, 
but  in  every  frame  of  reference.  In  a  frame  of  reference  moving  at 
velocity  u  relative  to  the  first  one,  the  velocities  all  have  u  added 
onto  them:2 

^mi(vii  +  u)2+^m2{v2i  +  u)2  =  mni(vif  +  u)2  +  ^m2{v2f  +  u)2 

When  we  square  a  quantity  like  (vu  +  u)2,  we  get  the  same  that 
occurred  in  the  original  frame  of  reference,  plus  two  ^-dependent 
terms,  2 vuu  +  u2.  Subtracting  the  original  conservation  of  energy 
equation  from  the  version  in  the  new  frame  of  reference,  we  have 

mivuu  +  m2v2iU  =  m\VifU  +  m2v2fU, 


or,  dividing  by  u, 


mivii  +  m2V2i  =  mivif  +  m2v2f. 


This  is  a  statement  that  when  you  add  up  mv  for  the  whole  system, 
that  total  remains  constant  over  time.  In  other  words,  this  is  a 
conservation  law.  The  quantity  mv  is  called  momentum ,  notated  p 
for  obscure  historical  reasons.  Its  units  are  kg  •  m/s. 

Unlike  kinetic  energy,  momentum  depends  on  the  direction  of 
motion,  since  the  velocity  is  not  squared.  In  one  dimension,  motion 
in  the  same  direction  as  the  positive  x  axis  is  represented  with  posi¬ 
tive  values  of  v  and  p.  Motion  in  the  opposite  direction  has  negative 
v  and  p. 

Jen  Yu  meets  Iron  Arm  Lu  example  1 

>  Initially,  Jen  Yu  is  at  rest,  and  Iron  Arm  Lu  is  charging  to  the  left, 
toward  her,  at  5  m/s.  Jen  Yu’s  mass  is  50  kg,  and  Lu’s  is  100  kg. 
After  the  collision,  the  movie  shows  Jen  Yu  still  at  rest,  and  Lu 
rebounding  at  5  m/s  to  the  right.  Is  this  consistent  with  the  laws 
of  physics,  or  would  it  be  impossible  in  real  life? 

>  This  is  perfectly  consistent  with  conservation  of  mass  (50  kg+1 00 
kg=50  kg+1 00  kg),  and  also  with  conservation  of  energy,  since 
neither  person’s  kinetic  energy  changes,  and  there  is  therefore 
no  change  in  the  total  energy.  (We  don’t  have  to  worry  about 
interaction  energies,  because  the  two  points  in  time  we’re  con¬ 
sidering  are  ones  at  which  the  two  people  aren’t  interacting.)  To 
analyze  whether  the  scene  violates  conservation  of  momentum, 
we  have  to  pick  a  coordinate  system.  Let’s  define  positive  as 

2  We  can  now  see  that  the  derivation  would  have  been  equally  valid  for  t/j  A 
Uf.  The  two  observers  agree  on  the  distance  between  the  particles,  so  they 
also  agree  on  the  interaction  energies,  even  though  they  disagree  on  the  kinetic 
energies. 


v1i 

v2i 

•— 

©> 

i  1  ; 

v1f 

v2f 

o- 

0 - - 

b  /  A  collision  between  two 
pool  balls  is  seen  in  two  different 
frames  of  reference.  The  solid 
ball  catches  up  with  the  striped 
ball.  Velocities  are  shown  with 
arrows.  The  second  observer  is 
moving  to  the  left  at  velocity  u 
compared  to  the  first  observer, 
so  all  the  velocities  in  the  second 
frame  have  u  added  onto  them. 
The  two  observers  must  agree  on 
conservation  of  energy. 
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c/The  ion  drive  engine  of 
the  NASA  Deep  Space  1  probe, 
shown  under  construction  (top) 
and  being  tested  in  a  vacuum 
chamber  (bottom)  prior  to  its 
October  1998  launch.  Intended 
mainly  as  a  test  vehicle  for 
new  technologies,  the  craft 
nevertheless  also  carried  out  a 
scientific  program  that  included 
a  rendezvous  with  a  comet  in 
2004.  {NASA) 


being  to  the  right.  The  initial  momentum  is  (50  kg)(0  m/s)+(100 
kg)(-5  m/s)=-500  kg  •  m/s,  and  the  final  momentum  is  (50  kg)(0 
m/s)+(100  kg) (5  m/s)=500  kg  ■  m/s.  This  is  a  change  of  1000 
kg  •  m/s,  which  is  impossible  if  the  two  people  constitute  a  closed 
system. 

One  could  argue  that  they’re  not  a  closed  system,  since  Lu  might 
be  exchanging  momentum  with  the  floor,  and  Jen  Yu  might  be 
exchanging  momentum  with  the  seat  of  her  chair.  This  is  a  rea¬ 
sonable  objection,  but  in  the  following  section  we’ll  see  that  there 
are  physical  reasons  why,  in  this  situation,  the  force  of  friction 
would  be  relatively  weak,  and  would  not  be  able  to  transfer  that 
much  momentum  in  a  fraction  of  a  second. 

This  example  points  to  an  intuitive  interpretation  of  conserva¬ 
tion  of  momentum,  which  is  that  interactions  are  always  mutual. 
That  is,  Jen  Yu  can’t  change  Lu’s  momentum  without  having  her 
own  momentum  changed  as  well. 

A  cannon  example  2 

>  A  cannon  of  mass  1000  kg  fires  a  10-kg  shell  at  a  velocity  of 
200  m/s.  At  what  speed  does  the  cannon  recoil? 

>  The  law  of  conservation  of  momentum  tells  us  that 

PcannonJ  +  PshellJ  =  PcannonJ  +  PshellJ- 

Choosing  a  coordinate  system  in  which  the  cannon  points  in  the 
positive  direction,  the  given  information  is 

PcannonJ  =  0 
Pshell,  i  =  0 
psheiij  =  2000  kg  •  m/s. 

We  must  have  PcannonJ  =  -2000  kg  •  m/s,  so  the  recoil  velocity  of 
the  cannon  is  2  m/s. 

Ion  drive  example  3 

>  The  experimental  solar-powered  ion  drive  of  the  Deep  Space 
1  space  probe  expels  its  xenon  gas  exhaust  at  a  speed  of  30,000 
m/s,  ten  times  faster  than  the  exhaust  velocity  for  a  typical  chemical- 
fuel  rocket  engine.  Roughly  how  many  times  greater  is  the  maxi¬ 
mum  speed  this  spacecraft  can  reach,  compared  with  a  chemical- 
fueled  probe  with  the  same  mass  of  fuel  (“reaction  mass”)  avail¬ 
able  for  pushing  out  the  back  as  exhaust? 

>  Momentum  equals  mass  multiplied  by  velocity.  Both  spacecraft 
are  assumed  to  have  the  same  amount  of  reaction  mass,  and  the 
ion  drive’s  exhaust  has  a  velocity  ten  times  greater,  so  the  mo¬ 
mentum  of  its  exhaust  is  ten  times  greater.  Before  the  engine 
starts  firing,  neither  the  probe  nor  the  exhaust  has  any  momen¬ 
tum,  so  the  total  momentum  of  the  system  is  zero.  By  conserva¬ 
tion  of  momentum,  the  total  momentum  must  also  be  zero  after 
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all  the  exhaust  has  been  expelled.  If  we  define  the  positive  di¬ 
rection  as  the  direction  the  spacecraft  is  going,  then  the  negative 
momentum  of  the  exhaust  is  canceled  by  the  positive  momen¬ 
tum  of  the  spacecraft.  The  ion  drive  allows  a  final  speed  that  is 
ten  times  greater.  (This  simplified  analysis  ignores  the  fact  that 
the  reaction  mass  expelled  later  in  the  burn  is  not  moving  back¬ 
ward  as  fast,  because  of  the  forward  speed  of  the  already-moving 
spacecraft.) 


3.1.2  Nonmechanical  momentum 

So  far,  it  sounds  as  though  conservation  of  momentum  can  be 
proved  mathematically,  unlike  conservation  of  mass  and  energy,  which 
are  entirely  based  on  observations.  The  proof,  however,  was  only  for 
a  mechanical  system,  with  interactions  of  the  form  U{r).  Conser¬ 
vation  of  momentum  can  be  extended  to  other  systems  as  well,  but 
this  generalization  is  based  on  experiments,  not  mathematical  proof. 
Light  is  the  most  important  example  of  momentum  that  doesn’t 
equal  mv  —  light  doesn’t  have  mass  at  all,  but  it  does  have  mo¬ 
mentum.  For  example,  a  flashlight  left  on  for  an  hour  would  absorb 
about  ICC5  kg  •  m/s  of  momentum  as  it  recoiled. 

Halley’s  comet  example  4 

Momentum  is  not  always  equal  to  mv.  Halley’s  comet,  shown  in 
figure  d,  has  a  very  elongated  elliptical  orbit,  like  those  of  many 
other  comets.  About  once  per  century,  its  orbit  brings  it  close 
to  the  sun.  The  comet’s  head,  or  nucleus,  is  composed  of  dirty 
ice,  so  the  energy  deposited  by  the  intense  sunlight  gradually  re¬ 
moves  ice  from  the  surface  and  turns  it  into  water  vapor.  The 
bottom  photo  shows  a  view  of  the  water  coming  off  of  the  nucleus 
from  the  European  Giotto  space  probe,  which  passed  within  596 
km  of  the  comet’s  head  on  March  13,  1986. 

The  sunlight  does  not  just  carry  energy,  however.  It  also  carries 
momentum.  Once  the  steam  comes  off,  the  momentum  of  the 
sunlight  impacting  on  it  pushes  it  away  from  the  sun,  forming  a 
tail  as  shown  in  in  the  top  image.  The  tail  always  points  away 
from  the  sun,  so  when  the  comet  is  receding  from  the  sun,  the  tail 
is  in  front.  By  analogy  with  matter,  for  which  momentum  equals 
mv,  you  would  expect  that  massless  light  would  have  zero  mo¬ 
mentum,  but  the  equation  p  =  mv  is  not  the  correct  one  for  light, 
and  light  does  have  momentum.  (Some  comets  also  have  a  sec¬ 
ond  tail,  which  is  propelled  by  electrical  forces  rather  than  by  the 
momentum  of  sunlight.) 

The  reason  for  bringing  this  up  is  not  so  that  you  can  plug 
numbers  into  formulas  in  these  exotic  situations.  The  point  is  that 
the  conservation  laws  have  proven  so  sturdy  exactly  because  they 
can  easily  be  amended  to  fit  new  circumstances.  The  momentum  of 
light  will  be  a  natural  consequence  of  the  discussion  of  the  theory 
of  relativity  in  chapter  7. 


d  /  Halley’s  comet.  Top:  A 
photograph  made  from  earth. 
Bottom:  A  view  of  the  nucleus 
from  the  Giotto  space  probe.  ( W. 
Liller  and  European  Space  Agency) 
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e  /  Examples  5  and  6.  The 
momenta  cancel,  but  the  ener¬ 
gies  don’t. 


3.1.3  Momentum  compared  to  kinetic  energy 

Momentum  and  kinetic  energy  are  both  measures  of  the  quantity 
of  motion,  and  a  sideshow  in  the  Newton-Leibniz  controversy  over 
who  invented  calculus  was  an  argument  over  whether  mv  (i.e. ,  mo¬ 
mentum)  or  mv 2  (i.e.,  kinetic  energy  without  the  1/2  in  front)  was 
the  “true”  measure  of  motion.  The  modern  student  can  certainly 
be  excused  for  wondering  why  we  need  both  quantities,  when  their 
complementary  nature  was  not  evident  to  the  greatest  minds  of  the 
1700s.  The  following  table  highlights  their  differences. 


Kinetic  energy... 

Momentum... 

doesn’t  depend  on  direc¬ 
tion. 

depends  on  direction. 

is  always  positive,  and  can¬ 
not  cancel  out. 

cancels  with  momentum  in 
the  opposite  direction. 

can  be  traded  for  forms  of 
energy  that  do  not  involve 
motion.  Kinetic  energy  is 
not  a  conserved  quantity 
by  itself. 

is  always  conserved  in  a 
closed  system. 

is  quadrupled  if  the  veloc¬ 
ity  is  doubled. 

is  doubled  if  the  velocity  is 
doubled. 

Here  are  some  examples  that  show  the  different  behaviors  of  the 
two  quantities. 

A  spinning  top  example  5 

A  spinning  top  has  zero  total  momentum,  because  for  every  mov¬ 
ing  point,  there  is  another  point  on  the  opposite  side  that  cancels 
its  momentum.  It  does,  however,  have  kinetic  energy. 

Why  a  tuning  fork  has  two  prongs  example  6 

A  tuning  fork  is  made  with  two  prongs  so  that  they  can  vibrate  in 
opposite  directions,  canceling  their  momenta.  In  a  hypothetical 
version  with  only  one  prong,  the  momentum  would  have  to  oscil¬ 
late,  and  this  momentum  would  have  to  come  from  somewhere, 
such  as  the  hand  holding  the  fork.  The  result  would  be  that  vi¬ 
brations  would  be  transmitted  to  the  hand  and  rapidly  die  out. 
In  a  two-prong  fork,  the  two  momenta  cancel,  but  the  energies 
don’t. 

Momentum  and  kinetic  energy  in  firing  a  rifle  example  7 

The  rifle  and  bullet  have  zero  momentum  and  zero  kinetic  energy 
to  start  with.  When  the  trigger  is  pulled,  the  bullet  gains  some  mo¬ 
mentum  in  the  forward  direction,  but  this  is  canceled  by  the  rifle’s 
backward  momentum,  so  the  total  momentum  is  still  zero.  The 
kinetic  energies  of  the  gun  and  bullet  are  both  positive  numbers, 
however,  and  do  not  cancel.  The  total  kinetic  energy  is  allowed  to 
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increase,  because  kinetic  energy  is  being  traded  for  other  forms 
of  energy.  Initially  there  is  chemical  energy  in  the  gunpowder. 
This  chemical  energy  is  converted  into  heat,  sound,  and  kinetic 
energy.  The  gun’s  “backward”  kinetic  energy  does  not  refrigerate 
the  shooter’s  shoulder! 

The  wobbly  earth  example  8 

As  the  moon  completes  half  a  circle  around  the  earth,  its  motion 
reverses  direction.  This  does  not  involve  any  change  in  kinetic 
energy.  The  reversed  velocity  does,  however,  imply  a  reversed 
momentum,  so  conservation  of  momentum  in  the  closed  earth- 
moon  system  tells  us  that  the  earth  must  also  reverse  its  momen¬ 
tum.  In  fact,  the  earth  wobbles  in  a  little  “orbit”  about  a  point  below 
its  surface  on  the  line  connecting  it  and  the  moon.  The  two  bod¬ 
ies’  momenta  always  point  in  opposite  directions  and  cancel  each 
other  out. 

The  earth  and  moon  get  a  divorce  example  9 

Why  can’t  the  moon  suddenly  decide  to  fly  off  one  way  and  the 
earth  the  other  way?  It  is  not  forbidden  by  conservation  of  mo¬ 
mentum,  because  the  moon’s  newly  acquired  momentum  in  one 
direction  could  be  canceled  out  by  the  change  in  the  momen¬ 
tum  of  the  earth,  supposing  the  earth  headed  off  in  the  opposite 
direction  at  the  appropriate,  slower  speed.  The  catastrophe  is  for¬ 
bidden  by  conservation  of  energy,  because  their  energies  would 
have  to  increase  greatly. 

Momentum  and  kinetic  energy  of  a  glacier  example  1 0 

A  cubic-kilometer  glacier  would  have  a  mass  of  about  1012  kg.  If 
it  moves  at  a  speed  of  10~5  m/s,  then  its  momentum  is  107  kg  • 
m/s.  This  is  the  kind  of  heroic-scale  result  we  expect,  perhaps 
the  equivalent  of  the  space  shuttle  taking  off,  or  all  the  cars  in  LA 
driving  in  the  same  direction  at  freeway  speed.  Its  kinetic  energy, 
however,  is  only  50  J,  the  equivalent  of  the  calories  contained 
in  a  poppy  seed  or  the  energy  in  a  drop  of  gasoline  too  small 
to  be  seen  without  a  microscope.  The  surprisingly  small  kinetic 
energy  is  because  kinetic  energy  is  proportional  to  the  square  of 
the  velocity,  and  the  square  of  a  small  number  is  an  even  smaller 
number. 

Discussion  Questions 

A  If  all  the  air  molecules  in  the  room  settled  down  in  a  thin  film  on  the 
floor,  would  that  violate  conservation  of  momentum  as  well  as  conserva¬ 
tion  of  energy? 

B  A  refrigerator  has  coils  in  the  back  that  get  hot,  and  heat  is  molecular 
motion.  These  moving  molecules  have  both  energy  and  momentum.  Why 
doesn’t  the  refrigerator  need  to  be  tied  to  the  wall  to  keep  it  from  recoiling 
from  the  momentum  it  loses  out  the  back? 


0 


f  /  Example  9. 
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3.1.4  Collisions  in  one  dimension 


g/This  Hubble  Space  Tele¬ 
scope  photo  shows  a  small 
galaxy  (yellow  blob  in  the  lower 
right)  that  has  collided  with  a 
larger  galaxy  (spiral  near  the 
center),  producing  a  wave  of  star 
formation  (blue  track)  due  to  the 
shock  waves  passing  through 
the  galaxies’  clouds  of  gas.  This 
is  considered  a  collision  in  the 
physics  sense,  even  though  it  is 
statistically  certain  that  no  star  in 
either  galaxy  ever  struck  a  star 
in  the  other  —  the  stars  are  very 
small  compared  to  the  distances 
between  them.(A//\S4) 


Physicists  employ  the  term  “collision”  in  a  broader  sense  than  in 
ordinary  usage,  applying  it  to  any  situation  where  objects  interact 
for  a  certain  period  of  time.  A  bat  hitting  a  baseball,  a  cosmic  ray 
damaging  DNA,  and  a  gun  and  a  bullet  going  their  separate  ways 
are  all  examples  of  collisions  in  this  sense.  Physical  contact  is  not 
even  required.  A  comet  swinging  past  the  sun  on  a  hyperbolic  orbit 
is  considered  to  undergo  a  collision,  even  though  it  never  touches 
the  sun.  All  that  matters  is  that  the  comet  and  the  sun  interacted 
gravitationally  with  each  other. 

The  reason  for  broadening  the  term  “collision”  in  this  way  is 
that  all  of  these  situations  can  be  attacked  mathematically  using 
the  same  conservation  laws  in  similar  ways.  In  our  first  example, 
conservation  of  momentum  is  all  that  is  required. 

Getting  rear-ended  example  1 1 

>  Ms.  Chang  is  rear-ended  at  a  stop  light  by  Mr.  Nelson,  and  sues 
to  make  him  pay  her  medical  bills.  He  testifies  that  he  was  only 
going  55  km  per  hour  when  he  hit  Ms.  Chang.  She  thinks  he  was 
going  much  faster  than  that.  The  cars  skidded  together  after  the 
impact,  and  measurements  of  the  length  of  the  skid  marks  show 
that  their  joint  velocity  immediately  after  the  impact  was  30  km 
per  hour.  Mr.  Nelson’s  Nissan  has  a  mass  of  1400  kg,  and  Ms. 
Chang ’s  Cadillac  is  2400  kg.  Is  Mr.  Nelson  telling  the  truth? 

>  Since  the  cars  skidded  together,  we  can  write  down  the  equation 
for  conservation  of  momentum  using  only  two  velocities,  v  for  Mr. 
Nelson’s  velocity  before  the  crash,  and  v'  for  their  joint  velocity 
afterward: 

mNv  =  mN  v'  +  mcv' . 

Solving  for  the  unknown,  v,  we  find 


=  80  km/hr. 


He  is  lying. 

The  above  example  was  simple  because  both  cars  had  the  same 
velocity  afterward.  In  many  one-dimensional  collisions,  however,  the 
two  objects  do  not  stick.  If  we  wish  to  predict  the  result  of  such  a 
collision,  conservation  of  momentum  does  not  suffice,  because  both 
velocities  after  the  collision  are  unknown,  so  we  have  one  equation 
in  two  unknowns. 

Conservation  of  energy  can  provide  a  second  equation,  but  its 
application  is  not  as  straightforward,  because  kinetic  energy  is  only 
the  particular  form  of  energy  that  has  to  do  with  motion.  In  many 
collisions,  part  of  the  kinetic  energy  that  was  present  before  the 
collision  is  used  to  create  heat  or  sound,  or  to  break  the  objects 
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or  permanently  bend  them.  Cars,  in  fact,  are  carefully  designed  to 
crumple  in  a  collision.  Crumpling  the  car  uses  up  energy,  and  that’s 
good  because  the  goal  is  to  get  rid  of  all  that  kinetic  energy  in  a 
relatively  safe  and  controlled  way.  At  the  opposite  extreme,  a  super¬ 
ball  is  “super”  because  it  emerges  from  a  collision  with  almost  all  its 
original  kinetic  energy,  having  only  stored  it  briefly  as  interatomic 
electrical  energy  while  it  was  being  squashed  by  the  impact. 

Collisions  of  the  superball  type,  in  which  almost  no  kinetic  en¬ 
ergy  is  converted  to  other  forms  of  energy,  can  thus  be  analyzed 
more  thoroughly,  because  they  have  Kf  =  Ki,  as  opposed  to  the 
less  useful  inequality  Kf  <  I\{  for  a  case  like  a  tennis  ball  bouncing 
on  grass. 

Pool  balls  colliding  head-on  example  12 

>  Two  pool  balls  collide  head-on,  so  that  the  collision  is  restricted 
to  one  dimension.  Pool  balls  are  constructed  so  as  to  lose  as  little 
kinetic  energy  as  possible  in  a  collision,  so  under  the  assumption 
that  no  kinetic  energy  is  converted  to  any  other  form  of  energy, 
what  can  we  predict  about  the  results  of  such  a  collision? 

>  Pool  balls  have  identical  masses,  so  we  use  the  same  symbol 
m  for  both.  Conservation  of  energy  and  no  loss  of  kinetic  energy 
give  us  the  two  equations 


mv-\j  +  mv2i  =  mv-\f  +  mv2f 
\mv i2,  +  A-mvli  =  A-mv\f  +  ^mv%f 

The  masses  and  the  factors  of  1/2  can  be  divided  out,  and  we 
eliminate  the  cumbersome  subscripts  by  replacing  the  symbols 
v1(J...  with  the  symbols  A,  B,  C,  and  D: 

A  +  B  =  C  +  D 
A2  +  B2  =  C2  +  D2. 

A  little  experimentation  with  numbers  shows  that  given  values  of  A 
and  6,  it  is  impossible  to  find  C  and  D  that  satisfy  these  equations 
unless  C  and  D  equal  A  and  B,  or  C  and  D  are  the  same  as  A 
and  B  but  swapped  around.  A  formal  proof  of  this  fact  is  given 
in  the  sidebar.  In  the  special  case  where  ball  2  is  initially  at  rest, 
this  tells  us  that  ball  1  is  stopped  dead  by  the  collision,  and  ball 
2  heads  off  at  the  velocity  originally  possessed  by  ball  1 .  This 
behavior  will  be  familiar  to  players  of  pool. 

Often,  as  in  example  12,  the  details  of  the  algebra  are  the  least 
interesting  part  of  the  problem,  and  considerable  physical  insight 
can  be  gained  simply  by  counting  the  number  of  unknowns  and 
comparing  to  the  number  of  equations.  Suppose  a  beginner  at  pool 
notices  a  case  where  her  cue  ball  hits  an  initially  stationary  ball  and 
stops  dead.  “Wow,  what  a  good  trick,”  she  thinks.  “I  bet  I  could 


Gory  Details  of  the  Proof  in 
Example  12 


The  equation  A+B  =  C+D  says 
that  the  change  in  one  ball’s  ve¬ 
locity  is  equal  and  opposite  to 
the  change  in  the  other’s.  We 
invent  a  symbol  x  =  C  -  A  for 
the  change  in  ball  1’s  velocity. 
The  second  equation  can  then 
be  rewritten  as  A2  +  B2  =  (A  + 
x)2  +  (B  -  x)2.  Squaring  out  the 
quantities  in  parentheses  and  then 
simplifying,  we  get  0  =  Ax~Bx+ 
x2.  The  equation  has  the  trivial 
solution  x  =  0,  i.e.,  neither  ball’s 
velocity  is  changed,  but  this  is 
physically  impossible  because  the 
balls  can’t  travel  through  each 
other  like  ghosts.  Assuming  x  y 
0,  we  can  divide  by  x  and  solve 
for  x  =  B  -  A.  This  means  that 
ball  1  has  gained  an  amount  of 
velocity  exactly  sufficient  to  mat¬ 
ch  ball  2’s  initial  velocity,  and  vice- 
versa.  The  balls  must  have  swap¬ 
ped  velocities. 
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never  do  that  again  in  a  million  years.”  But  she  tries  again,  and 
finds  that  she  can’t  help  doing  it  even  if  she  doesn’t  want  to.  Luckily 
she  has  just  learned  about  collisions  in  her  physics  course.  Once  she 
has  written  down  the  equations  for  conservation  of  momentum  and 
no  loss  of  kinetic  energy,  she  really  doesn’t  have  to  complete  the 
algebra.  She  knows  that  she  has  two  equations  in  two  unknowns, 
so  there  must  be  a  well-defined  solution.  Once  she  has  seen  the 
result  of  one  such  collision,  she  knows  that  the  same  thing  must 
happen  every  time.  The  same  thing  would  happen  with  colliding 
marbles  or  croquet  balls.  It  doesn’t  matter  if  the  masses  or  velocities 
are  different,  because  that  just  multiplies  both  equations  by  some 
constant  factor. 

The  discovery  of  the  neutron 

This  was  the  type  of  reasoning  employed  by  James  Chadwick  in 
his  1932  discovery  of  the  neutron.  At  the  time,  the  atom  was  imag¬ 
ined  to  be  made  out  of  two  types  of  fundamental  particles,  protons 
and  electrons.  The  protons  were  far  more  massive,  and  clustered 
together  in  the  atom’s  core,  or  nucleus.  Electrical  attraction  caused 
the  electrons  to  orbit  the  nucleus  in  circles,  in  much  the  same  way 
that  gravity  kept  the  planets  from  cruising  out  of  the  solar  system. 
Experiments  showed,  for  example,  that  twice  as  much  energy  was 
required  to  strip  the  last  electron  off  of  a  helium  atom  as  was  needed 
to  remove  the  single  electron  from  a  hydrogen  atom,  and  this  was 
explained  by  saying  that  helium  had  two  protons  to  hydrogen’s  one. 
The  trouble  was  that  according  to  this  model,  helium  would  have 
two  electrons  and  two  protons,  giving  it  precisely  twice  the  mass  of 
a  hydrogen  atom  with  one  of  each.  In  fact,  helium  has  about  four 
times  the  mass  of  hydrogen. 

Chadwick  suspected  that  the  helium  nucleus  possessed  two  addi¬ 
tional  particles  of  a  new  type,  which  did  not  participate  in  electrical 
interactions  at  all,  i.e.,  were  electrically  neutral.  If  these  particles 
had  very  nearly  the  same  mass  as  protons,  then  the  four-to-one  mass 
ratio  of  helium  and  hydrogen  could  be  explained.  In  1930,  a  new 
type  of  radiation  was  discovered  that  seemed  to  fit  this  description. 
It  was  electrically  neutral,  and  seemed  to  be  coming  from  the  nuclei 
of  light  elements  that  had  been  exposed  to  other  types  of  radia¬ 
tion.  At  this  time,  however,  reports  of  new  types  of  particles  were 
a  dime  a  dozen,  and  most  of  them  turned  out  to  be  either  clusters 
made  of  previously  known  particles  or  else  previously  known  parti¬ 
cles  with  higher  energies.  Many  physicists  believed  that  the  “new” 
particle  that  had  attracted  Chadwick’s  interest  was  really  a  pre¬ 
viously  known  particle  called  a  gamma  ray,  which  was  electrically 
neutral.  Since  gamma  rays  have  no  mass,  Chadwick  decided  to  try 
to  determine  the  new  particle’s  mass  and  see  if  it  was  nonzero  and 
approximately  equal  to  the  mass  of  a  proton. 

Unfortunately  a  subatomic  particle  is  not  something  you  can 
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to  vacuum 
pump 


polonium  beryllium 


h  /  Chadwick’s  subatomic  pool  table.  A  disk  of  the  naturally  occur¬ 
ring  metal  polonium  provides  a  source  of  radiation  capable  of  kicking 
neutrons  out  of  the  beryllium  nuclei.  The  type  of  radiation  emitted  by 
the  polonium  is  easily  absorbed  by  a  few  mm  of  air,  so  the  air  has  to  be 
pumped  out  of  the  left-hand  chamber.  The  neutrons,  Chadwick’s  mystery 
particles,  penetrate  matter  far  more  readily,  and  fly  out  through  the  wall 
and  into  the  chamber  on  the  right,  which  is  filled  with  nitrogen  or  hydrogen 
gas.  When  a  neutron  collides  with  a  nitrogen  or  hydrogen  nucleus,  it 
kicks  it  out  of  its  atom  at  high  speed,  and  this  recoiling  nucleus  then  rips 
apart  thousands  of  other  atoms  of  the  gas.  The  result  is  an  electrical 
pulse  that  can  be  detected  in  the  wire  on  the  right.  Physicists  had  already 
calibrated  this  type  of  apparatus  so  that  they  could  translate  the  strength 
of  the  electrical  pulse  into  the  velocity  of  the  recoiling  nucleus.  The 
whole  apparatus  shown  in  the  figure  would  fit  in  the  palm  of  your  hand,  in 
dramatic  contrast  to  today’s  giant  particle  accelerators. 


just  put  on  a  scale  and  weigh.  Chadwick  came  up  with  an  ingenious 
solution.  The  masses  of  the  nuclei  of  the  various  chemical  elements 
were  already  known,  and  techniques  had  already  been  developed  for 
measuring  the  speed  of  a  rapidly  moving  nucleus.  He  therefore  set 
out  to  bombard  samples  of  selected  elements  with  the  mysterious 
new  particles.  When  a  direct,  head-on  collision  occurred  between 
a  mystery  particle  and  the  nucleus  of  one  of  the  target  atoms,  the 
nucleus  would  be  knocked  out  of  the  atom,  and  he  would  measure 
its  velocity. 

Suppose,  for  instance,  that  we  bombard  a  sample  of  hydrogen 
atoms  with  the  mystery  particles.  Since  the  participants  in  the 
collision  are  fundamental  particles,  there  is  no  way  for  kinetic  energy 
to  be  converted  into  heat  or  any  other  form  of  energy,  and  Chadwick 
thus  had  two  equations  in  three  unknowns: 

equation  $T:  conservation  of  momentum 

equation  #2:  no  loss  of  kinetic  energy 

unknown  $T:  mass  of  the  mystery  particle 

unknown  ^2:  initial  velocity  of  the  mystery  particle 
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i  /  The  highjumper’s  body  passes 
over  the  bar,  but  his  center  of 
mass  passes  under  it  .{Dunia 
Young) 


j  /  Two  pool  balls  collide. 


unknown  ^3:  final  velocity  of  the  mystery  particle 

The  number  of  unknowns  is  greater  than  the  number  of  equa¬ 
tions,  so  there  is  no  unique  solution.  But  by  creating  collisions  with 
nuclei  of  another  element,  nitrogen,  he  gained  two  more  equations 
at  the  expense  of  only  one  more  unknown: 

equation  ^3:  conservation  of  momentum  in  the  new  collision 

equation  #4:  no  loss  of  kinetic  energy  in  the  new  collision 

unknown  #4:  final  velocity  of  the  mystery  particle  in  the  new 
collision 

He  was  thus  able  to  solve  for  all  the  unknowns,  including  the 
mass  of  the  mystery  particle,  which  was  indeed  within  1%  of  the 
mass  of  a  proton.  He  named  the  new  particle  the  neutron,  since  it 
is  electrically  neutral. 

Discussion  Questions 

A  Good  pool  players  learn  to  make  the  cue  ball  spin,  which  can  cause 
it  not  to  stop  dead  in  a  head-on  collision  with  a  stationary  ball.  If  this  does 
not  violate  the  laws  of  physics,  what  hidden  assumption  was  there  in  the 
example  in  the  text  where  it  was  proved  that  the  cue  ball  must  stop? 

3.1 .5  The  center  of  mass 

Figures  i  and  k  show  two  examples  where  a  motion  that  appears 
complicated  actually  has  a  very  simple  feature.  In  both  cases,  there 
is  a  particular  point,  called  the  center  of  mass,  whose  motion  is 
surprisingly  simple.  The  highjumper  flexes  his  body  as  he  passes 
over  the  bar,  so  his  motion  is  intrinsically  very  complicated,  and 
yet  his  center  of  mass’s  motion  is  a  simple  parabola,  just  like  the 
parabolic  arc  of  a  pointlike  particle.  The  wrench’s  center  of  mass 
travels  in  a  straight  line  as  seen  from  above,  which  is  what  we’d 
expect  for  a  pointlike  particle  flying  through  the  air. 

The  highjumper  and  the  wrench  are  both  complicated  systems, 


k/ln  this  multiple-flash  photograph,  we  see  the  wrench  from  above 
as  it  flies  through  the  air,  rotating  as  it  goes.  Its  center  of  mass,  marked 
with  the  black  cross,  travels  along  a  straight  line,  unlike  the  other  points 
on  the  wrench,  which  execute  loops.  ( PSSC  Physics) 


each  consisting  of  zillions  of  subatomic  particles.  To  understand 
what’s  going  on,  let’s  instead  look  at  a  nice  simple  system,  two  pool 
balls  colliding.  We  assume  the  balls  are  a  closed  system  (i.e. ,  their 
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interaction  with  the  felt  surface  is  not  important)  and  that  their 
rotation  is  unimportant,  so  that  we’ll  be  able  to  treat  each  one  as  a 
single  particle.  By  symmetry,  the  only  place  their  center  of  mass  can 
be  is  half-way  in  between,  at  an  x  coordinate  equal  to  the  average 
of  the  two  balls’  positions,  xcm  =  (x\  +  x2)/Z. 

Figure  j  makes  it  appear  that  the  center  of  mass,  marked  with 
an  x,  moves  with  constant  velocity  to  the  right,  regardless  of  the 
collision,  and  we  can  easily  prove  this  using  conservation  of  momen¬ 
tum: 


Vcrn  —  d XCm/  df 
=  ^{vi  +  V2) 

=  -^(mw  1  +  mv2) 

2m 

 Ptotal 
ni'total 

Since  momentum  is  conserved,  the  last  expression  is  constant,  which 
proves  that  vcm  is  constant. 

Rearranging  this  a  little,  we  have  Ptotal  =  mtotalvcm-  In  other 
words,  the  total  momentum  of  the  system  is  the  same  as  if  all  its 
mass  was  concentrated  at  the  center  of  mass  point. 

Sigma  notation 

When  there  is  a  large,  potentially  unknown  number  of  particles, 
we  can  write  sums  like  the  ones  occurring  above  using  symbols  like 
but  that  gets  awkward.  It’s  more  convenient  to  use  the 
Greek  uppercase  sigma,  £,  to  indicate  addition.  For  example,  the 
sum  l2  +  22  +  32  +  42  =  30  could  be  written  as 

n 

Ei2  =  3»> 

3  = 1 

read  “the  sum  from  j  =  1  to  n  of  j2.”  The  variable  j  is  a  dummy 
variable,  just  like  the  dx  in  an  integral  that  tells  you  you’re  integrat¬ 
ing  with  respect  to  x,  but  has  no  significance  outside  the  integral. 
The  j  below  the  sigma  tells  you  what  variable  is  changing  from  one 
term  of  the  sum  to  the  next,  but  j  has  no  significance  outside  the 
sum. 

As  an  example,  let’s  generalize  the  proof  of  ptotal  =  'mtotalvcm  to 
the  case  of  an  arbitrary  number  n  of  identical  particles  moving  in 
one  dimension,  rather  than  just  two  particles.  The  center  of  mass 
is  at 


^  cm 


1 

n 


n 


3= 1 


I  /  No  matter  what  point  you 
hang  the  pear  from,  the  string 
lines  up  with  the  pear’s  center 
of  mass.  The  center  of  mass 
can  therefore  be  defined  as  the 
intersection  of  all  the  lines  made 
by  hanging  the  pear  in  this  way. 
Note  that  the  X  in  the  figure 
should  not  be  interpreted  as 
implying  that  the  center  of  mass 
is  on  the  surface  —  it  is  actually 
inside  the  pear. 


m  /  The  circus  performers 
hang  with  the  ropes  passing 
through  their  centers  of  mass. 
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where  x\  is  the  mass  of  the  first  particle,  and  so  on.  The  velocity  of 
the  center  of  mass  is 


n/An  improperly  balanced 
wheel  has  a  center  of  mass  that 
is  not  at  its  geometric  center. 
When  you  get  a  new  tire,  the 
mechanic  clamps  little  weights  to 
the  rim  to  balance  the  wheel. 


o/This  toy  was  intentionally 
designed  so  that  the  mushroom¬ 
shaped  piece  of  metal  on  top 
would  throw  off  the  center  of 
mass.  When  you  wind  it  up,  the 
mushroom  spins,  but  the  center 
of  mass  doesn’t  want  to  move, 
so  the  rest  of  the  toy  tends  to 
counter  the  mushroom’s  motion, 
causing  the  whole  thing  to  jump 
around. 


Vcm  —  dxcm/  dt 

_  1 
n 


E 

3= 1 


1 


nm 


E 


mVn 


3= 1 
Ptotal 

Wl'total 


What  about  a  system  containing  objects  with  unequal  masses, 
or  containing  more  than  two  objects?  The  reasoning  above  can  be 
generalized  to  a  weighted  average: 


En 
j=l 

z2j=i  mi 


The  solar  system ’s  center  of  mass  example  1 3 

In  the  discussion  of  the  sun’s  gravitational  field  on  page  99,  I 
mentioned  in  a  footnote  that  the  sun  doesn’t  really  stay  in  one 
place  while  the  planets  orbit  around  it.  Actually,  motion  is  relative, 
so  it’s  meaningless  to  ask  whether  the  sun  is  absolutely  at  rest, 
but  it  is  meaningful  to  ask  whether  it  moves  in  a  straight  line  at 
constant  velocity.  We  can  now  see  that  since  the  solar  system 
is  a  closed  system,  its  total  momentum  must  be  constant,  and 
Ptotai  =  m total vcm  then  tells  us  that  it’s  the  solar  system’s  center 
of  mass  that  has  constant  velocity,  not  the  sun.  The  sun  wob¬ 
bles  around  this  point  irregularly  due  to  its  interactions  with  the 
planets,  Jupiter  in  particular. 

The  earth-moon  system  example  14 

The  earth-moon  system  is  much  simpler  than  the  solar  system 
because  it  contains  only  two  objects.  Where  is  the  center  of  mass 
of  this  system?  Let  x=0  be  the  earth’s  center,  so  that  the  moon 
lies  at  x  =  3.8  x  105  km.  Then 


m- \x-\  +  m2x2 
m-\  +  m2 

and  letting  1  be  the  earth  and  2  the  moon,  we  have 

_  dearth  X  0  +  ^moon^moon 
Xcm  — 

dearth  +  mmoon 

=  4600  km, 

or  about  three  quarters  of  the  way  from  the  earth’s  center  to  its 
surface. 
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The  center  of  mass  as  an  average  example  15 

>  Explain  how  we  know  that  the  center  of  mass  of  each  object  is 
at  the  location  shown  in  figure  p. 


>  The  center  of  mass  is  a  sort  of  average,  so  the  height  of  the 
centers  of  mass  in  1  and  2  has  to  be  midway  between  the  two 
squares,  because  that  height  is  the  average  of  the  heights  of  the 
two  squares.  Example  3  is  a  combination  of  examples  1  and 
2,  so  we  can  find  its  center  of  mass  by  averaging  the  horizontal 
positions  of  their  centers  of  mass.  In  example  4,  each  square 
has  been  skewed  a  little,  but  just  as  much  mass  has  been  moved 
up  as  down,  so  the  average  vertical  position  of  the  mass  hasn’t 
changed.  Example  5  is  clearly  not  all  that  different  from  example 
4,  the  main  difference  being  a  slight  clockwise  rotation,  so  just  as 
in  example  4,  the  center  of  mass  must  be  hanging  in  empty  space, 
where  there  isn’t  actually  any  mass.  Horizontally,  the  center  of 
mass  must  be  between  the  heels  and  toes,  or  else  it  wouldn’t  be 
possible  to  stand  without  tipping  over. 

Momentum  and  Galilean  relativity  example  16 

The  principle  of  Galilean  relativity  states  that  the  laws  of  physics 
are  supposed  to  be  equally  valid  in  all  inertial  frames  of  refer¬ 
ence.  If  we  first  calculate  some  momenta  in  one  frame  of  refer¬ 
ence  and  find  that  momentum  is  conserved,  and  then  rework  the 
whole  problem  in  some  other  frame  of  reference  that  is  moving 
with  respect  to  the  first,  the  numerical  values  of  the  momenta  will 
all  be  different.  Even  so,  momentum  will  still  be  conserved.  All 
that  matters  is  that  we  work  a  single  problem  in  one  consistent 
frame  of  reference. 

One  way  of  proving  this  is  to  apply  the  equation  ptotai  =  mt0taivcm. 
If  the  velocity  of  one  frame  relative  to  the  other  is  u,  then  the  only 
effect  of  changing  frames  of  reference  is  to  change  vcm  from  its 
original  value  to  vcm  +  u.  This  adds  a  constant  onto  the  momen¬ 
tum,  which  has  no  effect  on  conservation  of  momentum. 

seif-check  A 

The  figure  shows  a  gymnast  holding  onto  the  inside  of  a  big  wheel. 
From  inside  the  wheel,  how  could  he  make  it  roll  one  way  or  the  other? 

>  Answer,  p.  962 


q  /  Self-check  A. 
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r/The  same  collision  of  two 
pools  balls,  but  now  seen  in 
the  center  of  mass  frame  of 
reference. 


s  /  The  sun’s  frame  of  refer¬ 
ence. 


t  /  The  c.m.  frame. 


3.1.6  The  center  of  mass  frame  of  reference 

A  particularly  useful  frame  of  reference  in  many  cases  is  the 
frame  that  moves  along  with  the  center  of  mass,  called  the  center 
of  mass  (c.m.)  frame.  In  this  frame,  the  total  momentum  is  zero. 
The  following  examples  show  how  the  center  of  mass  frame  can  be 
a  powerful  tool  for  simplifying  our  understanding  of  collisions. 

A  collision  of  pool  balls  viewed  in  the  c.m.  frame  example  17 
If  you  move  your  head  so  that  your  eye  is  always  above  the  point 
halfway  in  between  the  two  pool  balls,  as  in  figure  r,  you  are  view¬ 
ing  things  in  the  center  of  mass  frame.  In  this  frame,  the  balls 
come  toward  the  center  of  mass  at  equal  speeds.  By  symme¬ 
try,  they  must  therefore  recoil  at  equal  speeds  along  the  lines  on 
which  they  entered.  Since  the  balls  have  essentially  swapped 
paths  in  the  center  of  mass  frame,  the  same  must  also  be  true  in 
any  other  frame.  This  is  the  same  result  that  required  laborious 
algebra  to  prove  previously  without  the  concept  of  the  center  of 
mass  frame. 

The  slingshot  effect  example  18 

It  is  a  counterintuitive  fact  that  a  spacecraft  can  pick  up  speed 
by  swinging  around  a  planet,  if  it  arrives  in  the  opposite  direction 
compared  to  the  planet’s  motion.  Although  there  is  no  physical 
contact,  we  treat  the  encounter  as  a  one-dimensional  collision, 
and  analyze  it  in  the  center  of  mass  frame.  Since  Jupiter  is  so 
much  more  massive  than  the  spacecraft,  the  center  of  mass  is 
essentially  fixed  at  Jupiter’s  center,  and  Jupiter  has  zero  velocity 
in  the  center  of  mass  frame,  as  shown  in  figure  3.1.6.  The  c.m. 
frame  is  moving  to  the  left  compared  to  the  sun-fixed  frame  used 
in  figure  3.1.6,  so  the  spacecraft’s  initial  velocity  is  greater  in  this 
frame  than  in  the  sun’s  frame. 

Things  are  simpler  in  the  center  of  mass  frame,  because  it  is  more 
symmetric.  In  the  sun-fixed  frame,  the  incoming  leg  of  the  en¬ 
counter  is  rapid,  because  the  two  bodies  are  rushing  toward  each 
other,  while  their  separation  on  the  outbound  leg  is  more  grad¬ 
ual,  because  Jupiter  is  trying  to  catch  up.  In  the  c.m.  frame, 
Jupiter  is  sitting  still,  and  there  is  perfect  symmetry  between  the 
incoming  and  outgoing  legs,  so  by  symmetry  we  have  v1f  =  -v1(-. 
Going  back  to  the  sun-fixed  frame,  the  spacecraft’s  final  velocity 
is  increased  by  the  frames’  motion  relative  to  each  other.  In  the 
sun-fixed  frame,  the  spacecraft’s  velocity  has  increased  greatly. 

Einstein’s  motorcycle  example  19 

We’ve  assumed  we  were  dealing  with  a  system  of  material  ob¬ 
jects,  for  which  the  equation  p  =  mv  was  true.  What  if  our  system 
contains  only  light  rays,  or  a  mixture  of  light  and  matter?  As  a  col¬ 
lege  student,  Einstein  kept  worrying  about  was  what  a  beam  of 
light  would  look  like  if  you  could  ride  alongside  it  on  a  motorcycle. 
In  other  words,  he  imagined  putting  himself  in  the  light  beam’s 
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center  of  mass  frame.  Chapter  7  discusses  Einstein’s  resolution 
of  this  problem,  but  the  basic  point  is  that  you  can’t  ride  the  mo¬ 
torcycle  alongside  the  light  beam,  because  material  objects  can’t 
go  as  fast  as  the  speed  of  light.  A  beam  of  light  has  no  center  of 
mass  frame  of  reference. 

Discussion  Questions 

A  Make  up  a  numerical  example  of  two  unequal  masses  moving  in  one 
dimension  at  constant  velocity,  and  verify  the  equation  ptotai  =  mtotaivcm 
over  a  time  interval  of  one  second. 

B  A  more  massive  tennis  racquet  or  baseball  bat  makes  the  ball  fly 
off  faster.  Explain  why  this  is  true,  using  the  center  of  mass  frame.  For 
simplicity,  assume  that  the  racquet  or  bat  is  simply  sitting  still  before  the 
collision,  and  that  the  hitter’s  hands  do  not  make  any  force  large  enough 
to  have  a  significant  effect  over  the  short  duration  of  the  impact. 


3.2  Force  In  One  Dimension 


3.2.1  Momentum  transfer 

For  every  conserved  quantity,  we  can  define  an  associated  rate 
of  flow.  An  open  system  can  have  mass  transferred  in  or  out  of  it, 
and  we  can  measure  the  rate  of  mass  flow,  dm/  dt  in  units  of  kg/s. 
Energy  can  flow  in  or  out,  and  the  rate  of  energy  transfer  is  the 
power,  P  =  d E /  dt,  measured  in  watts.3  The  rate  of  momentum 
transfer  is  called  force, 


[definition  of  force]. 


The  units  of  force  are  kg-m/s2,  which  can  be  abbreviated  as  newtons, 
1  N  =  kg-m/s2.  Newtons  are  unfortunately  not  as  familiar  as  watts. 
A  newton  is  about  how  much  force  you’d  use  to  pet  a  dog.  The 
most  powerful  rocket  engine  ever  built,  the  first  stage  of  the  Saturn 
V  that  sent  astronauts  to  the  moon,  had  a  thrust  of  about  30  million 
newtons.  In  one  dimension,  positive  and  negative  signs  indicate  the 
direction  of  the  force  —  a  positive  force  is  one  that  pushes  or  pulls 
in  the  direction  of  the  positive  x  axis. 


Walking  into  a  lamppost  example  20 

>  Starting  from  rest,  you  begin  walking,  bringing  your  momentum 
up  to  100  kg-m/s.  You  walk  straight  into  a  lamppost.  Why  is  the 
momentum  change  of  -100  kg-m/s  so  much  more  painful  than 
the  change  of  +100  kg-m/s  when  you  started  walking? 

o  The  forces  are  not  really  constant,  but  for  this  type  of  qualitative 
discussion  we  can  pretend  they  are,  and  approximate  dp/df  as 
Ap/At.  It  probably  takes  you  about  1  s  to  speed  up  initially,  so  the 
ground’s  force  on  you  is  F  =  Ap/At  «  100  N.  Your  impact  with 
the  lamppost,  however,  is  over  in  the  blink  of  an  eye,  say  1/10  s  or 

3Recall  that  uppercase  P  is  power,  while  lowercase  p  is  momentum. 


energy 


system 


power  =  rate  of 
transferring  energy 


A 


momentum 


/  force  =  rate  of 

transferring  momentum 


a  /  Power  and  force  are  the 
rates  at  which  energy  and 
momentum  are  transferred. 
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b/The  airbag  increases  At 
so  as  to  reduce  F  =  Ap/At. 


less.  Dividing  by  this  much  smaller  At  gives  a  much  larger  force, 
perhaps  thousands  of  newtons  (with  a  negative  sign). 

This  is  also  the  principle  of  airbags  in  cars.  The  time  required 
for  the  airbag  to  decelerate  your  head  is  fairly  long:  the  time  it 
takes  your  face  to  travel  20  or  30  cm.  Without  an  airbag,  your  face 
would  have  hit  the  dashboard,  and  the  time  interval  would  have 
been  the  much  shorter  time  taken  by  your  skull  to  move  a  couple  of 
centimeters  while  your  face  compressed.  Note  that  either  way,  the 
same  amount  of  momentum  is  transferred:  the  entire  momentum  of 
your  head. 

Force  is  defined  as  a  derivative,  and  the  derivative  of  a  sum  is 
the  sum  of  the  derivatives.  Therefore  force  is  additive:  when  more 
than  one  force  acts  on  an  object,  you  add  the  forces  to  find  out 
what  happens.  An  important  special  case  is  that  forces  can  cancel. 
Consider  your  body  sitting  in  a  chair  as  you  read  this  book.  Let 
the  positive  x  axis  be  upward.  The  chair’s  upward  force  on  you 
is  represented  with  a  positive  number,  which  cancels  out  with  the 
earth’s  downward  gravitational  force,  which  is  negative.  The  total 
rate  of  momentum  transfer  into  your  body  is  zero,  and  your  body 
doesn’t  change  its  momentum. 

Finding  momentum  from  force  example  21 

>  An  object  of  mass  m  starts  at  rest  at  t  =  t0.  A  force  varying  as 
F  =  bt~2,  where  b  is  a  constant,  begins  acting  on  it.  Find  the 
greatest  speed  it  will  ever  have. 

> 


dp  =  Pdf 


p  =  J  Fdt  +  p0 

b 

=  —  y  +  Po, 

where  p0  is  a  constant  of  integration.  The  given  initial  condition 
is  that  p  =  0  at  t  =  t0,  so  we  find  that  p0  =  b/t0.  The  negative 
term  gets  closer  to  zero  with  increasing  time,  so  the  maximum 
momentum  is  achieved  by  letting  t  approach  infinity.  That  is,  the 
object  will  never  stop  speeding  up,  but  it  will  also  never  surpass  a 
certain  speed.  In  the  limit  t  ->•  oo,  we  identify  p0  as  the  momen¬ 
tum  that  the  object  will  approach  asymptotically.  The  maximum 
velocity  is  v  =  p0/m  =  b/mt0. 

Discussion  Question 

A  Many  collisions,  like  the  collision  of  a  bat  with  a  baseball,  appear  to 
be  instantaneous.  Most  people  also  would  not  imagine  the  bat  and  ball  as 
bending  or  being  compressed  during  the  collision.  Consider  the  following 
possibilities: 


146 


Chapter  3  Conservation  of  Momentum 


(1)  The  collision  is  instantaneous. 

(2)  The  collision  takes  a  finite  amount  of  time,  during  which  the  ball  and 
bat  retain  their  shapes  and  remain  in  contact. 

(3)  The  collision  takes  a  finite  amount  of  time,  during  which  the  ball  and 
bat  are  bending  or  being  compressed. 

How  can  two  of  these  be  ruled  out  based  on  energy  or  momentum  con¬ 
siderations? 


3.2.2  Newton’s  laws 

Although  momentum  is  the  third  conserved  quantity  we’ve  en¬ 
countered,  historically  it  was  the  first  to  be  discovered.  Isaac  New¬ 
ton  formulated  a  complete  treatment  of  mechanical  systems  in  terms 
of  force  and  momentum.  Newton’s  theory  was  based  on  three  laws 
of  motion,  which  we  now  think  of  as  consequences  of  conservation 
of  mass,  energy,  and  momentum. 

Newton’s  laws  in  one  dimension: 

Newton’s  Erst  law:  If  there  is  no  force  acting  on  an 
object,  it  stays  in  the  same  state  of  motion. 

Newton ’s  second  law:  The  sum  of  all  the  forces  acting 
on  an  object  determines  the  rate  at  which  its  momen- 
tum  changes,  Ftotai  =  dp/  d t. _ 

Newton’s  third  law:  Forces  occur  in  opposite  pairs.  If 
object  A  interacts  with  object  B,  then  A’s  force  on  B 
and  B’s  force  on  A  are  related  by  Fab  =  —Fba- 

The  second  law  is  the  definition  of  force,  which  we’ve  already  en¬ 
countered.4  The  first  law  is  a  special  case  of  the  second  law  —  if 
dp/  dt  is  zero,  then  p  =  mv  is  a  constant,  and  since  mass  is  con¬ 
served,  constant  p  implies  constant  v.  The  third  law  is  a  restatement 
of  conservation  of  momentum:  for  two  objects  interacting,  we  have 
constant  total  momentum,  so  0  =  -£( pa  +  Pb)  =  Fba  +  Fab- 

a=F/m  example  22 

Many  modern  textbooks  restate  Newton’s  second  law  as  a  = 
F/m,  i.e.,  as  an  equation  that  predicts  an  object’s  acceleration 
based  on  the  force  exerted  on  it.  This  is  easily  derived  from  New¬ 
ton’s  original  form  as  follows:  a  =  dv/d t  =  ( 6p/dt)/m  =  F/m. 

Gravitational  force  related  to  g  example  23 

As  a  special  case  of  the  previous  example,  consider  an  object 
in  free  fall,  and  let  the  x  axis  point  down.  Then  a  =  +g,  and 
F  =  ma  =  mg.  For  example,  the  gravitational  force  on  a  1  kg 
mass  at  the  earth’s  surface  is  about  9.8  N.  Even  if  other  forces 
act  on  the  object,  and  it  isn’t  in  free  fall,  the  gravitational  force  on 
it  is  still  the  same,  and  can  still  be  calculated  as  mg. 


c  /  Isaac  Newton  (1 643-1 727). 


4This  is  with  the  benefit  of  hindsight.  At  the  time,  the  word  “force”  already 
had  certain  connotations,  and  people  thought  they  understood  what  it  meant 
and  how  to  measure  it,  e.g.,  by  using  a  spring  scale.  From  their  point  of  view, 
F  =  dp/  dt  was  not  a  definition  but  a  testable  —  and  controversial!  —  statement. 
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d  /  Two  magnets  exert  forces 
on  each  other. 


e  /  It  doesn’t  make  sense  for 
the  man  to  talk  about  the 
woman’s  money  canceling  out  his 
bar  tab,  because  there  is  no  good 
reason  to  combine  his  debts  and 
her  assets. 


f/ Newton’s  third  law  does 
not  mean  that  forces  always  can¬ 
cel  out  so  that  nothing  can  ever 
move.  If  these  two  ice  skaters, 
initially  at  rest,  push  against  each 
other,  they  will  both  move. 


Changing  frames  of  reference  example  24 

Suppose  we  change  from  one  frame  reference  into  another,  which 
is  moving  relative  to  the  first  one  at  a  constant  velocity  u.  If  an 
object  of  mass  m  is  moving  at  velocity  v  (which  need  not  be  con¬ 
stant),  then  the  effect  is  to  change  its  momentum  from  mv  in  one 
frame  to  mv+mu  in  the  other.  Force  is  defined  as  the  derivative  of 
momentum  with  respect  to  time,  and  the  derivative  of  a  constant 
is  zero,  so  adding  the  constant  mu  has  no  effect  on  the  result. 
We  therefore  conclude  that  observers  in  different  inertial  frames 
of  reference  agree  on  forces. 

Using  the  third  law  correctly 

If  you’ve  already  accepted  Galilean  relativity  in  your  heart,  then 
there  is  nothing  really  difficult  about  the  first  and  second  laws.  The 
third  law,  however,  is  more  of  a  conceptual  challenge.  The  first 
hurdle  is  that  it  is  counterintuitive.  Is  it  really  true  that  if  a  fighter 
jet  collides  with  a  mosquito,  the  mosquito’s  force  on  the  jet  is  just 
as  strong  as  the  jet’s  force  on  the  mosquito?  Yes,  it  is  true,  but  it 
is  hard  to  believe  at  first.  That  amount  of  force  simply  has  more  of 
an  effect  on  the  mosquito,  because  it  has  less  mass. 

A  more  humane  and  practical  experiment  is  shown  in  figure  d.  A 
large  magnet  and  a  small  magnet  are  weighed  separately,  and  then 
one  magnet  is  hung  from  the  pan  of  the  top  balance  so  that  it  is 
directly  above  the  other  magnet.  There  is  an  attraction  between  the 
two  magnets,  causing  the  reading  on  the  top  scale  to  increase  and  the 
reading  on  the  bottom  scale  to  decrease.  The  large  magnet  is  more 
“powerful”  in  the  sense  that  it  can  pick  up  a  heavier  paperclip  from 
the  same  distance,  so  many  people  have  a  strong  expectation  that 
one  scale’s  reading  will  change  by  a  far  different  amount  than  the 
other.  Instead,  we  find  that  the  two  changes  are  equal  in  magnitude 
but  opposite  in  direction,  so  the  upward  force  of  the  top  magnet  on 
the  bottom  magnet  is  of  the  same  magnitude  as  the  downward  force 
of  the  bottom  magnet  on  the  top  magnet. 

To  students,  it  often  sounds  as  though  Newton’s  third  law  im¬ 
plies  nothing  could  ever  change  its  motion,  since  the  two  equal  and 
opposite  forces  would  always  cancel.  As  illustrated  in  figure  e,  the 
fallacy  arises  from  assuming  that  we  can  add  things  that  it  doesn’t 
make  sense  to  add.  It  only  makes  sense  to  add  up  forces  that  are 
acting  on  the  same  object,  whereas  two  forces  related  to  each  other 
by  Newton’s  third  law  are  always  acting  on  two  different  objects. 
If  two  objects  are  interacting  via  a  force  and  no  other  forces  are 
involved,  then  both  objects  will  accelerate  —  in  opposite  directions, 
as  shown  in  figure  f! 

Here  are  some  suggestions  for  avoiding  misapplication  of  New¬ 
ton’s  third  law: 
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1.  It  always  relates  exactly  two  forces,  not  more. 

2.  The  two  forces  involve  exactly  two  objects,  in  the  pattern  A 
on  B,  B  on  A. 

3.  The  two  forces  are  always  of  the  same  type,  e.g.,  friction  and 
friction,  or  gravity  and  gravity. 


Directions  of  forces 

We’ve  already  seen  that  momentum,  unlike  energy,  has  a  direc¬ 
tion  in  space.  Since  force  is  defined  in  terms  of  momentum,  force 
also  has  a  direction  in  space.  For  motion  in  one  dimension,  we 
have  to  pick  a  coordinate  system,  and  given  that  choice,  forces  and 
momenta  will  be  positive  or  negative.  We’ve  already  used  signs  to 
represent  directions  of  forces  in  Newton’s  third  law,  Fab  =  ~Fba- 

There  is,  however,  a  complication  with  force  that  we  were  able 
to  avoid  with  momentum.  If  an  object  is  moving  on  a  line,  we’re 
guaranteed  that  its  momentum  is  in  one  of  two  directions:  the  two 
directions  along  the  line.  But  even  an  object  that  stays  on  a  line 
may  still  be  subject  to  forces  that  act  perpendicularly  to  the  line. 
For  example,  suppose  a  coin  is  sliding  to  the  right  across  a  table,  h, 
and  let’s  choose  a  positive  x  axis  that  points  to  the  right.  The  coin’s 
motion  is  along  a  horizontal  line,  and  its  momentum  is  positive  and 
decreasing.  Because  the  momentum  is  decreasing,  its  time  derivative 
dp/  dt  is  negative.  This  derivative  equals  the  horizontal  force  of 
friction  F\ ,  and  its  negative  sign  tells  us  that  this  force  on  the  coin 
is  to  the  left. 

But  there  are  also  vertical  forces  on  the  coin.  The  Earth  exerts 
a  downward  gravitational  force  iy?  on  it,  and  the  table  makes  an 
upward  force  F$  that  prevents  the  coin  from  sinking  into  the  wood. 
In  fact,  without  these  vertical  forces  the  horizontal  frictional  force 
wouldn’t  exist:  surfaces  don’t  exert  friction  against  one  another 
unless  they  are  being  pressed  together. 

To  avoid  mathematical  complication,  we  want  to  postpone  the 
full  three-dimensional  treatment  of  force  and  momentum  until  sec¬ 
tion  3.4.  For  now,  we’ll  limit  ourselves  to  examples  like  the  coin,  in 
which  the  motion  is  confined  to  a  line,  and  any  forces  perpendicular 
to  the  line  cancel  each  other  out. 

Discussion  Questions 

A  Criticize  the  following  incorrect  statement: 

“If  an  object  is  at  rest  and  the  total  force  on  it  is  zero,  it  stays  at  rest. 
There  can  also  be  cases  where  an  object  is  moving  and  keeps  on  moving 
without  having  any  total  force  on  it,  but  that  can  only  happen  when  there’s 
no  friction,  like  in  outer  space.” 


g/A  swimmer  doing  the  breast 
stroke  pushes  backward  against 
the  water.  By  Newton’s  third  law, 
the  water  pushes  forward  on  him. 


h/A  coin  slides  across  a  ta¬ 
ble.  Even  for  motion  in  one 
dimension,  some  of  the  forces 
may  not  lie  along  the  line  of  the 
motion. 
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x  (m) 

t(  s) 

10 

1.84 

20 

2.86 

30 

3.80 

40 

4.67 

50 

5.53 

60 

6.38 

70 

7.23 

80 

8.10 

90 

8.96 

100 

9.83 

Discussion  question  B. 


B  The  table  gives  laser  timing  data  for  Ben  Johnson’s  1 00  m  dash  at  the 
1987  World  Championship  in  Rome.  (His  world  record  was  later  revoked 
because  he  tested  positive  for  steroids.)  How  does  the  total  force  on  him 
change  over  the  duration  of  the  race? 

C  You  hit  a  tennis  ball  against  a  wall.  Explain  any  and  all  incorrect 
ideas  in  the  following  description  of  the  physics  involved:  “According  to 
Newton’s  third  law,  there  has  to  be  a  force  opposite  to  your  force  on  the 
ball.  The  opposite  force  is  the  ball’s  mass,  which  resists  acceleration,  and 
also  air  resistance.” 

D  Tam  Anh  grabs  Sarah  by  the  hand  and  tries  to  pull  her.  She  tries 
to  remain  standing  without  moving.  A  student  analyzes  the  situation  as 
follows.  “If  Tam  Anh’s  force  on  Sarah  is  greater  than  her  force  on  him, 
he  can  get  her  to  move.  Otherwise,  she’ll  be  able  to  stay  where  she  is.” 
What’s  wrong  with  this  analysis? 

3.2.3  What  force  is  not 

Violin  teachers  have  to  endure  their  beginning  students’  screech¬ 
ing.  A  frown  appears  on  the  woodwind  teacher’s  face  as  she  watches 
her  student  take  a  breath  with  an  expansion  of  his  ribcage  but  none 
in  his  belly.  What  makes  physics  teachers  cringe  is  their  students’ 
verbal  statements  about  forces.  Below  I  have  listed  several  dicta 
about  what  force  is  not. 

Force  is  not  a  property  of  one  object. 

A  great  many  of  students’  incorrect  descriptions  of  forces  could 
be  cured  by  keeping  in  mind  that  a  force  is  an  interaction  of  two 
objects,  not  a  property  of  one  object. 

Incorrect  statement:  “That  magnet  has  a  lot  of  force." 

X  If  the  magnet  is  one  millimeter  away  from  a  steel  ball  bearing,  they 
may  exert  a  very  strong  attraction  on  each  other,  but  if  they  were  a 
meter  apart,  the  force  would  be  virtually  undetectable.  The  magnet’s 
strength  can  be  rated  using  certain  electrical  units  (ampere  -  meters2), 
but  not  in  units  of  force. 

Force  is  not  a  measure  of  an  object’s  motion. 

If  force  is  not  a  property  of  a  single  object,  then  it  cannot  be 
used  as  a  measure  of  the  object’s  motion. 

Incorrect  statement:  “The  freight  train  rumbled  down  the  tracks  with 
awesome  force.” 

X  Force  is  not  a  measure  of  motion.  If  the  freight  train  collides  with  a 
stalled  cement  truck,  then  some  awesome  forces  will  occur,  but  if  it  hits 
a  fly  the  force  will  be  small. 

Force  is  not  energy. 

Incorrect  statement:  “How  can  my  chair  be  making  an  upward  force  on 
my  rear  end?  It  has  no  power!” 

X  Power  is  a  concept  related  to  energy,  e.g.,  a  100-watt  lightbulb  uses 
up  1 00  joules  per  second  of  energy.  When  you  sit  in  a  chair,  no  energy 
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is  used  up,  so  forces  can  exist  between  you  and  the  chair  without  any 
need  for  a  source  of  power. 

Force  is  not  stored  or  used  up. 

Because  energy  can  be  stored  and  used  up,  people  think  force 
also  can  be  stored  or  used  up. 

Incorrect  statement:  “If  you  don’t  fill  up  your  tank  with  gas,  you’ll  run 
out  of  force.” 

X  Energy  is  what  you’ll  run  out  of,  not  force. 

Forces  need  not  be  exerted  by  living  things  or  machines. 

Transforming  energy  from  one  form  into  another  usually  requires 
some  kind  of  living  or  mechanical  mechanism.  The  concept  is  not 
applicable  to  forces,  which  are  an  interaction  between  objects,  not 
a  thing  to  be  transferred  or  transformed. 

Incorrect  statement:  “How  can  a  wooden  bench  be  making  an  upward 
force  on  my  rear  end?  It  doesn’t  have  any  springs  or  anything  inside  it.” 

X  No  springs  or  other  internal  mechanisms  are  required.  If  the  bench 
didn’t  make  any  force  on  you,  you  would  obey  Newton’s  second  law  and 
fall  through  it.  Evidently  it  does  make  a  force  on  you! 

A  force  is  the  direct  cause  of  a  change  in  motion. 

I  can  click  a  remote  control  to  make  my  garage  door  change  from 
being  at  rest  to  being  in  motion.  My  finger’s  force  on  the  button, 
however,  was  not  the  force  that  acted  on  the  door.  When  we  speak 
of  a  force  on  an  object  in  physics,  we  are  talking  about  a  force  that 
acts  directly.  Similarly,  when  you  pull  a  reluctant  dog  along  by  its 
leash,  the  leash  and  the  dog  are  making  forces  on  each  other,  not 
your  hand  and  the  dog.  The  dog  is  not  even  touching  your  hand. 

self-check  B 

Which  of  the  following  things  can  be  correctly  described  in  terms  of 
force? 

(1 )  A  nuclear  submarine  is  charging  ahead  at  full  steam. 

(2)  A  nuclear  submarine’s  propellers  spin  in  the  water. 

(3)  A  nuclear  submarine  needs  to  refuel  its  reactor  periodically.  c> 
Answer,  p.  962 

Discussion  Questions 

A  Criticize  the  following  incorrect  statement:  “If  you  shove  a  book 
across  a  table,  friction  takes  away  more  and  more  of  its  force,  until  finally 
it  stops.” 

B  You  hit  a  tennis  ball  against  a  wall.  Explain  any  and  all  incorrect 
ideas  in  the  following  description  of  the  physics  involved:  “The  ball  gets 
some  force  from  you  when  you  hit  it,  and  when  it  hits  the  wall,  it  loses  part 
of  that  force,  so  it  doesn’t  bounce  back  as  fast.  The  muscles  in  your  arm 
are  the  only  things  that  a  force  can  come  from.” 
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3.2.4  Forces  between  solids 


Conservation  laws  are  more  fundamental  than  Newton’s  laws, 
and  they  apply  where  Newton’s  laws  don’t,  e.g.,  to  light  and  to  the 
internal  structure  of  atoms.  However,  there  are  certain  problems 
that  are  much  easier  to  solve  using  Newton’s  laws.  As  a  trivial 
example,  if  you  drop  a  rock,  it  could  conserve  momentum  and  energy 
by  levitating,  or  by  falling  in  the  usual  manner.5  With  Newton’s 
laws,  however,  we  can  reason  that  a  =  F/m ,  so  the  rock  must 
respond  to  the  gravitational  force  by  accelerating. 

Less  trivially,  suppose  a  person  is  hanging  onto  a  rope,  and  we 
want  to  know  if  she  will  slip.  Unlike  the  case  of  the  levitating  rock, 
here  the  no-motion  solution  could  be  perfectly  reasonable  if  her  grip 
is  strong  enough.  We  know  that  her  hand’s  interaction  with  the  rope 
is  fundamentally  an  electrical  interaction  between  the  atoms  in  the 
surface  of  her  palm  and  the  nearby  atoms  in  the  surface  of  the  rope. 
For  practical  problem-solving,  however,  this  is  a  case  where  we’re 
better  off  forgetting  the  fundamental  classification  of  interactions 
at  the  atomic  level  and  working  with  a  more  practical,  everyday 
classification  of  forces.  In  this  practical  scheme,  we  have  three  types 
of  forces  that  can  occur  between  solid  objects  in  contact: 

A  normal  force,  Fn,  is  perpendicular  to  the  surface  of  con¬ 
tact,  and  prevents  objects  from  pass¬ 
ing  through  each  other  by  becoming  as 
strong  as  necessary  (up  to  the  point 
where  the  objects  break).  “Normal” 
means  perpendicular. 

Static  friction,  Fs,  is  parallel  to  the  surface  of  contact,  and 
prevents  the  surfaces  from  starting  to 
slip  by  becoming  as  strong  as  neces¬ 
sary,  up  to  a  maximum  value  of  Fs^max. 
“Static”  means  not  moving,  i.e.,  not 
slipping. 

Kinetic  friction,  F}.,  is  parallel  to  the  surface  of  contact,  and 
tends  to  slow  down  any  slippage  once 
it  starts.  “Kinetic”  means  moving,  i.e., 
slipping. 

self-check  C 

Can  a  frictionless  surface  exert  a  normal  force?  Can  a  frictional  force 

exist  without  a  normal  force?  >  Answer,  p.  962 

If  you  put  a  coin  on  this  page,  which  is  horizontal,  gravity  pulls 
down  on  the  coin,  but  the  atoms  in  the  paper  and  the  coin  repel  each 
other  electrically,  and  the  atoms  are  compressed  until  the  repulsion 
becomes  strong  enough  to  stop  the  downward  motion  of  the  coin. 
We  describe  this  complicated  and  invisible  atomic  process  by  saying 

5  This  pathological  solution  was  first  noted  on  page  83,  and  discussed  in  more 
detail  on  page  950. 
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that  the  paper  makes  an  upward  normal  force  on  the  coin,  and  the 
coin  makes  a  downward  normal  force  on  the  paper.  (The  two  normal 
forces  are  related  by  Newton’s  third  law.  In  fact,  Newton’s  third  law 
only  relates  forces  that  are  of  the  same  type.) 

If  you  now  tilt  the  book  a  little,  static  friction  keeps  the  coin 
from  slipping.  The  picture  at  the  microscopic  level  is  even  more 
complicated  than  the  previous  description  of  the  normal  force.  One 
model  is  to  think  of  the  tiny  bumps  and  depressions  in  the  coin 
as  settling  into  the  similar  irregularities  in  the  paper.  This  model 
predicts  that  rougher  surfaces  should  have  more  friction,  which  is 
sometimes  true  but  not  always.  Two  very  smooth,  clean  glass  sur¬ 
faces  or  very  well  finished  machined  metal  surfaces  may  actually 
stick  better  than  rougher  surfaces  would,  the  probable  explanation 
being  that  there  is  some  kind  of  chemical  bonding  going  on,  and  the 
smoother  surfaces  allow  more  atoms  to  be  in  contact. 

Finally,  as  you  tilt  the  book  more  and  more,  there  comes  a  point 
where  static  friction  reaches  its  maximum  value.  The  surfaces  be¬ 
come  unstuck,  and  the  coin  begins  to  slide  over  the  paper.  Kinetic 
friction  slows  down  this  slipping  motion  significantly.  In  terms  of  en¬ 
ergy,  kinetic  friction  is  converting  mechanical  energy  into  heat,  just 
like  when  you  rub  your  hands  together  to  keep  warm.  One  model 
of  kinetic  friction  is  that  the  tiny  irregularities  in  the  two  surfaces 
bump  against  each  other,  causing  vibrations  whose  energy  rapidly 
converts  to  heat  and  sound  —  you  can  hear  this  sound  if  you  rub 
your  fingers  together  near  your  ear. 

For  dry  surfaces,  experiments  show  that  the  following  equations 
usually  work  fairly  well: 


i  /  Static  friction:  the  tray  doesn’t 
slip  on  the  waiter’s  fingers. 


j  /  Kinetic  friction:  the  car  skids. 


F. 


s,max 


/ts  Fn 


and 


Fk  ~  /^fc-Fn, 


where  fis ,  the  coefficient  of  static  friction,  and  jik-  the  coefficient  of 
kinetic  friction,  are  constants  that  depend  on  the  properties  of  the 
two  surfaces,  such  as  what  they’re  made  of  and  how  rough  they  are. 

self-check  D 

1.  When  a  baseball  player  slides  in  to  a  base,  is  the  friction  static,  or 
kinetic? 

2.  A  mattress  stays  on  the  roof  of  a  slowly  accelerating  car.  Is  the 
friction  static,  or  kinetic? 

3.  Does  static  friction  create  heat?  Kinetic  friction?  >  Answer,  p.  962 
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Maximum  acceleration  of  a  car  example  25 

>  Rubber  on  asphalt  gives  \ik  &  0.4  and  ps  «  0.6.  What  is  the 
upper  limit  on  a  car’s  acceleration  on  a  flat  road,  assuming  that 
the  engine  has  plenty  of  power  and  that  air  friction  is  negligible? 

>  This  isn’t  a  flying  car,  so  we  don’t  expect  it  to  accelerate  verti¬ 
cally.  The  vertical  forces  acting  on  the  car  should  cancel  out.  The 
earth  makes  a  downward  gravitational  force  on  the  car  whose 
absolute  value  is  mg,  so  the  road  apparently  makes  an  upward 
normal  force  of  the  same  magnitude,  Fn  =  mg. 

Now  what  about  the  horizontal  motion?  As  is  always  true,  the  co¬ 
efficient  of  static  friction  is  greater  than  the  coefficient  of  kinetic 
friction,  so  the  maximum  acceleration  is  obtained  with  static  fric¬ 
tion,  i.e.,  the  driver  should  try  not  to  burn  rubber.  The  maximum 
force  of  static  friction  is  Fs,max  =  PsFn  =  Ps^S'-  The  maximum 
acceleration  is  a  =  Fs/m  =  \±sg  ~  6  m/s2.  This  is  true  regard¬ 
less  of  how  big  the  tires  are,  since  the  experimentally  determined 
relationship  Fs  max  =  ]xsFn  is  independent  of  surface  area. 

self-check  E 

Find  the  direction  of  each  of  the  forces  in  figure  k.  >  Answer,  p.  962 


k  / 1 .  The  cliff’s  normal  force  on 
the  climber’s  feet.  2.  The  track’s 
static  frictional  force  on  the  wheel 
of  the  accelerating  dragster.  3. 
The  ball’s  normal  force  on  the 
bat. 


Locomotives  example  26 

Looking  at  a  picture  of  a  locomotive,  I,  we  notice  two  obvious 
things  that  are  different  from  an  automobile.  Where  a  car  typi¬ 
cally  has  two  drive  wheels,  a  locomotive  normally  has  many  — 
ten  in  this  example.  (Some  also  have  smaller,  unpowered  wheels 
in  front  of  and  behind  the  drive  wheels,  but  this  example  doesn’t.) 
Also,  cars  these  days  are  generally  built  to  be  as  light  as  possi¬ 
ble  for  their  size,  whereas  locomotives  are  very  massive,  and  no 
effort  seems  to  be  made  to  keep  their  weight  low.  (The  steam 
locomotive  in  the  photo  is  from  about  1900,  but  this  is  true  even 
for  modern  diesel  and  electric  trains.) 

The  reason  locomotives  are  built  to  be  so  heavy  is  for  traction. 
The  upward  normal  force  of  the  rails  on  the  wheels,  FN,  cancels 
the  downward  force  of  gravity,  Fw,  so  ignoring  plus  and  minus 
signs,  these  two  forces  are  equal  in  absolute  value,  FN  =  Fw. 
Given  this  amount  of  normal  force,  the  maximum  force  of  static 
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I  /  Example  26. 


friction  is  Fs  =  \±SFN  =  \xsFw.  This  static  frictional  force,  of  the 
rails  pushing  forward  on  the  wheels,  is  the  only  force  that  can 
accelerate  the  train,  pull  it  uphill,  or  cancel  out  the  force  of  air 
resistance  while  cruising  at  constant  speed.  The  coefficient  of 
static  friction  for  steel  on  steel  is  about  1/4,  so  no  locomotive  can 
pull  with  a  force  greater  than  about  1/4  of  its  own  weight.  If  the 
engine  is  capable  of  supplying  more  than  that  amount  of  force,  the 
result  will  be  simply  to  break  static  friction  and  spin  the  wheels. 

The  reason  this  is  all  so  different  from  the  situation  with  a  car  is 
that  a  car  isn’t  pulling  something  else.  If  you  put  extra  weight  in 
a  car,  you  improve  the  traction,  but  you  also  increase  the  inertia 
of  the  car,  and  make  it  just  as  hard  to  accelerate.  In  a  train,  the 
inertia  is  almost  all  in  the  cars  being  pulled,  not  in  the  locomotive. 

The  other  fact  we  have  to  explain  is  the  large  number  of  driv¬ 
ing  wheels.  First,  we  have  to  realize  that  increasing  the  num¬ 
ber  of  driving  wheels  neither  increases  nor  decreases  the  total 
amount  of  static  friction,  because  static  friction  is  independent  of 
the  amount  of  surface  area  in  contact.  (The  reason  four-wheel- 
drive  is  good  in  a  car  is  that  if  one  or  more  of  the  wheels  is  slip¬ 
ping  on  ice  or  in  mud,  the  other  wheels  may  still  have  traction. 
This  isn’t  typically  an  issue  for  a  train,  since  all  the  wheels  experi¬ 
ence  the  same  conditions.)  The  advantage  of  having  more  driving 
wheels  on  a  train  is  that  it  allows  us  to  increase  the  weight  of  the 
locomotive  without  crushing  the  rails,  or  damaging  bridges. 

3.2.5  Fluid  friction 

Try  to  drive  a  nail  into  a  waterfall  and  you  will  be  confronted 
with  the  main  difference  between  solid  friction  and  fluid  friction. 
Fluid  friction  is  purely  kinetic;  there  is  no  static  fluid  friction.  The 
nail  in  the  waterfall  may  tend  to  get  dragged  along  by  the  water 
flowing  past  it,  but  it  does  not  stick  in  the  water.  The  same  is  true 
for  gases  such  as  air:  recall  that  we  are  using  the  word  “fluid”  to 
include  both  gases  and  liquids. 

Unlike  kinetic  friction  between  solids,  fluid  friction  increases 
rapidly  with  velocity.  It  also  depends  on  the  shape  of  the  object, 
which  is  why  a  fighter  jet  is  more  streamlined  than  a  Model  T.  For 
objects  of  the  same  shape  but  different  sizes,  fluid  friction  typically 
scales  up  with  the  cross-sectional  area  of  the  object,  which  is  one 
of  the  main  reasons  that  an  SUV  gets  worse  mileage  on  the  freeway 
than  a  compact  car. 


m  /  The  wheelbases  of  the 
Hummer  H3  and  the  Toyota  Prius 
are  surprisingly  similar,  differing 
by  only  10%.  The  main  difference 
in  shape  is  that  the  Hummer  is 
much  taller  and  wider.  It  presents 
a  much  greater  cross-sectional 
area  to  the  wind,  and  this  is  the 
main  reason  that  it  uses  about  2.5 
times  more  gas  on  the  freeway. 
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A  Criticize  the  following  analysis:  “A  book  is  sitting  on  a  table.  I  shove 
it,  overcoming  static  friction.  Then  it  slows  down  until  it  has  less  force  than 
static  friction,  and  it  stops.” 

3.2.6  Analysis  of  forces 

Newton’s  first  and  second  laws  deal  with  the  total  of  all  the 
forces  exerted  on  a  specific  object,  so  it  is  very  important  to  be  able 
to  figure  out  what  forces  there  are.  Once  you  have  focused  your 
attention  on  one  object  and  listed  the  forces  on  it,  it  is  also  helpful 
to  describe  all  the  corresponding  forces  that  must  exist  according 
to  Newton’s  third  law.  We  refer  to  this  as  “analyzing  the  forces”  in 
which  the  object  participates. 

A  barge  example  27 

A  barge  is  being  pulled  along  a  canal  by  teams  of  horses  on  the  shores.  Analyze  all  the  forces  in  which  the 
barge  participates. 


force  acting  on  barge 

force  related  to  it  by  Newton’s  third  law 

ropes’  forward  normal  forces  on  barge 

barge’s  backward  normal  force  on  ropes 

water’s  backward  fluid  friction  force  on  barge 

barge’s  forward  fluid  friction  force  on  water 

planet  earth’s  downward  gravitational  force 
on  barge 

barge’s  upward  gravitational  force  on  earth 

water’s  upward  “floating”  force  on  barge 

barge’s  downward  “floating”  force  on  water 

Here  I’ve  used  the  word  “floating”  force  as  an  example  of  a  sensible  invented  term  for  a  type  of  force  not 
classified  on  the  tree  in  the  previous  section.  A  more  formal  technical  term  would  be  “hydrostatic  force.” 

Note  how  the  pairs  of  forces  are  all  structured  as  “A’s  force  on  B,  B’s  force  on  A”:  ropes  on  barge  and  barge 
on  ropes;  water  on  barge  and  barge  on  water.  Because  all  the  forces  in  the  left  column  are  forces  acting  on 
the  barge,  all  the  forces  in  the  right  column  are  forces  being  exerted  by  the  barge,  which  is  why  each  entry  in 
the  column  begins  with  “barge.” 

Often  you  may  be  unsure  whether  you  have  missed  one  of  the 
forces.  Here  are  three  strategies  for  checking  your  list: 

1.  See  what  physical  result  would  come  from  the  forces  you’ve 
found  so  far.  Suppose,  for  instance,  that  you’d  forgotten  the 
“floating”  force  on  the  barge  in  the  example  above.  Looking 
at  the  forces  you’d  found,  you  would  have  found  that  there 
was  a  downward  gravitational  force  on  the  barge  which  was 
not  canceled  by  any  upward  force.  The  barge  isn’t  supposed 
to  sink,  so  you  know  you  need  to  find  a  fourth,  upward  force. 

2.  Whenever  one  solid  object  touches  another,  there  will  be 
a  normal  force,  and  possibly  also  a  frictional  force;  check  for 
both. 

3.  Make  a  drawing  of  the  object,  and  draw  a  dashed  boundary 
line  around  it  that  separates  it  from  its  environment.  Look  for 
points  on  the  boundary  where  other  objects  come  in  contact 
with  your  object.  This  strategy  guarantees  that  you’ll  find 
every  contact  force  that  acts  on  the  object,  although  it  won’t 
help  you  to  find  non-contact  forces. 
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The  following  is  another  example  in  which  we  can  profit  by  check¬ 
ing  against  our  physical  intuition  for  what  should  be  happening. 

Rappelling  example  28 

As  shown  in  the  figure  below,  Cindy  is  rappelling  down  a  cliff.  Her  downward  motion  is  at  constant  speed,  and 
she  takes  little  hops  off  of  the  cliff,  as  shown  by  the  dashed  line.  Analyze  the  forces  in  which  she  participates 
at  a  moment  when  her  feet  are  on  the  cliff  and  she  is  pushing  off. 


force  acting  on  Cindy 

force  related  to  it  by  Newton’s  third  law 

planet  earth’s  downward  gravitational  force 
on  Cindy 

Cindy’s  upward  gravitational  force  on  earth 

ropes  upward  frictional  force  on  Cindy  (her 
hand) 

Cindy’s  downward  frictional  force  on  the  rope 

cliff’s  rightward  normal  force  on  Cindy 

Cindy’s  leftward  normal  force  on  the  cliff 

The  two  vertical  forces  cancel,  which  is  what  they  should  be  doing  if  she  is  to  go  down  at  a  constant  rate.  The 
only  horizontal  force  on  her  is  the  cliff’s  force,  which  is  not  canceled  by  any  other  force,  and  which  therefore 
will  produce  an  acceleration  of  Cindy  to  the  right.  This  makes  sense,  since  she  is  hopping  off.  (This  solution 
is  a  little  oversimplified,  because  the  rope  is  slanting,  so  it  also  applies  a  small  leftward  force  to  Cindy.  As  she 
flies  out  to  the  right,  the  slant  of  the  rope  will  increase,  pulling  her  back  in  more  strongly.) 


I  believe  that  constructing  the  type  of  table  described  in  this 
section  is  the  best  method  for  beginning  students.  Most  textbooks, 
however,  prescribe  a  pictorial  way  of  showing  all  the  forces  acting  on 
an  object.  Such  a  picture  is  called  a  free-body  diagram.  It  should 
not  be  a  big  problem  if  a  future  physics  professor  expects  you  to 
be  able  to  draw  such  diagrams,  because  the  conceptual  reasoning 
is  the  same.  You  simply  draw  a  picture  of  the  object,  with  arrows 
representing  the  forces  that  are  acting  on  it.  Arrows  representing 
contact  forces  are  drawn  from  the  point  of  contact,  noncontact  forces 
from  the  center  of  mass.  Free-body  diagrams  do  not  show  the  equal 
and  opposite  forces  exerted  by  the  object  itself. 

Discussion  Questions 

A  When  you  fire  a  gun,  the  exploding  gases  push  outward  in  all  direc¬ 
tions,  causing  the  bullet  to  accelerate  down  the  barrel.  What  Newton’s- 
third-law  pairs  are  involved?  [Hint:  Remember  that  the  gases  themselves 
are  an  object.] 

B  In  the  example  of  the  barge  going  down  the  canal,  I  referred  to 
a  “floating”  or  “hydrostatic”  force  that  keeps  the  boat  from  sinking.  If  you 
were  adding  a  new  branch  on  the  force-classification  tree  to  represent  this 
force,  where  would  it  go? 
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C  A  pool  ball  is  rebounding  from  the  side  of  the  pool  table.  Analyze 
the  forces  in  which  the  ball  participates  during  the  short  time  when  it  is  in 
contact  with  the  side  of  the  table. 

D  The  earth’s  gravitational  force  on  you,  i.e.,  your  weight,  is  always 
equal  to  mg,  where  m  is  your  mass.  So  why  can  you  get  a  shovel  to  go 
deeper  into  the  ground  by  jumping  onto  it?  Just  because  you’re  jumping, 
that  doesn’t  mean  your  mass  or  weight  is  any  greater,  does  it? 

3.2.7  Transmission  of  forces  by  low-mass  objects 

You’re  walking  your  dog.  The  dog  wants  to  go  faster  than  you 
do,  and  the  leash  is  taut.  Does  Newton’s  third  law  guarantee  that 
your  force  on  your  end  of  the  leash  is  equal  and  opposite  to  the  dog’s 
force  on  its  end?  If  they’re  not  exactly  equal,  is  there  any  reason 
why  they  should  be  approximately  equal? 

If  there  was  no  leash  between  you,  and  you  were  in  direct  contact 
with  the  dog,  then  Newton’s  third  law  would  apply,  but  Newton’s 
third  law  cannot  relate  your  force  on  the  leash  to  the  dog’s  force 
on  the  leash,  because  that  would  involve  three  separate  objects. 
Newton’s  third  law  only  says  that  your  force  on  the  leash  is  equal 
and  opposite  to  the  leash’s  force  on  you, 

Fvl  =  ~Flv, 

and  that  the  dog’s  force  on  the  leash  is  equal  and  opposite  to  its 
force  on  the  dog 

FdL  =  -FLd. 

Still,  we  have  a  strong  intuitive  expectation  that  whatever  force  we 
make  on  our  end  of  the  leash  is  transmitted  to  the  dog,  and  vice- 
versa.  We  can  analyze  the  situation  by  concentrating  on  the  forces 
that  act  on  the  leash,  FdL  and  Fyi.  According  to  Newton’s  second 
law,  these  relate  to  the  leash’s  mass  and  acceleration: 


FdL  +  Fvl  =  mLaL. 

The  leash  is  far  less  massive  then  any  of  the  other  objects  involved, 
and  if  mi,  is  very  small,  then  apparently  the  total  force  on  the  leash 
is  also  very  small,  FdL  T  Fyi,  ~  0,  and  therefore 

FdL  ~  ~FyL- 

Thus  even  though  Newton’s  third  law  does  not  apply  directly  to 
these  two  forces,  we  can  approximate  the  low-mass  leash  as  if  it  was 
not  intervening  between  you  and  the  dog.  It’s  at  least  approximately 
as  if  you  and  the  dog  were  acting  directly  on  each  other,  in  which 
case  Newton’s  third  law  would  have  applied. 

In  general,  low-mass  objects  can  be  treated  approximately  as  if 
they  simply  transmitted  forces  from  one  object  to  another.  This  can 
be  true  for  strings,  ropes,  and  cords,  and  also  for  rigid  objects  such 
as  rods  and  sticks. 
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n  /  If  we  imagine  dividing  a  taut  rope  up  into  small  segments,  then 
any  segment  has  forces  pulling  outward  on  it  at  each  end.  If  the  rope 
is  of  negligible  mass,  then  all  the  forces  equal  +T  or  -7,  where  7,  the 
tension,  is  a  single  number. 


If  you  look  at  a  piece  of  string  under  a  magnifying  glass  as  you 
pull  on  the  ends  more  and  more  strongly,  you  will  see  the  fibers 
straightening  and  becoming  taut.  Different  parts  of  the  string  are 
apparently  exerting  forces  on  each  other.  For  instance,  if  we  think  of 
the  two  halves  of  the  string  as  two  objects,  then  each  half  is  exerting 
a  force  on  the  other  half.  If  we  imagine  the  string  as  consisting  of 
many  small  parts,  then  each  segment  is  transmitting  a  force  to  the 
next  segment,  and  if  the  string  has  very  little  mass,  then  all  the 
forces  are  equal  in  magnitude.  We  refer  to  the  magnitude  of  the 
forces  as  the  tension  in  the  string,  T. 

The  term  “tension”  refers  only  to  internal  forces  within  the 
string.  If  the  string  makes  forces  on  objects  at  its  ends,  then  those 
forces  are  typically  normal  or  frictional  forces  (example  29). 

Types  of  force  made  by  ropes  example  29 

>  Analyze  the  forces  in  figures  o/l  and  o/2. 

>  In  all  cases,  a  rope  can  only  make  “pulling”  forces,  i.e.,  forces 
that  are  parallel  to  its  own  length  and  that  are  toward  itself,  not 
away  from  itself.  You  can’t  push  with  a  rope! 

In  o/l ,  the  rope  passes  through  a  type  of  hook,  called  a  carabiner, 
used  in  rock  climbing  and  mountaineering.  Since  the  rope  can 
only  pull  along  its  own  length,  the  direction  of  its  force  on  the 
carabiner  must  be  down  and  to  the  right.  This  is  perpendicular  to 
the  surface  of  contact,  so  the  force  is  a  normal  force. 


force  acting  on  carabiner 

force  related  to  it  by  Newton’s 
third  law 

rope’s  normal  force  on  cara¬ 
biner  Y 

carabiner’s  normal  force  on 
rope  Y 

(There  are  presumably  other  forces  acting  on  the  carabiner  from 
other  hardware  above  it.) 


In  figure  o/2,  the  rope  can  only  exert  a  net  force  at  its  end  that 
is  parallel  to  itself  and  in  the  pulling  direction,  so  its  force  on  the 
hand  is  down  and  to  the  left.  This  is  parallel  to  the  surface  of 
contact,  so  it  must  be  a  frictional  force.  If  the  rope  isn’t  slipping 
through  the  hand,  we  have  static  friction.  Friction  can’t  exist  with¬ 
out  normal  forces.  These  forces  are  perpendicular  to  the  surface 
of  contact.  For  simplicity,  we  show  only  two  pairs  of  these  normal 


o  /  Example  29.  The  forces 
between  the  rope  and  other 
objects  are  normal  and  frictional 
forces. 
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forces,  as  if  the  hand  were  a  pair  of  pliers. 


force  acting  on  person 

force  related  to  it  by  Newton’s 
third  law 

rope’s  static  frictional  force  on 
person  k" 

person’s  static  frictional  force 
on  rope  Ai 

rope’s  normal  force  on 
person  \ 

person’s  normal  force  on 
rope  V 

rope’s  normal  force  on 
person  Iv 

person’s  normal  force  on 
rope  A 

(There  are  presumably  other  forces  acting  on  the  person  as  well, 
such  as  gravity.) 


If  a  rope  goes  over  a  pulley  or  around  some  other  object,  then 
the  tension  throughout  the  rope  is  approximately  equal  so  long  as 
the  pulley  has  negligible  mass  and  there  is  not  too  much  friction.  A 
rod  or  stick  can  be  treated  in  much  the  same  way  as  a  string,  but 
it  is  possible  to  have  either  compression  or  tension. 

Discussion  Question 

A  When  you  step  on  the  gas  pedal,  is  your  foot’s  force  being  transmitted 
in  the  sense  of  the  word  used  in  this  section? 


3.2.8  Work 

Energy  transferred  to  a  particle 

To  change  the  kinetic  energy,  K  =  (1/2 )mv2,  of  a  particle  mov¬ 
ing  in  one  dimension,  we  must  change  its  velocity.  That  will  entail  a 
change  in  its  momentum,  p  =  mv ,  as  well,  and  since  force  is  the  rate 
of  transfer  of  momentum,  we  conclude  that  the  only  way  to  change 
a  particle’s  kinetic  energy  is  to  apply  a  force.6  A  force  in  the  same 
direction  as  the  motion  speeds  it  up,  increasing  the  kinetic  energy, 
while  a  force  in  the  opposite  direction  slows  it  down. 

Consider  an  infinitesimal  time  interval  during  which  the  particle 
moves  an  infinitesimal  distance  dx,  and  its  kinetic  energy  changes 
by  d K.  In  one  dimension,  we  represent  the  direction  of  the  force 
and  the  direction  of  the  motion  with  positive  and  negative  signs  for 
F  and  dx,  so  the  relationship  among  the  signs  can  be  summarized 
as  follows: 


6The  converse  isn’t  true,  because  kinetic  energy  doesn’t  depend  on  the  di¬ 
rection  of  motion,  but  momentum  does.  We  can  change  a  particle’s  momentum 
without  changing  its  energy,  as  when  a  pool  ball  bounces  off  a  bumper,  reversing 
the  sign  of  p. 
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F  >  0 

dx  >  0 

d  K  >  0 

F  <  0 

dx  <  0 

dA"  >  0 

F  >  0 

dx  <  0 

dA"  <  0 

F  <  0 

dx  >  0 

dAT  <  0 

This  looks  exactly  like  the  rule  for  determining  the  sign  of  a  product, 
and  we  can  easily  show  using  the  chain  rule  that  this  is  indeed  a 
multiplicative  relationship: 

dK  =  [chain  rule] 

=  (mv)(a)(  1/v)  dx 
=  ma  dx 

=  F  dx  [Newton’s  second  law] 


We  can  verify  that  force  multiplied  by  distance  has  units  of  energy: 


N-m 


kg-rn/s 

-  x  m 

s 

kg-m2/s2 


=  J 


A  TV  picture  tube  example  30 

>  At  the  back  of  a  typical  TV’s  picture  tube,  electrical  forces  accel¬ 
erate  each  electron  to  an  energy  of  5  x  10~16  J  over  a  distance 
of  about  1  cm.  How  much  force  is  applied  to  a  single  electron? 
(Assume  the  force  is  constant.)  What  is  the  corresponding  accel¬ 
eration? 

>  Integrating 


we  find 


or 


dK  =  Fdx, 


Kf  -  K,  =  F(xf  -  Xj) 


A  K  =  FAx. 


The  force  is 

F  =  AK/Ax 

5  x  1(T16  J 
“  0.01  m 

=  5  x  10^14  N. 
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This  may  not  sound  like  an  impressive  force,  but  it’s  enough  to 
supply  an  electron  with  a  spectacular  acceleration.  Looking  up 
the  mass  of  an  electron  on  p.  986,  we  find 


p  /  A  simplified  drawing  of  an 
airgun. 


a  =  F/m 
=  5  x  1016  m/s2. 


An  air  gun  example  3 1 

t>  An  airgun,  figure  p,  uses  compressed  air  to  accelerate  a  pellet. 
As  the  pellet  moves  from  x-i  to  x2,  the  air  decompresses,  so  the 
force  is  not  constant.  Using  methods  from  chapter  5,  one  can 
show  that  the  air’s  force  on  the  pellet  is  given  by  F  =  bx~7/5.  A 
typical  high-end  airgun  used  for  competitive  target  shooting  has 


x-i  =  0.046  m, 
x2  =  0.41  m, 


and 


b  =  4.4  N-m7/5. 

What  is  the  kinetic  energy  of  the  pellet  when  it  leaves  the  muzzle? 
(Assume  friction  is  negligible.) 

>  Since  the  force  isn’t  constant,  it  would  be  incorrect  to  do  F  = 
AK/Ax.  Integrating  both  sides  of  the  equation  d K  =  Fdx,  we 
have 

f* 2 

A  K  =  Fdx 

J  X i 

=  22  J 


q  /  The  black  box  does  work 
by  reeling  in  its  cable. 


In  general,  when  energy  is  transferred  by  a  force,7  we  use  the 
term  work  to  refer  to  the  amount  of  energy  transferred.  This  is  dif¬ 
ferent  from  the  way  the  word  is  used  in  ordinary  speech.  If  you  stand 
for  a  long  time  holding  a  bag  of  cement,  you  get  tired,  and  everyone 
will  agree  that  you’ve  worked  hard,  but  you  haven’t  changed  the  en¬ 
ergy  of  the  cement,  so  according  to  the  definition  of  the  physics  term, 
you  haven’t  done  any  work  on  the  bag.  There  has  been  an  energy 
transformation  inside  your  body,  of  chemical  energy  into  heat,  but 
this  just  means  that  one  part  of  your  body  did  positive  work  (lost 
energy)  while  another  part  did  a  corresponding  amount  of  negative 
work  (gained  energy). 

7The  part  of  the  definition  about  “by  a  force”  is  meant  to  exclude  the  transfer 
of  energy  by  heat  conduction,  as  when  a  stove  heats  soup. 
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Work  in  general 

I  derived  the  expression  F  dx  for  one  particular  type  of  kinetic- 
energy  transfer,  the  work  done  in  accelerating  a  particle,  and  then 
defined  work  as  a  more  general  term.  Is  the  equation  correct  for 
other  types  of  work  as  well?  For  example,  if  a  force  lifts  a  mass  m 
against  the  resistance  of  gravity  at  constant  velocity,  the  increase  in 
the  mass’s  gravitational  energy  is  d (mgy)  =  mgdy  =  F  dy,  so  again 
the  equation  works,  but  this  still  doesn’t  prove  that  the  equation  is 
always  correct  as  a  way  of  calculating  energy  transfers. 

Imagine  a  black  box8,  containing  a  gasoline-powered  engine, 
which  is  designed  to  reel  in  a  steel  cable,  exerting  a  certain  force 
F.  For  simplicity,  we  imagine  that  this  force  is  always  constant,  so 
we  can  talk  about  Ax  rather  than  an  infinitesimal  dx.  If  this  black 
box  is  used  to  accelerate  a  particle  (or  any  mass  without  internal 
structure),  and  no  other  forces  act  on  the  particle,  then  the  original 
derivation  applies,  and  the  work  done  by  the  box  is  W  =  F Ax. 
Since  F  is  constant,  the  box  will  run  out  of  gas  after  reeling  in  a 
certain  amount  of  cable  Ax.  The  chemical  energy  inside  the  box 
has  decreased  by  —W,  while  the  mass  being  accelerated  has  gained 
W  worth  of  kinetic  energy.9 

Now  what  if  we  use  the  black  box  to  pull  a  plow?  The  energy  in¬ 
crease  in  the  outside  world  is  of  a  different  type  than  before;  it  takes 
the  forms  of  (1)  the  gravitational  energy  of  the  dirt  that  has  been 
lifted  out  to  the  sides  of  the  furrow,  (2)  frictional  heating  of  the  dirt 
and  the  plowshare,  and  (3)  the  energy  needed  to  break  up  the  dirt 
clods  (a  form  of  electrical  energy  involving  the  attractions  among 
the  atoms  in  the  clod).  The  box,  however,  only  communicates  with 
the  outside  world  via  the  hole  through  which  its  cable  passes.  The 
amount  of  chemical  energy  lost  by  the  gasoline  can  therefore  only 
depend  on  F  and  Ax,  so  it  is  the  same  —  W  as  when  the  box  was 
being  used  to  accelerate  a  mass,  and  thus  by  conservation  of  energy, 
the  work  done  on  the  outside  world  is  again  W. 

This  is  starting  to  sound  like  a  proof  that  the  force-times-distance 
method  is  always  correct,  but  there  was  one  subtle  assumption  in¬ 
volved,  which  was  that  the  force  was  exerted  at  one  point  (the  end  of 
the  cable,  in  the  black  box  example).  Real  life  often  isn’t  like  that. 
For  example,  a  cyclist  exerts  forces  on  both  pedals  at  once.  Serious 
cyclists  use  toe-clips,  and  the  conventional  wisdom  is  that  one  should 
use  equal  amounts  of  force  on  the  upstroke  and  downstroke,  to  make 
full  use  of  both  sets  of  muscles.  This  is  a  two-dimensional  example, 
since  the  pedals  go  in  circles.  We’re  only  discussing  one-dimensional 
motion  right  now,  so  let’s  just  pretend  that  the  upstroke  and  down- 

8  “Black  box”  is  a  traditional  engineering  term  for  a  device  whose  inner  work¬ 
ings  we  don’t  care  about. 

9For  conceptual  simplicity,  we  ignore  the  transfer  of  heat  energy  to  the  outside 
world  via  the  exhaust  and  radiator.  In  reality,  the  sum  of  these  energies  plus  the 
useful  kinetic  energy  transferred  would  equal  W. 


r/The  wheel  spinning  in  the 
air  has  Kcm  =  0.  The  space 
shuttle  has  all  its  kinetic  energy 
in  the  form  of  center  of  mass 
motion,  K  =  Kcm.  The  rolling 
ball  has  some,  but  not  all,  of  its 
energy  in  the  form  of  center  of 
mass  motion,  Kcm  <  K. (Space 
Shuttle  photo  by  NASA) 
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stroke  are  both  executed  in  straight  lines.  Since  the  forces  are  in 
opposite  directions,  one  is  positive  and  one  is  negative.  The  cyclist’s 
total  force  on  the  crank  set  is  zero,  but  the  work  done  isn’t  zero.  We 
have  to  add  the  work  done  by  each  stroke,  W  =  F\Ax\  + 

(I’m  pretending  that  both  forces  are  constant,  so  we  don’t  have  to 
do  integrals.)  Both  terms  are  positive;  one  is  a  positive  number 
multiplied  by  a  positive  number,  while  the  other  is  a  negative  times 
a  negative. 

This  might  not  seem  like  a  big  deal  —  just  remember  not  to  use 
the  total  force  —  but  there  are  many  situations  where  the  total  force 
is  all  we  can  measure.  The  ultimate  example  is  heat  conduction. 
Heat  conduction  is  not  supposed  to  be  counted  as  a  form  of  work, 
since  it  occurs  without  a  force.  But  at  the  atomic  level,  there  are 
forces,  and  work  is  done  by  one  atom  on  another.  When  you  hold  a 
hot  potato  in  your  hand,  the  transfer  of  heat  energy  through  your 
skin  takes  place  with  a  total  force  that’s  extremely  close  to  zero. 
At  the  atomic  level,  atoms  in  your  skin  are  interacting  electrically 
with  atoms  in  the  potato,  but  the  attractions  and  repulsions  add 
up  to  zero  total  force.  It’s  just  like  the  cyclist’s  feet  acting  on  the 
pedals,  but  with  zillions  of  forces  involved  instead  of  two.  There  is 
no  practical  way  to  measure  all  the  individual  forces,  and  therefore 
we  can’t  calculate  the  total  energy  transferred. 

To  summarize,  ^  Fj  d Xj  is  a  correct  way  of  calculating  work, 
where  Fj  is  the  individual  force  acting  on  particle  j,  which  moves  a 
distance  d Xj.  However,  this  is  only  useful  if  you  can  identify  all  the 
individual  forces  and  determine  the  distance  moved  at  each  point  of 
contact.  For  convenience,  I’ll  refer  to  this  as  the  work  theorem.  (It 
doesn’t  have  a  standard  name.) 

There  is,  however,  something  useful  we  can  do  with  the  total 
force.  We  can  use  it  to  calculate  the  part  of  the  work  done  on  an 
object  that  consists  of  a  change  in  the  kinetic  energy  it  has  due  to 
the  motion  of  its  center  of  mass.  The  proof  is  essentially  the  same 
as  the  proof  on  p.  161,  except  that  now  we  don’t  assume  the  force 
is  acting  on  a  single  particle,  so  we  have  to  be  a  little  more  delicate. 
Let  the  object  consist  of  n  particles.  Its  total  kinetic  energy  is 
K  =  1  (1/2 )mjv‘j,  but  this  is  what  we've  already  realized  can’t 

be  calculated  using  the  total  force.  The  kinetic  energy  it  has  due  to 
motion  of  its  center  of  mass  is 


FI  cm  —  2mt°talV 


2 

cm* 


Figure  r  shows  some  examples  of  the  distinction  between  Kcm  and 
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K.  Differentiating  Kcm,  we  have 


d KCm  —  mtotalVcjm  dvc 
dvr 


d  t 


—  'mtotal'V cm~ 


dt  d xr 


dxr 


[chain  rule] 


d  VCm  , 

—  TYltotal  7T  dxcm 


dptotal 


dt 

d  Xr 


dt 

=  F total  d^cm 


[dt/  dxcm  =  l/Vcm] 
\Ptotal  =  ITHotalVcml 


I’ll  call  this  the  kinetic  energy  theorem  —  like  the  work  theorem,  it 
has  no  standard  name. 

An  ice  skater  pushing  off  from  a  wall  example  32 

The  kinetic  energy  theorem  tells  us  how  to  calculate  the  skater’s 
kinetic  energy  if  we  know  the  amount  of  force  and  the  distance 
her  center  of  mass  travels  while  she  is  pushing  off. 

The  work  theorem  tells  us  that  the  wall  does  no  work  on  the 
skater,  since  the  point  of  contact  isn’t  moving.  This  makes  sense, 
because  the  wall  does  not  have  any  source  of  energy. 

Absorbing  an  impact  without  recoiling?  example  33 

>  Is  it  possible  to  absorb  an  impact  without  recoiling?  For  in¬ 
stance,  if  a  ping-pong  ball  hits  a  brick  wall,  does  the  wall  “give”  at 
all? 

>  There  will  always  be  a  recoil.  In  the  example  proposed,  the  wall 
will  surely  have  some  energy  transferred  to  it  in  the  form  of  heat 
and  vibration.  The  work  theorem  tells  us  that  we  can  only  have 
an  energy  transfer  if  the  distance  traveled  by  the  point  of  contact 
is  nonzero. 

Dragging  a  refrigerator  at  constant  velocity  example  34 

The  fridge’s  momentum  is  constant,  so  there  is  no  net  momen¬ 
tum  transfer,  and  the  total  force  on  it  must  be  zero:  your  force 
is  canceling  the  floor’s  kinetic  frictional  force.  The  kinetic  energy 
theorem  is  therefore  true  but  useless.  It  tells  us  that  there  is  zero 
total  force  on  the  refrigerator,  and  that  the  refrigerator’s  kinetic 
energy  doesn’t  change. 

The  work  theorem  tells  us  that  the  work  you  do  equals  your  hand’s 
force  on  the  refrigerator  multiplied  by  the  distance  traveled.  Since 
we  know  the  floor  has  no  source  of  energy,  the  only  way  for  the 
floor  and  refrigerator  to  gain  energy  is  from  the  work  you  do.  We 
can  thus  calculate  the  total  heat  dissipated  by  friction  in  the  re¬ 
frigerator  and  the  floor. 

Note  that  there  is  no  way  to  find  how  much  of  the  heat  is  dissi¬ 
pated  in  the  floor  and  how  much  in  the  refrigerator. 
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Accelerating  a  cart  example  35 

If  you  push  on  a  cart  and  accelerate  it,  there  are  two  forces  acting 
on  the  cart:  your  hand’s  force,  and  the  static  frictional  force  of  the 
ground  pushing  on  the  wheels  in  the  opposite  direction. 

Applying  the  work  theorem  to  your  force  tells  us  how  to  calculate 
the  work  you  do. 

Applying  the  work  theorem  to  the  floor’s  force  tells  us  that  the 
floor  does  no  work  on  the  cart.  There  is  no  motion  at  the  point 
of  contact,  because  the  atoms  in  the  floor  are  not  moving.  (The 
atoms  in  the  surface  of  the  wheel  are  also  momentarily  at  rest 
when  they  touch  the  floor.)  This  makes  sense,  because  the  floor 
does  not  have  any  source  of  energy. 

The  kinetic  energy  theorem  refers  to  the  total  force,  and  because 
the  floor’s  backward  force  cancels  part  of  your  force,  the  total 
force  is  less  than  your  force.  This  tells  us  that  only  part  of  your 
work  goes  into  the  kinetic  energy  associated  with  the  forward  mo¬ 
tion  of  the  cart’s  center  of  mass.  The  rest  goes  into  rotation  of  the 
wheels. 

Discussion  Questions 

A  Criticize  the  following  incorrect  statement:  “A  force  doesn’t  do  any 
work  unless  it’s  causing  the  object  to  move.” 

B  To  stop  your  car,  you  must  first  have  time  to  react,  and  then  it  takes 
some  time  for  the  car  to  slow  down.  Both  of  these  times  contribute  to  the 
distance  you  will  travel  before  you  can  stop.  The  figure  shows  how  the 
average  stopping  distance  increases  with  speed.  Because  the  stopping 
distance  increases  more  and  more  rapidly  as  you  go  faster,  the  rule  of 
one  car  length  per  1 0  m.p.h.  of  speed  is  not  conservative  enough  at  high 
speeds.  In  terms  of  work  and  kinetic  energy,  what  is  the  reason  for  the 
more  rapid  increase  at  high  speeds? 
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s  /  Discussion  question  B. 


3.2.9  Simple  Machines 

Conservation  of  energy  provided  the  necessary  tools  for  analyz¬ 
ing  some  mechanical  systems,  such  as  the  seesaw  on  page  85  and  the 
pulley  arrangements  of  the  homework  problems  on  page  120,  but  we 
could  only  analyze  those  machines  by  computing  the  total  energy 
of  the  system.  That  approach  wouldn’t  work  for  systems  like  the 
biceps/forearm  machine  on  page  85,  or  the  one  in  figure  t,  where 
the  energy  content  of  the  person’s  body  is  impossible  to  compute 
directly.  Even  though  the  seesaw  and  the  biceps/forearm  system 
were  clearly  just  two  different  forms  of  the  lever,  we  had  no  way 
to  treat  them  both  on  the  same  footing.  We  can  now  successfully 
attack  such  problems  using  the  work  and  kinetic  energy  theorems. 

Constant  tension  around  a  pulley  example  36 

>  In  figure  t,  what  is  the  relationship  between  the  force  applied  by 
the  person’s  hand  and  the  force  exerted  on  the  block? 

o  If  we  assume  the  rope  and  the  pulley  are  ideal,  i.e.,  frictionless 
and  massless,  then  there  is  no  way  for  them  to  absorb  or  release 
energy,  so  the  work  done  by  the  hand  must  be  the  same  as  the 
work  done  on  the  block.  Since  the  hand  and  the  block  move  the 
same  distance,  the  work  theorem  tells  us  the  two  forces  are  the 
same. 

Similar  arguments  provide  an  alternative  justification  for  the  state¬ 
ment  made  in  section  3.2.7  that  show  that  an  idealized  rope  ex¬ 
erts  the  same  force,  the  tension,  anywhere  it’s  attached  to  some¬ 
thing,  and  the  same  amount  of  force  is  also  exerted  by  each  seg¬ 
ment  of  the  rope  on  the  neighboring  segments.  Going  around  an 
ideal  pulley  also  has  no  effect  on  the  tension. 


t  /  The  force  is  transmitted  to 
the  block. 


u  /  A  mechanical  advantage 
of  2. 


v  /  An  inclined  plane. 
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w  /  A  wedge. 


x  /  Archimedes’  screw 


This  is  an  example  of  a  simple  machine,  which  is  any  mechanical 
system  that  manipulates  forces  to  do  work.  This  particular  ma¬ 
chine  reverses  the  direction  of  the  motion,  but  doesn’t  change  the 
force  or  the  speed  of  motion. 

A  mechanical  advantage  example  37 

The  idealized  pulley  in  figure  u  has  negligible  mass,  so  its  kinetic 
energy  is  zero,  and  the  kinetic  energy  theorem  tells  us  that  the 
total  force  on  it  is  zero.  We  know,  as  in  the  preceding  example, 
that  the  two  forces  pulling  it  to  the  right  are  equal  to  each  other, 
so  the  force  on  the  left  must  be  twice  as  strong.  This  simple 
machine  doubles  the  applied  force,  and  we  refer  to  this  ratio  as 
a  mechanical  advantage  (M.A.)  of  2.  There’s  no  such  thing  as 
a  free  lunch,  however;  the  distance  traveled  by  the  load  is  cut  in 
half,  and  there  is  no  increase  in  the  amount  of  work  done. 

Inclined  plane  and  wedge  example  38 

In  figure  v,  the  force  applied  by  the  hand  is  equal  to  the  one  ap¬ 
plied  to  the  load,  but  there  is  a  mechanical  advantage  compared 
to  the  force  that  would  have  been  required  to  lift  the  load  straight 
up.  The  distance  traveled  up  the  inclined  plane  is  greater  by  a 
factor  of  1/sin  0,  so  by  the  work  theorem,  the  force  is  smaller  by 
a  factor  of  sin  0,  and  we  have  M.A.=1/sin  0.  The  wedge,  w,  is 
similar. 

Archimedes’  screw  example  39 

In  one  revolution,  the  crank  travels  a  distance  2nb,  and  the  water 
rises  by  a  height  h.  The  mechanical  advantage  is  2nb/h. 

3.2.10  Force  related  to  interaction  energy 

In  section  2.3,  we  saw  that  there  were  two  equivalent  ways  of 
looking  at  gravity,  the  gravitational  field  and  the  gravitational  en¬ 
ergy.  They  were  related  by  the  equation  d U  =  mg  dr,  so  if  we  knew 
the  field,  we  could  find  the  energy  by  integration,  U  =  f  mg  dr, 
and  if  we  knew  the  energy,  we  could  find  the  field  by  differentiation, 
g  =  (1/m)  d U /  dr. 

The  same  approach  can  be  applied  to  other  interactions,  for  ex¬ 
ample  a  mass  on  a  spring.  The  main  difference  is  that  only  in  grav¬ 
itational  interactions  does  the  strength  of  the  interaction  depend 
on  the  mass  of  the  object,  so  in  general,  it  doesn’t  make  sense  to 
separate  out  the  factor  of  m  as  in  the  equation  dU  =  mg  dr.  Since 
F  =  mg  is  the  gravitational  force,  we  can  rewrite  the  equation  in 
the  more  suggestive  form  dU  =  F  dr.  This  form  no  longer  refers  to 
gravity  specifically,  and  can  be  applied  much  more  generally.  The 
only  remaining  detail  is  that  I’ve  been  fairly  cavalier  about  positive 
and  negative  signs  up  until  now.  That  wasn’t  such  a  big  problem 
for  gravitational  interactions,  since  gravity  is  always  attractive,  but 
it  requires  more  careful  treatment  for  nongravitational  forces,  where 
we  don’t  necessarily  know  the  direction  of  the  force  in  advance,  and 
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we  need  to  use  positive  and  negative  signs  carefully  for  the  direction 
of  the  force. 

In  general,  suppose  that  forces  are  acting  on  a  particle  —  we 
can  think  of  them  as  coming  from  other  objects  that  are  “off  stage” 
-  and  that  the  interaction  between  the  particle  and  the  off-stage 
objects  can  be  characterized  by  an  interaction  energy,  U ,  which 
depends  only  on  the  particle’s  position,  x.  Using  the  kinetic  energy 
theorem,  we  have  d/C  =  F  dx.  (It’s  not  necessary  to  write  Kcm ,  since 
a  particle  can’t  have  any  other  kind  of  kinetic  energy.)  Conservation 
of  energy  tells  us  d K  +  d U  =  0,  so  the  relationship  between  force 
and  interaction  energy  is  d U  =  —Fdx,  or 

F  =  —  -j—  [relationship  between  force  and  interaction  energy]. 


Force  exerted  by  a  spring  example  40 

>  A  mass  is  attached  to  the  end  of  a  spring,  and  the  energy  of  the 
spring  is  U  =  (1  /2 )kx2,  where  x  is  the  position  of  the  mass,  and 
x  =  0  is  defined  to  be  the  equilibrium  position.  What  is  the  force 
the  spring  exerts  on  the  mass?  Interpret  the  sign  of  the  result. 

o  Differentiating,  we  find 


=  -kx. 


If  x  is  positive,  then  the  force  is  negative,  i.e.,  it  acts  so  as  to  bring 
the  mass  back  to  equilibrium,  and  similarly  for  x  <  0  we  have 
F>  0. 

Most  books  do  the  F  =  -kx  form  before  the  U  =  (1  /2 )kx2  form, 
and  call  it  Hooke’s  law.  Neither  form  is  really  more  fundamental 
than  the  other  —  we  can  always  get  from  one  to  the  other  by 
integrating  or  differentiating. 

Newton ’s  law  of  gravity  example  4 1 

>  Given  the  equation  U  =  -Gm<[m2/r  for  the  energy  of  gravita¬ 
tional  interactions,  find  the  corresponding  equation  for  the  grav¬ 
itational  force  on  mass  m2.  Interpret  the  positive  and  negative 
signs. 

o  We  have  to  be  a  little  careful  here,  because  we’ve  been  taking 
r  to  be  positive  by  definition,  whereas  the  position,  x,  of  mass  m2 
could  be  positive  or  negative,  depending  on  which  side  of  m-\  it’s 
on. 

For  positive  x,  we  have  r  =  x,  and  differentiation  gives 


=  -Gm-\m2/x2. 
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As  in  the  preceding  example,  we  have  F  <  0  when  x  is  positive, 
because  the  object  is  being  attracted  back  toward  x  =  0. 

When  x  is  negative,  the  relationship  between  r  and  x  becomes 
r  =  -x,  and  the  result  for  the  force  is  the  same  as  before,  but  with 
a  minus  sign.  We  can  combine  the  two  equations  by  writing 

\F\  =  Gm-\m2/r2, 

and  this  is  the  form  traditionally  known  as  Newton’s  law  of  grav¬ 
ity.  As  in  the  preceding  example,  the  U  and  F  equations  contain 
equivalent  information,  and  neither  is  more  fundamental  than  the 
other. 

Equilibrium  example  42 

I  previously  described  the  condition  for  equilibrium  as  a  local  max¬ 
imum  or  minimum  of  U.  A  differentiable  function  has  a  zero  deriva¬ 
tive  at  its  extrema,  and  we  can  now  relate  this  directly  to  force: 
zero  force  acts  on  an  object  when  it  is  at  equilibrium. 
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3.3  Resonance 


Resonance  is  a  phenomenon  in  which  an  oscillator  responds  most 
strongly  to  a  driving  force  that  matches  its  own  natural  frequency 
of  vibration.  For  example,  suppose  a  child  is  on  a  playground  swing 
with  a  natural  frequency  of  1  Hz.  That  is,  if  you  pull  the  child 
away  from  equilibrium,  release  her,  and  then  stop  doing  anything 
for  a  while,  she’ll  oscillate  at  1  Hz.  If  there  was  no  friction,  as  we 
assumed  in  section  2.5,  then  the  sum  of  her  gravitational  and  kinetic 
energy  would  remain  constant,  and  the  amplitude  would  be  exactly 
the  same  from  one  oscillation  to  the  next.  However,  friction  is  going 
to  convert  these  forms  of  energy  into  heat,  so  her  oscillations  would 
gradually  die  out.  To  keep  this  from  happening,  you  might  give 
her  a  push  once  per  cycle,  i.e. ,  the  frequency  of  your  pushes  would 
be  1  Hz,  which  is  the  same  as  the  swing’s  natural  frequency.  As 
long  as  you  stay  in  rhythm,  the  swing  responds  quite  well.  If  you 
start  the  swing  from  rest,  and  then  give  pushes  at  1  Hz,  the  swing’s 
amplitude  rapidly  builds  up,  as  in  figure  a,  until  after  a  while  it 
reaches  a  steady  state  in  which  friction  removes  just  as  much  energy 
as  you  put  in  over  the  course  of  one  cycle. 

self-check  F 

In  figure  a,  compare  the  amplitude  of  the  cycle  immediately  following 
the  first  push  to  the  amplitude  after  the  second.  Compare  the  energies 
as  well.  >  Answer,  p.  962 

What  will  happen  if  you  try  pushing  at  2  Hz?  Your  first  push 
puts  in  some  momentum,  p,  but  your  second  push  happens  after 
only  half  a  cycle,  when  the  swing  is  coming  right  back  at  you,  with 
momentum  —  p\  The  momentum  transfer  from  the  second  push  is 
exactly  enough  to  stop  the  swing.  The  result  is  a  very  weak,  and 
not  very  sinusoidal,  motion,  b. 

Making  the  math  easy 

This  is  a  simple  and  physically  transparent  example  of  resonance: 
the  swing  responds  most  strongly  if  you  match  its  natural  rhythm. 
However,  it  has  some  characteristics  that  are  mathematically  ugly 
and  possibly  unrealistic.  The  quick,  hard  pushes  are  known  as  im¬ 
pulse  forces,  c,  and  they  lead  to  an  x-t  graph  that  has  nondifferen- 
tiable  kinks.  Impulsive  forces  like  this  are  not  only  badly  behaved 
mathematically,  they  are  usually  undesirable  in  practical  terms.  In 
a  car  engine,  for  example,  the  engineers  work  very  hard  to  make 
the  force  on  the  pistons  change  smoothly,  to  avoid  excessive  vibra¬ 
tion.  Throughout  the  rest  of  this  section,  we’ll  assume  a  driving 
force  that  is  sinusoidal,  d,  i.e.,  one  whose  F-t  graph  is  either  a  sine 
function  or  a  function  that  differs  from  a  sine  wave  in  phase,  such 
as  a  cosine.  The  force  is  positive  for  half  of  each  cycle  and  negative 
for  the  other  half,  i.e.,  there  is  both  pushing  and  pulling.  Sinusoidal 
functions  have  many  nice  mathematical  characteristics  (we  can  dif¬ 
ferentiate  and  integrate  them,  and  the  sum  of  sinusoidal  functions 


a  /  An  x-versus -t  graph  for  a 
swing  pushed  at  resonance. 


b  /  A  swing  pushed  at  twice 
its  resonant  frequency. 


c  /  The  F-versus-f  graph  for 
an  impulsive  driving  force. 


d/A  sinusoidal  driving  force. 
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that  have  the  same  frequency  is  a  sinusoidal  function),  and  they  are 
also  used  in  many  practical  situations.  For  instance,  my  garage  door 
zapper  sends  out  a  sinusoidal  radio  wave,  and  the  receiver  is  tuned 
to  resonance  with  it. 

A  second  mathematical  issue  that  I  glossed  over  in  the  swing 
example  was  how  friction  behaves.  In  section  3.2.4,  about  forces 
between  solids,  the  empirical  equation  for  kinetic  friction  was  inde¬ 
pendent  of  velocity.  Fluid  friction,  on  the  other  hand,  is  velocity- 
dependent.  For  a  child  on  a  swing,  fluid  friction  is  the  most  im¬ 
portant  form  of  friction,  and  is  approximately  proportional  to  v2. 
In  still  other  situations,  e.g.,  with  a  low-density  gas  or  friction  be¬ 
tween  solid  surfaces  that  have  been  lubricated  with  a  fluid  such  as 
oil,  we  may  find  that  the  frictional  force  has  some  other  dependence 
on  velocity,  perhaps  being  proportional  to  v,  or  having  some  other 
complicated  velocity  dependence  that  can’t  even  be  expressed  with 
a  simple  equation.  It  would  be  extremely  complicated  to  have  to 
treat  all  of  these  different  possibilities  in  complete  generality,  so  for 
the  rest  of  this  section,  we’ll  assume  friction  proportional  to  velocity 

F  =  —bv, 

simply  because  the  resulting  equations  happen  to  be  the  easiest  to 
solve.  Even  when  the  friction  doesn’t  behave  in  exactly  this  way, 
many  of  our  results  may  still  be  at  least  qualitatively  correct. 

3.3.1  Damped,  free  motion 

Numerical  treatment 

An  oscillator  that  has  friction  is  referred  to  as  damped.  Let’s  use 
numerical  techniques  to  find  the  motion  of  a  damped  oscillator  that 
is  released  away  from  equilibrium,  but  experiences  no  driving  force 
after  that.  We  can  expect  that  the  motion  will  consist  of  oscillations 
that  gradually  die  out. 

In  section  2.5,  we  simulated  the  undamped  case  using  our  tried 
and  true  Python  function  based  on  conservation  of  energy.  Now, 
however,  that  approach  becomes  a  little  awkward,  because  it  in¬ 
volves  splitting  up  the  path  to  be  traveled  into  n  tiny  segments,  but 
in  the  presence  of  damping,  each  swing  is  a  little  shorter  than  the 
last  one,  and  we  don’t  know  in  advance  exactly  how  far  the  oscilla¬ 
tion  will  get  before  turning  around.  An  easier  technique  here  is  to 
use  force  rather  than  energy.  Newton’s  second  law,  a  =  F/m,  gives 
a  =  (—kx  —  bv)/m,  where  we’ve  made  use  of  the  result  of  example 
40  for  the  force  exerted  by  the  spring.  This  becomes  a  little  prettier 
if  we  rewrite  it  in  the  form 

ma  +  bv  +  kx  =  0, 

which  gives  symmetric  treatment  to  three  terms  involving  x  and  its 
first  and  second  derivatives,  v  and  a.  Now  instead  of  calculating 
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the  time  At  =  Ax/v  required  to  move  a  predetermined  distance 
Ax,  we  pick  At  and  determine  the  distance  traveled  in  that  time, 
Ax  =  vAt.  Also,  we  can  no  longer  update  v  based  on  conservation  of 
energy,  since  we  don’t  have  any  easy  way  to  keep  track  of  how  much 
mechanical  energy  has  been  changed  into  heat  energy.  Instead,  we 
recalculate  the  velocity  using  Av  =  aAt. 

1  import  math 

2  k=39.4784  #  chosen  to  give  a  period  of  1  second 

3  m=l . 

4  b=0.211  #  chosen  to  make  the  results  simple 

5  x=l. 

6  v= 0 . 

7  t=0 . 

8  dt= . 01 

9  n=1000 

10  for  j  in  range(n): 

11  x=x+v*dt 

12  a=(-k*x-b*v) /m 

13  if  (v>0)  and  (v+a*dt<0)  : 

14  print ("turnaround  at  t=",t,",  x=",x) 

15  v=v+a*dt 

16  t=t+dt 


turnaround 

at 

t=  0.99 

turnaround 

at 

t=  1.99 

turnaround 

at 

t=  2.99 

turnaround 

at 

t=  3.99 

turnaround 

at 

t=  4.99 

turnaround 

at 

t=  5.99 

turnaround 

at 

t=  6.99 

turnaround 

at 

t=  7.99 

turnaround 

at 

t=  8.99 

turnaround 

at 

t=  9.99 

x=  0.899919262445 
x=  0.809844934046 
x=  0.728777519477 
x=  0.655817260033 
x=  0.590154191135 
x=  0.531059189965 
x=  0.477875914756 
x=  0.430013546991 
x=  0.386940256644 
x=  0.348177318484 


The  spring  constant,  k  =  4ir  =  39.4784  N/m,  is  designed  so 
that  if  the  undamped  equation  /  =  ( l/27r)y/&?7m  was  still  true,  the 
frequency  would  be  1  Hz.  We  start  by  noting  that  the  addition  of  a 
small  amount  of  damping  doesn’t  seem  to  have  changed  the  period 
at  all,  or  at  least  not  to  within  the  accuracy  of  the  calculation.10 
You  can  check  for  yourself,  however,  that  a  large  value  of  b,  say  5 
N-s/m,  does  change  the  period  significantly. 

We  release  the  mass  from  x  =  1  m,  and  after  one  cycle,  it  only 
comes  back  to  about  x  =  0.9  m.  I  chose  b  =  0.211  N-s/rn  by  fiddling 

10This  subroutine  isn’t  as  accurate  a  way  of  calculating  the  period  as  the 
energy-based  one  we  used  in  the  undamped  case,  since  it  only  checks  whether 
the  mass  turned  around  at  some  point  during  the  time  interval  At. 
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around  until  I  got  this  result,  since  a  decrease  of  exactly  10%  is 
easy  to  discuss.  Notice  how  the  amplitude  after  two  cycles  is  about 
0.81  m,  i.e. ,  1  m  times  0.92:  the  amplitude  has  again  dropped  by 
exactly  10%.  This  pattern  continues  for  as  long  as  the  simulation 
runs,  e.g.,  for  the  last  two  cycles,  we  have  0.34818/0.38694=0.89982, 
or  almost  exactly  0.9  again.  It  might  have  seemed  capricious  when  I 
chose  to  use  the  unrealistic  equation  F  =  —bv,  but  this  is  the  payoff. 
Only  with  —bv  friction  do  we  get  this  kind  of  mathematically  simple 
exponential  decay. 

Because  the  decay  is  exponential,  it  never  dies  out  completely; 
this  is  different  from  the  behavior  we  would  have  had  with  Coulomb 
friction,  which  does  make  objects  grind  completely  to  a  stop  at  some 
point.  With  friction  that  acts  like  F  =  —bv,  v  gets  smaller  as  the 
oscillations  get  smaller.  The  smaller  and  smaller  force  then  causes 
them  to  die  out  at  a  rate  that  is  slower  and  slower. 


e/A  damped  sine  wave,  of 
the  form  x  =  Ae^c,s\n(wft  +  6). 


Analytic  treatment 

Taking  advantage  of  this  unexpectedly  simple  result,  let’s  find 
an  analytic  solution  for  the  motion.  The  numerical  output  suggests 
that  we  assume  a  solution  of  the  form 

x  =  Ae~ct  sin(c Oft  +  5), 

where  the  unknown  constants  ujf  and  c  will  presumably  be  related  to 
rn,  b,  and  k.  The  constant  c  indicates  how  quickly  the  oscillations  die 
out.  The  constant  c Vf  is,  as  before,  defined  as  2ir  times  the  frequency, 
with  the  subscript  /  to  indicate  a  free  (undriven)  solution.  All 
our  equations  will  come  out  much  simpler  if  we  use  cus  everywhere 
instead  of  / s  from  now  on,  and,  as  physicists  often  do,  I’ll  generally 
use  the  word  “frequency”  to  refer  to  uj  when  the  context  makes  it 
clear  what  I’m  talking  about.  The  phase  angle  6  has  no  real  physical 
significance,  since  we  can  define  t  =  0  to  be  any  moment  in  time  we 
like. 

self-check  G 

In  figure  f,  which  graph  has  the  greater  value  of  c?  >  Answer,  p.  963 

The  factor  A  for  the  initial  amplitude  can  also  be  omitted  with¬ 
out  loss  of  generality,  since  the  equation  we’re  trying  to  solve,  ma  + 
bv  +  kx  =  0,  is  linear.  That  is,  v  and  a  are  the  first  and  second 
derivatives  of  x,  and  the  derivative  of  Ax  is  simply  A  times  the 
derivative  of  x.  Thus,  if  x(t)  is  a  solution  of  the  equation,  then 
multiplying  it  by  a  constant  gives  an  equally  valid  solution.  This  is 
another  place  where  we  see  that  a  damping  force  proportional  to  v  is 
the  easiest  to  handle  mathematically.  For  a  damping  force  propor¬ 
tional  to  v2,  for  example,  we  would  have  had  to  solve  the  equation 
ma  +  bv2  +  kx  =  0,  which  is  nonlinear. 

For  the  purpose  of  determining  c Of  and  c,  the  most  general  form 
we  need  to  consider  is  therefore  x  =  e~ct  sin  uj  ft  ,  whose  first  and 
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second  derivatives  are  v  =  e  ct  (— c  sin  lu ft  +  cu  cos  uj  ft)  and  a  = 
e-ct  ^c2  sin _  2ujfCcosujft  —  uj2  sin ujft^J .  Plugging  these  into  the 
equation  rna  +  bv  +  kx  =  0  and  setting  the  sine  and  cosine  parts 
equal  to  zero  gives,  after  some  tedious  algebra, 

_  _b_ 

2  m 

and 


Intuitively,  we  expect  friction  to  “slow  down”  the  motion,  as  when 
we  ride  a  bike  into  a  big  patch  of  mud.  “Slow  down,”  however,  could 
have  more  than  one  meaning  here.  It  could  mean  that  the  oscillator 
would  take  more  time  to  complete  each  cycle,  or  it  could  mean  that 
as  time  went  on,  the  oscillations  would  die  out,  thus  giving  smaller 
velocities. 

Our  mathematical  results  show  that  both  of  these  things  hap¬ 
pen.  The  first  equation  says  that  c,  which  indicates  how  quickly  the 
oscillations  damp  out,  is  directly  related  to  b,  the  strength  of  the 
damping. 

The  second  equation,  for  the  frequency,  can  be  compared  with 
the  result  from  page  116  of  y/k/m  for  the  undamped  system.  Let’s 
refer  to  this  now  as  cu0,  to  distinguish  it  from  the  actual  frequency 
uj  of  the  free  oscillations  when  damping  is  present.  The  result  for 
ujf  will  be  less  than  cu0,  due  to  the  presence  of  the  b2 /4m2  term.  This 
tells  us  that  the  addition  of  friction  to  the  system  does  increase  the 
time  required  for  each  cycle.  However,  it  is  very  common  for  the 
b2/4:7n2  term  to  be  negligible,  so  that  ujf  ~  u0. 

Figure  g  shows  an  example.  The  damping  here  is  quite  strong: 
after  only  one  cycle  of  oscillation,  the  amplitude  has  already  been 
reduced  by  a  factor  of  2,  corresponding  to  a  factor  of  4  in  energy. 
However,  the  frequency  of  the  damped  oscillator  is  only  about  1% 
lower  than  that  of  the  undamped  one;  after  five  periods,  the  ac¬ 
cumulated  lag  is  just  barely  visible  in  the  offsetting  of  the  arrows. 
We  can  see  that  extremely  strong  damping  —  even  stronger  than 
this  —  would  have  been  necessary  in  order  to  make  ujf  ~  uj0  a  poor 
approximation. 

3.3.2  The  quality  factor 

It’s  usually  impractical  to  measure  b  directly  and  determine  c 
from  the  equation  c  =  &/2m.  For  a  child  on  a  swing,  measuring 
b  would  require  putting  the  child  in  a  wind  tunnel!  It’s  usually 
much  easier  to  characterize  the  amount  of  damping  by  observing  the 
actual  damped  oscillations  and  seeing  how  many  cycles  it  takes  for 
the  mechanical  energy  to  decrease  by  a  certain  factor.  The  unitless 


g/A  damped  sine  wave  is 
compared  with  an  undamped 
one,  with  m  and  k  kept  the  same 
and  only  b  changed. 
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Summary  of  Notation 

k  spring  constant 
m  mass  of  the  oscillator 

b  sets  the  amount  of  damp¬ 

ing,  F  =  -bv 
T  period 
f  frequency,  1/T 
cu  (Greek  letter  omega),  an¬ 
gular  frequency,  2 nf,  often 
referred  to  simply  as  “fre¬ 
quency” 

cu0  frequency  the  oscillator 
would  have  without  damp¬ 
ing,  \fkjm 

a uf  frequency  of  the  free  vi¬ 
brations 

c  sets  the  time  scale  for  the 
exponential  decay  enve¬ 
lope  e~ct  of  the  free  vibra¬ 
tions 

Fm  strength  of  the  driving 
force,  which  is  assumed  to 
vary  sinusoidally  with  fre¬ 
quency  O) 

A  amplitude  of  the  steady- 
state  response 

6  phase  angle  of  the  steady- 
state  response 


quality  factor ,  Q,  is  defined  as  Q  =  loq/2c ,  and  in  the  limit  of  weak 
damping,  where  co  ss  u0,  this  can  be  interpreted  as  the  number  of 
cycles  required  for  the  mechanical  energy  to  fall  off  by  a  factor  of 
e27r  =  535.49  . . .  Using  this  new  quantity,  we  can  rewrite  the  equation 
for  the  frequency  of  damped  oscillations  in  the  slightly  more  elegant 
form  c Of  =  cuoi/l  —  1/4 Q2. 

self-check  H 

What  if  we  wanted  to  make  a  simpler  definition  of  Q,  as  the  number  of 
oscillations  required  for  the  vibrations  to  die  out  completely,  rather  than 
the  number  required  for  the  energy  to  fall  off  by  this  obscure  factor?  > 
Answer,  p.  963 

A  graph  example  43 

The  damped  motion  in  figure  g  has  Q  «  4.5,  giving  ^1-1  /4 Q2  « 
0.99,  as  claimed  at  the  end  of  the  preceding  subsection. 

Exponential  decay  in  a  trumpet  example  44 

>  The  vibrations  of  the  air  column  inside  a  trumpet  have  a  Q  of 
about  10.  This  means  that  even  after  the  trumpet  player  stops 
blowing,  the  note  will  keep  sounding  for  a  short  time.  If  the  player 
suddenly  stops  blowing,  how  will  the  sound  intensity  20  cycles 
later  compare  with  the  sound  intensity  while  she  was  still  blowing? 

>  The  trumpet’s  Q  is  10,  so  after  10  cycles  the  energy  will  have 
fallen  off  by  a  factor  of  535.  After  another  1 0  cycles  we  lose  an¬ 
other  factor  of  535,  so  the  sound  intensity  is  reduced  by  a  factor 
of  535  x  535  =  2.9  x  105. 

The  decay  of  a  musical  sound  is  part  of  what  gives  it  its  charac¬ 
ter,  and  a  good  musical  instrument  should  have  the  right  Q ,  but  the 
Q  that  is  considered  desirable  is  different  for  different  instruments. 
A  guitar  is  meant  to  keep  on  sounding  for  a  long  time  after  a  string 
has  been  plucked,  and  might  have  a  Q  of  1000  or  10000.  One  of  the 
reasons  why  a  cheap  synthesizer  sounds  so  bad  is  that  the  sound 
suddenly  cuts  off  after  a  key  is  released. 

3.3.3  Driven  motion 

The  driven  case  is  extremely  important  in  science,  technology, 
and  engineering.  We  have  an  external  driving  force  F  =  Fin  sin  cut, 
where  the  constant  Fm  indicates  the  maximum  strength  of  the  force 
in  either  direction.  The  equation  of  motion  is  now 

[1]  ma  +  bv  +  kx  =  Fm  sin  cot 

[equation  of  motion  for  a  driven  oscillator]. 

After  the  driving  force  has  been  applied  for  a  while,  we  expect  that 
the  amplitude  of  the  oscillations  will  approach  some  constant  value. 
This  motion  is  known  as  the  steady  state ,  and  it’s  the  most  inter¬ 
esting  thing  to  find  out;  as  we’ll  see  later,  the  most  general  type  of 
motion  is  only  a  minor  variation  on  the  steady-state  motion.  For 
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the  steady-state  motion,  we’re  going  to  look  for  a  solution  of  the 
form 

x  =  A  sin(ut  +  6). 

In  contrast  to  the  undriven  case,  here  it’s  not  possible  to  sweep  A 
and  5  under  the  rug.  The  amplitude  of  the  steady-state  motion,  A, 
is  actually  the  most  interesting  thing  to  know  about  the  steady-state 
motion,  and  it’s  not  true  that  we  still  have  a  solution  no  matter  how 
we  fiddle  with  A;  if  we  have  a  solution  for  a  certain  value  of  A,  then 
multiplying  A  by  some  constant  would  break  the  equality  between 
the  two  sides  of  the  equation  of  motion.  It’s  also  no  longer  true  that 
we  can  get  rid  of  6  simply  be  redefining  when  we  start  the  clock; 
here  5  represents  a  difference  in  time  between  the  start  of  one  cycle 
of  the  driving  force  and  the  start  of  the  corresponding  cycle  of  the 
motion. 

The  velocity  and  acceleration  are  v  =  uAcos(ut  +  5)  and  a  = 
— u2A  sin(cjt  +  S),  and  if  we  plug  these  into  the  equation  of  motion, 

[1] ,  and  simplify  a  little,  we  find 

P 

[2]  ( k  —  mu2)  sin(u;f  +  5)  +  ubcos(ut  +  5)  =  smut. 

The  sum  of  any  two  sinusoidal  functions  with  the  same  frequency 
is  also  a  sinusoidal,  so  the  whole  left  side  adds  up  to  a  sinusoidal. 
By  fiddling  with  A  and  5  we  can  make  the  amplitudes  and  phases 
of  the  two  sides  of  the  equation  match  up. 

Steady  state,  no  damping 

A  and  <5  are  easy  to  find  in  the  case  where  there  is  no  damping 
at  all.  There  are  now  no  cosines  in  equation  [2]  above,  only  sines,  so 
if  we  wish  we  can  set  6  to  zero,  and  we  find  A  =  Fm/(k  —  mu2)  = 
FmJrn(u2  —  u2).  This,  however,  makes  A  negative  for  u  >  uQ.  The 
variable  6  was  designed  to  represent  this  kind  of  phase  relationship, 
so  we  prefer  to  keep  A  positive  and  set  6  =  tt  for  u  >  u0.  Our 
results  are  then 


m  | u2  —  u2\ 

and 

^  _  f  0,  u  <uQ 

\  7T  ,  U  >  U0 


The  most  important  feature  of  the  result  is  that  there  is  a  reso¬ 
nance:  the  amplitude  becomes  greater  and  greater,  and  approaches 
infinity,  as  u  approaches  the  resonant  frequency  uQ.  This  is  the  phys¬ 
ical  behavior  we  anticipated  on  page  171  in  the  example  of  pushing 
a  child  on  a  swing.  If  the  driving  frequency  matches  the  frequency 
of  the  free  vibrations,  then  the  driving  force  will  always  be  in  the 


5  A 
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cd/cg0 

h  /  Dependence  of  the  amplitude 
and  phase  angle  on  the  driving 
frequency,  for  an  undamped 
oscillator.  The  amplitudes  were 
calculated  with  Fm,  m,  and  cu0,  all 
set  to  1 . 
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right  direction  to  add  energy  to  the  swing.  At  a  driving  frequency 
very  different  from  the  resonant  frequency,  we  might  get  lucky  and 
push  at  the  right  time  during  one  cycle,  but  our  next  push  would 
come  at  some  random  point  in  the  next  cycle,  possibly  having  the 
effect  of  slowing  the  swing  down  rather  than  speeding  it  up. 

The  interpretation  of  the  infinite  amplitude  at  u>  =  uj0  is  that 
there  really  isn’t  any  steady  state  if  we  drive  the  system  exactly  at 
resonance  —  the  amplitude  will  just  keep  on  increasing  indefinitely. 
In  real  life,  the  amplitude  can’t  be  infinite  both  because  there  is 
always  some  damping  and  because  there  will  always  be  some  differ¬ 
ence,  however  small,  between  uj  and  cj0.  Even  though  the  infinity  is 
unphysical,  it  has  entered  into  the  popular  consciousness,  starting 
with  the  eccentric  Serbian-American  inventor  and  physicist  Nikola 
Tesla.  Around  1912,  the  tabloid  newspaper  The  World  Today  credu¬ 
lously  reported  a  story  which  Tesla  probably  fabricated  —  or  wildly 
exaggerated  —  for  the  sake  of  publicity.  Supposedly  he  created  a 
steam-powered  device  “no  larger  than  an  alarm  clock,”  containing 
a  piston  that  could  be  made  to  vibrate  at  a  tunable  and  precisely 
controlled  frequency.  “He  put  his  little  vibrator  in  his  coat-pocket 
and  went  out  to  hunt  a  half-erected  steel  building.  Down  in  the 
Wall  Street  district,  he  found  one  —  ten  stories  of  steel  framework 
without  a  brick  or  a  stone  laid  around  it.  He  clamped  the  vibrator 
to  one  of  the  beams,  and  fussed  with  the  adjustment  [presumably 
hunting  for  the  building’s  resonant  frequency]  until  he  got  it.  Tesla 
said  finally  the  structure  began  to  creak  and  weave  and  the  steel¬ 
workers  came  to  the  ground  panic-stricken,  believing  that  there  had 
been  an  earthquake.  Police  were  called  out.  Tesla  put  the  vibrator 
in  his  pocket  and  went  away.  Ten  minutes  more  and  he  could  have 
laid  the  building  in  the  street.  And,  with  the  same  vibrator  he  could 
have  dropped  the  Brooklyn  Bridge  into  the  East  River  in  less  than 
an  hour.” 

The  phase  angle  5  also  exhibits  surprising  behavior.  As  the  fre¬ 
quency  is  tuned  upward  past  resonance,  the  phase  abruptly  shifts  so 
that  the  phase  of  the  response  is  opposite  to  that  of  the  driving  force. 
There  is  a  simple  interpretation  for  this.  The  system’s  mechanical 
energy  can  only  change  due  to  work  done  by  the  driving  force,  since 
there  is  no  damping  to  convert  mechanical  energy  to  heat.  In  the 
steady  state,  then,  the  power  transmitted  by  the  driving  force  over 
a  full  cycle  of  motion  must  average  out  to  zero.  In  general,  the  work 
theorem  d E  =  F  dx  can  always  be  divided  by  df  on  both  sides  to 
give  the  useful  relation  P  =  Fv.  If  Fv  is  to  average  out  to  zero, 
then  F  and  v  must  be  out  of  phase  by  ±7t/2,  and  since  v  is  ahead  of 
x  by  a  phase  angle  of  vr/2,  the  phase  angle  between  x  and  F  must 
be  zero  or  7 r. 

Given  that  these  are  the  two  possible  phases,  why  is  there  a 
difference  in  behavior  between  ui  <  ua  and  u  >  wQ?  At  the  low- 
frequency  limit,  consider  to  =  (1,  i.e.,  a  constant  force.  A  constant 
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force  will  simply  displace  the  oscillator  to  one  side,  reaching  an 
equilibrium  that  is  offset  from  the  usual  one.  The  force  and  the 
response  are  in  phase,  e.g.,  if  the  force  is  to  the  right,  the  equilibrium 
will  be  offset  to  the  right.  This  is  the  situation  depicted  in  the 
amplitude  graph  of  figure  h  at  w  =  0.  The  response,  which  is  not 
zero,  is  simply  this  static  displacement  of  the  oscillator  to  one  side. 

At  high  frequencies,  on  the  other  hand,  imagine  shaking  the 
poor  child  on  the  swing  back  and  forth  with  a  force  that  oscillates 
at  10  Hz.  This  is  so  fast  that  there  is  essentially  no  time  for  the 
force  F  =  —kx  from  gravity  and  the  chain  to  act  from  one  cycle 
to  the  next.  The  problem  becomes  equivalent  to  the  oscillation  of 
a  free  object.  If  the  driving  force  varies  like  sin (ut),  with  <5  =  0, 
then  the  acceleration  is  also  proportional  to  the  sine.  Integrating, 
we  find  that  the  velocity  goes  like  minus  a  cosine,  and  a  second 
integration  gives  a  position  that  varies  as  minus  the  sine  —  opposite 
in  phase  to  the  driving  force.  Intuitively,  this  mathematical  result 
corresponds  to  the  fact  that  at  the  moment  when  the  object  has 
reached  its  maximum  displacement  to  the  right ,  that  is  the  time 
when  the  greatest  force  is  being  applied  to  the  left,  in  order  to  turn 
it  around  and  bring  it  back  toward  the  center. 


A  practice  mute  for  a  violin  example  45 

The  amplitude  of  the  driven  vibrations,  A  =  Fm/(m|tu2-tu§|),  con¬ 
tains  an  inverse  proportionality  to  the  mass  of  the  vibrating  object. 
This  is  simply  because  a  given  force  will  produce  less  accelera¬ 
tion  when  applied  to  a  more  massive  object.  An  application  is 
shown  in  figure  45. 

In  a  stringed  instrument,  the  strings  themselves  don’t  have  enough 
surface  area  to  excite  sound  waves  very  efficiently.  In  instruments 
of  the  violin  family,  as  the  strings  vibrate  from  left  to  right,  they 
cause  the  bridge  (the  piece  of  wood  they  pass  over)  to  wiggle 
clockwise  and  counterclockwise,  and  this  motion  is  transmitted  to 
the  top  panel  of  the  instrument,  which  vibrates  and  creates  sound 
waves  in  the  air. 

A  string  player  who  wants  to  practice  at  night  without  bothering 
the  neighbors  can  add  some  mass  to  the  bridge.  Adding  mass  to 
the  bridge  causes  the  amplitude  of  the  vibrations  to  be  smaller, 
and  the  sound  to  be  much  softer.  A  similar  effect  is  seen  when 
an  electric  guitar  is  used  without  an  amp.  The  body  of  an  electric 
guitar  is  so  much  more  massive  than  the  body  of  an  acoustic 
guitar  that  the  amplitude  of  its  vibrations  is  very  small. 
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i  /  Example  45:  a  viola  without 
a  mute  (left),  and  with  a  mute 
(right).  The  mute  doesn’t  touch 
the  strings  themselves. 


Ago 


energy  of 

steady- 

state 

vibrations 


1/2  max. 


k/The  definition  of  Acu,  the 
full  width  at  half  maximum. 


Steady  state,  with  damping 

The  extension  of  the  analysis  to  the  damped  case  involves  some 
lengthy  algebra,  which  I’ve  outlined  on  page  952  in  appendix  2.  The 
results  are  shown  in  figure  j.  It’s  not  surprising  that  the  steady  state 
response  is  weaker  when  there  is  more  damping,  since  the  steady 
state  is  reached  when  the  power  extracted  by  damping  matches  the 
power  input  by  the  driving  force.  The  maximum  amplitude,  at  the 
peak  of  the  resonance  curve,  is  approximately  proportional  to  Q. 

self-check  I 

From  the  final  result  of  the  analysis  on  page  952,  substitute  w  =  cu0, 
and  satisfy  yourself  that  the  result  is  proportional  to  Q.  Why  is  Ares  oc  Q 
only  an  approximation?  >  Answer,  p.  963 

What  is  surprising  is  that  the  amplitude  is  strongly  affected  by 
damping  close  to  resonance,  but  only  weakly  affected  far  from  it.  In 
other  words,  the  shape  of  the  resonance  curve  is  broader  with  more 
damping,  and  even  if  we  were  to  scale  up  a  high-damping  curve  so 
that  its  maximum  was  the  same  as  that  of  a  low-damping  curve,  it 
would  still  have  a  different  shape.  The  standard  way  of  describing 
the  shape  numerically  is  to  give  the  quantity  Aw,  called  the  full 
width  at  half-maximum,  or  FWHM,  which  is  defined  in  figure  k. 
Note  that  the  y  axis  is  energy,  which  is  proportional  to  the  square 
of  the  amplitude.  Our  previous  observations  amount  to  a  statement 
that  A lo  is  greater  when  the  damping  is  stronger,  i.e. ,  when  the  Q 
is  lower.  It’s  not  hard  to  show  from  the  equations  on  page  952  that 
for  large  Q,  the  FWHM  is  given  approximately  by 


A lj  ps  uj0/Q. 


Another  thing  we  notice  in  figure  j  is  that  for  small  values  of  Q 
the  frequency  ures  of  the  maximum  A  is  less  than  w0.n  At  even 

11  The  relationship  is  Umax  a/u0  =  y  1  —  1/2 Q2,  which  is  similar  in  form  to 
the  equation  for  the  frequency  of  the  free  vibration,  Uf/u0  =  \/ 1  —  1/4 Q2 .  A 
subtle  point  here  is  that  although  the  maximum  of  A  and  the  maximum  of  A2 
must  occur  at  the  same  frequency,  the  maximum  energy  does  not  occur,  as  we 
might  expect,  at  the  same  frequency  as  the  maximum  of  A2.  This  is  because 
the  interaction  energy  is  proportional  to  A2  regardless  of  frequency,  but  the 
kinetic  energy  is  proportional  to  Adu2 .  The  maximum  energy  actually  occurs 
are  precisely  ua. 
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j  /  Dependence  of  the  amplitude 
and  phase  angle  on  the  driving 
frequency.  The  undamped  case 
is  Q  =  oo,  and  the  other  curves 
represent  Q=1,  3,  and  10.  Fm,  m, 
and  a>o  are  all  set  to  1 . 


lower  values  of  Q,  like  Q  =  1,  the  A  —  uj  curve  doesn’t  even  have  a 
maximum  near  uj  >  0. 

An  opera  singer  breaking  a  wineglass  example  46 

In  order  to  break  a  wineglass  by  singing,  an  opera  singer  must 
first  tap  the  glass  to  find  its  natural  frequency  of  vibration,  and 
then  sing  the  same  note  back,  so  that  her  driving  force  will  pro¬ 
duce  a  response  with  the  greatest  possible  amplitude.  If  she’s 
shopping  for  the  right  glass  to  use  for  this  display  of  her  prowess, 
she  should  look  for  one  that  has  the  greatest  possible  Q,  since 
the  resonance  curve  has  a  higher  maximum  for  higher  values  of 
Q. 
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I  /  The  collapsed  section  of 
the  Nimitz  Freeway 


Collapse  of  the  Nimitz  Freeway  example  47 

Figure  I  shows  a  section  of  the  Nimitz  Freeway  in  Oakland,  CA, 
that  collapsed  during  an  earthquake  in  1989.  An  earthquake  con¬ 
sists  of  many  low-frequency  vibrations  that  occur  simultaneously, 
which  is  why  it  sounds  like  a  rumble  of  indeterminate  pitch  rather 
than  a  low  hum.  The  frequencies  that  we  can  hear  are  not  even 
the  strongest  ones;  most  of  the  energy  is  in  the  form  of  vibrations 
in  the  range  of  frequencies  from  about  1  Hz  to  1 0  Hz. 

All  the  structures  we  build  are  resting  on  geological  layers  of  dirt, 
mud,  sand,  or  rock.  When  an  earthquake  wave  comes  along,  the 
topmost  layer  acts  like  a  system  with  a  certain  natural  frequency 
of  vibration,  sort  of  like  a  cube  of  jello  on  a  plate  being  shaken 
from  side  to  side.  The  resonant  frequency  of  the  layer  depends 
on  how  stiff  it  is  and  also  on  how  deep  it  is.  The  ill-fated  section 
of  the  Nimitz  freeway  was  built  on  a  layer  of  mud,  and  analysis  by 
geologist  Susan  E.  Flough  of  the  U.S.  Geological  Survey  shows 
that  the  mud  layer’s  resonance  was  centered  on  about  2.5  Hz, 
and  had  a  width  covering  a  range  from  about  1  Hz  to  4  Hz. 

When  the  earthquake  wave  came  along  with  its  mixture  of  fre¬ 
quencies,  the  mud  responded  strongly  to  those  that  were  close  to 
its  own  natural  2.5  Hz  frequency.  Unfortunately,  an  engineering 
analysis  after  the  quake  showed  that  the  overpass  itself  had  a  res¬ 
onant  frequency  of  2.5  Hz  as  well!  The  mud  responded  strongly  to 
the  earthquake  waves  with  frequencies  close  to  2.5  Hz,  and  the 
bridge  responded  strongly  to  the  2.5  Hz  vibrations  of  the  mud, 
causing  sections  of  it  to  collapse. 


Physical  reason  for  the  relationship  between  Q  and  the  FWHM 

What  is  the  reason  for  this  surprising  relationship  between  the 
damping  and  the  width  of  the  resonance?  Fundamentally,  it  has  to 
do  with  the  fact  that  friction  causes  a  system  to  lose  its  “memory” 
of  its  previous  state.  If  the  Pioneer  10  space  probe,  coasting  through 
the  frictionless  vacuum  of  interplanetary  space,  is  detected  by  aliens 
a  million  years  from  now,  they  will  be  able  to  trace  its  trajectory 
backwards  and  infer  that  it  came  from  our  solar  system.  On  the 
other  hand,  imagine  that  I  shove  a  book  along  a  tabletop,  it  comes 
to  rest,  and  then  someone  else  walks  into  the  room.  There  will  be 
no  clue  as  to  which  direction  the  book  was  moving  before  it  stopped 
-  friction  has  erased  its  memory  of  its  motion.  Now  consider  the 
playground  swing  driven  at  twice  its  natural  frequency,  figure  m, 
where  the  undamped  case  is  repeated  from  figure  b  on  page  171.  In 
the  undamped  case,  the  first  push  starts  the  swing  moving  with  mo¬ 
mentum  p.  but  when  the  second  push  comes,  if  there  is  no  friction 
at  all,  it  now  has  a  momentum  of  exactly  —p,  and  the  momentum 
transfer  from  the  second  push  is  exactly  enough  to  stop  it  dead. 
With  moderate  damping,  however,  the  momentum  on  the  rebound 
is  not  quite  —p,  and  the  second  push’s  effect  isn’t  quite  as  disas- 
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trous.  With  very  strong  damping,  the  swing  comes  essentially  to 
rest  long  before  the  second  push.  It  has  lost  all  its  memory,  and  the 
second  push  puts  energy  into  the  system  rather  than  taking  it  out. 
Although  the  detailed  mathematical  results  with  this  kind  of  impul¬ 
sive  driving  force  are  different,12  the  general  results  are  the  same 
as  for  sinusoidal  driving:  the  less  damping  there  is,  the  greater  the 
penalty  you  pay  for  driving  the  system  off  of  resonance. 

High-Q  speakers  example  48 

Most  good  audio  speakers  have  Q  «  1 ,  but  the  resonance  curve 
for  a  higher-Q  oscillator  always  lies  above  the  corresponding  curve 
for  one  with  a  lower  Q,  so  people  who  want  their  car  stereos  to 
be  able  to  rattle  the  windows  of  the  neighboring  cars  will  often 
choose  speakers  that  have  a  high  Q.  Of  course  they  could  just 
use  speakers  with  stronger  driving  magnets  to  increase  Fm,  but 
the  speakers  might  be  more  expensive,  and  a  high-Q  speaker 
also  has  less  friction,  so  it  wastes  less  energy  as  heat. 

One  problem  with  this  is  that  whereas  the  resonance  curve  of  a 
low-Q  speaker  (its  “response  curve”  or  “frequency  response”  in 
audiophile  lingo)  is  fairly  flat,  a  higher-Q  speaker  tends  to  em¬ 
phasize  the  frequencies  that  are  close  to  its  natural  resonance. 
In  audio,  a  flat  response  curve  gives  more  realistic  reproduction 
of  sound,  so  a  higher  quality  factor,  Q,  really  corresponds  to  a 
tower-quality  speaker. 

Another  problem  with  high-Q  speakers  is  discussed  in  example 
51  on  page  185  . 

Changing  the  pitch  of  a  wind  instrument  example  49 

>  A  saxophone  player  normally  selects  which  note  to  play  by 
choosing  a  certain  fingering,  which  gives  the  saxophone  a  cer¬ 
tain  resonant  frequency.  The  musician  can  also,  however,  change 
the  pitch  significantly  by  altering  the  tightness  of  her  lips.  This 
corresponds  to  driving  the  horn  slightly  off  of  resonance.  If  the 
pitch  can  be  altered  by  about  5%  up  or  down  (about  one  musi¬ 
cal  half-step)  without  too  much  effort,  roughly  what  is  the  Q  of  a 
saxophone? 

>  Five  percent  is  the  width  on  one  side  of  the  resonance,  so  the 
full  width  is  about  10%,  A f/f0  ss  0.1.  The  equation  Atu  =  cu0/Q  is 
defined  in  terms  of  angular  frequency,  w  =  2nf,  and  we’ve  been 
given  our  data  in  terms  of  ordinary  frequency,  f.  The  factors  of  2n 


undamped 


moderately  damped 


strongly  damped 


m  /  An  x-versus -t  graph  of 
the  steady-state  motion  of  a 
swing  being  pushed  at  twice 
its  resonant  frequency  by  an 
impulsive  force. 


12For  example,  the  graphs  calculated  for  sinusoidal  driving  have  resonances 
that  are  somewhat  below  the  natural  frequency,  getting  lower  with  increasing 
damping,  until  for  Q  <  1  the  maximum  response  occurs  at  w  =  0.  In  figure  m, 
however,  we  can  see  that  impulsive  driving  at  c o  =  2io0  produces  a  steady  state 
with  more  energy  than  at  u>  =  u>0. 
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end  up  canceling  out,  however: 
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In  other  words,  once  the  musician  stops  blowing,  the  horn  will 
continue  sounding  for  about  10  cycles  before  its  energy  falls  off 
by  a  factor  of  535.  (Blues  and  jazz  saxophone  players  will  typi¬ 
cally  choose  a  mouthpiece  that  gives  a  low  Q,  so  that  they  can 
produce  the  bluesy  pitch-slides  typical  of  their  style.  “Legit,”  i.e. , 
classically  oriented  players,  use  a  higher-Q  setup  because  their 
style  only  calls  for  enough  pitch  variation  to  produce  a  vibrato, 
and  the  higher  Q  makes  it  easier  to  play  in  tune.) 

Q  of  a  radio  receiver  example  50 

>  A  radio  receiver  used  in  the  FM  band  needs  to  be  tuned  in  to 
within  about  0.1  MHz  for  signals  at  about  100  MHz.  What  is  its 
Q? 

>  As  in  the  last  example,  we’re  given  data  in  terms  of  fs,  not  tus, 
but  the  factors  of  27t  cancel.  The  resulting  Q  is  about  1 000,  which 
is  extremely  high  compared  to  the  Q  values  of  most  mechanical 
systems. 

Transients 

What  about  the  motion  before  the  steady  state  is  achieved? 
When  we  computed  the  undriven  motion  numerically  on  page  172, 
the  program  had  to  initialize  the  position  and  velocity.  By  changing 
these  two  variables,  we  could  have  gotten  any  of  an  infinite  number 
of  simulations.13  The  same  is  true  when  we  have  an  equation  of  mo¬ 
tion  with  a  driving  term,  ma  +  bv  +  kx  =  Fm  sin  cut  (p.  176,  equation 
[1]).  The  steady-state  solutions,  however,  have  no  adjustable  param¬ 
eters  at  all  —  A  and  5  are  uniquely  determined  by  the  parameters 
of  the  driving  force  and  the  oscillator  itself.  If  the  oscillator  isn’t 
initially  in  the  steady  state,  then  it  will  not  have  the  steady-state 
motion  at  first.  What  kind  of  motion  will  it  have? 

The  answer  comes  from  realizing  that  if  we  start  with  the  so¬ 
lution  to  the  driven  equation  of  motion,  and  then  add  to  it  any 
solution  to  the  free  equation  of  motion,  the  result, 

x  =  7lsin(u;t  +  S )  +  A'e~ct  sin (ojft  +  5'), 

13If  you’ve  learned  about  differential  equations,  you’ll  know  that  any  second- 
order  differential  equation  requires  the  specification  of  two  boundary  conditions 
in  order  to  specify  solution  uniquely. 
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is  also  a  solution  of  the  driven  equation.  Here,  as  before,  ojf  is  the 
frequency  of  the  free  oscillations  (uf  «  uj0  for  small  Q),  oj  is  the 
frequency  of  the  driving  force,  A  and  b  are  related  as  usual  to  the 
parameters  of  the  driving  force,  and  A!  and  b'  can  have  any  values 
at  all.  Given  the  initial  position  and  velocity,  we  can  always  choose 
A'  and  5'  to  reproduce  them,  but  this  is  not  something  one  often 
has  to  do  in  real  life.  What’s  more  important  is  to  realize  that  the 
second  term  dies  out  exponentially  over  time,  decaying  at  the  same 
rate  at  which  a  free  vibration  would.  For  this  reason,  the  A'  term  is 
called  a  transient.  A  high-Q  oscillator’s  transients  take  a  long  time 
to  die  out,  while  a  low-Q  oscillator  always  settles  down  to  its  steady 
state  very  quickly. 

Boomy  bass  example  5 1 

In  example  48  on  page  183,  I’ve  already  discussed  one  of  the 
drawbacks  of  a  high-Q  speaker,  which  is  an  uneven  response 
curve.  Another  problem  is  that  in  a  high-Q  speaker,  transients 
take  a  long  time  to  die  out.  The  bleeding-eardrums  crowd  tend 
to  focus  mostly  on  making  their  bass  loud,  so  it’s  usually  their 
woofers  that  have  high  Qs.  The  result  is  that  bass  notes,  “ring” 
after  the  onset  of  the  note,  a  phenomenon  referred  to  as  “boomy 
bass.” 

Overdamped  motion 

The  treatment  of  free,  damped  motion  on  page  174  skipped 
over  a  subtle  point:  in  the  equation  cjf  =  y^k/m  —  b2 / Am2  = 
uj0 1  —  1/4Q2,  Q  <  1/2  results  in  an  answer  that  is  the  square 
root  of  a  negative  number.  For  example,  suppose  we  had  k  =  0, 
which  corresponds  to  a  neutral  equilibrium.  A  physical  example 
would  be  a  mass  sitting  in  a  tub  of  syrup.  If  we  set  it  in  motion, 
it  won’t  oscillate  —  it  will  simply  slow  to  a  stop.  This  system  has 
Q  =  0.  The  equation  of  motion  in  this  case  is  ma  +  bv  =  0,  or,  more 
suggestively, 

du 

m— — |-  bv  =  0. 
at 

One  can  easily  verify  that  this  has  the  solution  v  =  (constant)e_w/m, 
and  integrating,  we  find  x  =  (constant)e_6t//m-|- (constant).  In  other 
words,  the  reason  ujf  comes  out  to  be  mathematical  nonsense14  is 
that  we  were  incorrect  in  assuming  a  solution  that  oscillated  at  a 
frequency  ujf.  The  actual  motion  is  not  oscillatory  at  all. 

In  general,  systems  with  Q  <  1/2,  called  overdamped  systems, 
do  not  display  oscillatory  motion.  Most  cars’  shock  absorbers  are 
designed  with  Q  «  1/2,  since  it’s  undesirable  for  the  car  to  undulate 
up  and  down  for  a  while  after  you  go  over  a  bump.  (Shocks  with  ex¬ 
tremely  low  values  of  Q  are  not  good  either,  because  such  a  system 
takes  a  very  long  time  to  come  back  to  equilibrium.)  It’s  not  par- 

14 Actually,  if  you  know  about  complex  numbers  and  Euler’s  theorem,  it’s  not 
quite  so  nonsensical. 
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ticularly  important  for  our  purposes,  but  for  completeness  I’ll  note, 
as  you  can  easily  verify,  that  the  general  solution  to  the  equation  of 
motion  for  0  <  Q  <  1/2  is  of  the  form  x  =  Ae~ct  +  Be~dt,  while 
Q  =  1/2,  called  the  critically  damped  case,  gives  x  =  (A  +  Bt)e~ct. 
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3.4  Motion  In  Three  Dimensions 

3.4.1  The  Cartesian  perspective 

When  my  friends  and  I  were  bored  in  high  school,  we  used  to  play 
a  paper-and-pencil  game  which,  although  we  never  knew  it,  was  Very 
Educational  —  in  fact,  it  pretty  much  embodies  the  entire  world¬ 
view  of  classical  physics.  To  play  the  game,  you  draw  a  racetrack  on 
graph  paper,  and  try  to  get  your  car  around  the  track  before  anyone 
else.  The  default  is  for  your  car  to  continue  at  constant  speed  in  a 
straight  line,  so  if  it  moved  three  squares  to  the  right  and  one  square 
up  on  your  last  turn,  it  will  do  the  same  this  turn.  You  can  also 
control  the  car’s  motion  by  changing  its  Ax  and  Ay  by  up  to  one 
unit.  If  it  moved  three  squares  to  the  right  last  turn,  you  can  have 
it  move  anywhere  from  two  to  four  squares  to  the  right  this  turn. 


a  /  The  car  can  change  its  x 
and  y  motions  by  one  square 
every  turn. 


b  /  French  mathematician  Rene  Descartes  invented  analytic  geometry; 
Cartesian  (xyz)  coordinates  are  named  after  him.  Fie  did  work  in  phi¬ 
losophy,  and  was  particularly  interested  in  the  mind-body  problem.  He 
was  a  skeptic  and  an  antiaristotelian,  and,  probably  for  fear  of  religious 
persecution,  spent  his  adult  life  in  the  Netherlands,  where  he  fathered 
a  daughter  with  a  Protestant  peasant  whom  he  could  not  marry.  He  kept 
his  daughter’s  existence  secret  from  his  enemies  in  France  to  avoid  giving 
them  ammunition,  but  he  was  crushed  when  she  died  of  scarlatina  at  age 
5.  A  pious  Catholic,  he  was  widely  expected  to  be  sainted.  His  body  was 
buried  in  Sweden  but  then  reburied  several  times  in  France,  and  along 
the  way  everything  but  a  few  fingerbones  was  stolen  by  peasants  who 
expected  the  body  parts  to  become  holy  relics. 

The  fundamental  way  of  dealing  with  the  direction  of  an  ob¬ 
ject’s  motion  in  physics  is  to  use  conservation  of  momentum,  since 
momentum  depends  on  direction.  Up  until  now,  we’ve  only  done 
momentum  in  one  dimension.  How  does  this  relate  to  the  racetrack 
game?  In  the  game,  the  motion  of  a  car  from  one  turn  to  the  next 
is  represented  by  its  Ax  and  Ay.  In  one  dimension,  we  would  only 
need  Ax,  which  could  be  related  to  the  velocity,  Ax/A t,  and  the 
momentum,  mAx/At.  In  two  dimensions,  the  rules  of  the  game 
amount  to  a  statement  that  if  there  is  no  momentum  transfer,  then 
both  mAx/At  and  mAy/At  stay  the  same.  In  other  words,  there 
are  two  flavors  of  momentum,  and  they  are  separately  conserved. 
All  of  this  so  far  has  been  done  with  an  artificial  division  of  time 
into  “turns,”  but  we  can  fix  that  by  redefining  everything  in  terms 
of  derivatives,  and  for  motion  in  three  dimensions  rather  than  two, 
we  augment  x  and  y  with  z: 

vx  =  dx/  d t  vy  =  dy /  df  vz  =  d z/  dt 

and 

px  =  mvx  py  =  mvy  pz  =  mvz 
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c  /  Bullets  are  dropped  and  shot  at  the  same  time. 


We  call  these  the  x,  y,  and  z  components  of  the  velocity  and  the 
momentum. 

There  is  both  experimental  and  theoretical  evidence  that  the 
x ,  y,  and  z  momentum  components  are  separately  conserved,  and 
that  a  momentum  transfer  (force)  along  one  axis  has  no  effect  on 
the  momentum  components  along  the  other  two  axes.  On  page 
89,  for  example,  I  argued  that  it  was  impossible  for  an  air  hockey 
puck  to  make  a  180-degree  turn  spontaneously,  because  then  in  the 
frame  moving  along  with  the  puck,  it  would  have  begun  moving 
after  starting  from  rest.  Now  that  we’re  working  in  two  dimensions, 
we  might  wonder  whether  the  puck  could  spontaneously  make  a  90- 
degree  turn,  but  exactly  the  same  line  of  reasoning  shows  that  this 
would  be  impossible  as  well,  which  proves  that  the  puck  can’t  trade 
x'-niomentum  for  ^/-momentum.  A  more  general  proof  of  separate 
conservation  will  be  given  on  page  214,  after  some  of  the  appropriate 
mathematical  techniques  have  been  introduced. 

As  an  example  of  the  experimental  evidence  for  separate  con¬ 
servation  of  the  momentum  components,  figure  c  shows  correct  and 
incorrect  predictions  of  what  happens  if  you  shoot  a  rifle  and  arrange 
for  a  second  bullet  to  be  dropped  from  the  same  height  at  exactly 
the  same  moment  when  the  first  one  left  the  barrel.  Nearly  every¬ 
one  expects  that  the  dropped  bullet  will  reach  the  dirt  first,  and 
Aristotle  would  have  agreed,  since  he  believed  that  the  bullet  had 
to  lose  its  horizontal  motion  before  it  could  start  moving  vertically. 
In  reality,  we  find  that  the  vertical  momentum  transfer  between  the 
earth  and  the  bullet  is  completely  unrelated  to  the  horizontal  mo¬ 
mentum.  The  bullet  ends  up  with  py  <  0,  while  the  planet  picks 
up  an  upward  momentum  py  >  0,  and  the  total  momentum  in  the 
y  direction  remains  zero.  Both  bullets  hit  the  ground  at  the  same 
time.  This  is  much  simpler  than  the  Aristotelian  version! 

The  Pelton  waterwheel  example  52 

>  There  is  a  general  class  of  machines  that  either  do  work  on  a 
gas  or  liquid,  like  a  boat’s  propeller,  or  have  work  done  on  them 
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by  a  gas  or  liquid,  like  the  turbine  in  a  hydroelectric  power  plant. 
Figure  d  shows  two  types  of  surfaces  that  could  be  attached  to 
the  circumference  of  an  old-fashioned  waterwheel.  Compare  the 
force  exerted  by  the  water  in  the  two  cases. 

>  Let  the  x  axis  point  to  the  right,  and  the  y  axis  up.  In  both 
cases,  the  stream  of  water  rushes  down  onto  the  surface  with 
momentum  py  i  =  -p0,  where  the  subscript  /  stands  for  “initial,” 
i.e.,  before  the  collision. 

In  the  case  of  surface  1 ,  the  streams  of  water  leaving  the  surface 
have  no  momentum  in  the  y  direction,  and  their  momenta  in  the  x 
direction  cancel.  The  final  momentum  of  the  water  is  zero  along 
both  axes,  so  its  entire  momentum,  -p0,  has  been  transferred  to 
the  waterwheel. 

When  the  water  leaves  surface  2,  however,  its  momentum  isn’t 
zero.  If  we  assume  there  is  no  friction,  it’s  pyf  =  +p0,  with  the 
positive  sign  indicating  upward  momentum.  The  change  in  the 
water’s  momentum  is  pyj  -  pyj  =  2 p0,  and  the  momentum  trans¬ 
ferred  to  the  waterwheel  is  -2 p0. 

Force  is  defined  as  the  rate  of  transfer  of  momentum,  so  surface 
2  experiences  double  the  force.  A  waterwheel  constructed  in  this 
way  is  known  as  a  Pelton  waterwheel. 

The  Yarkovsky  effect  example  53 

We  think  of  the  planets  and  asteroids  as  inhabiting  their  orbits 
permanently,  but  it  is  possible  for  an  orbit  to  change  over  periods 
of  millions  or  billions  of  years,  due  to  a  variety  of  effects.  For 
asteroids  with  diameters  of  a  few  meters  or  less,  an  important 
mechanism  is  the  Yarkovsky  effect,  which  is  easiest  to  understand 
if  we  consider  an  asteroid  spinning  about  an  axis  that  is  exactly 
perpendicular  to  its  orbital  plane. 

The  illuminated  side  of  the  asteroid  is  relatively  hot,  and  radiates 
more  infrared  light  than  the  dark  (night)  side.  Light  has  momen¬ 
tum,  and  a  total  force  away  from  the  sun  is  produced  by  combined 
effect  of  the  sunlight  hitting  the  asteroid  and  the  imbalance  be¬ 
tween  the  momentum  radiated  away  on  the  two  sides.  This  force, 
however,  doesn’t  cause  the  asteroid’s  orbit  to  change  over  time, 
since  it  simply  cancels  a  tiny  fraction  of  the  sun’s  gravitational 
attraction.  The  result  is  merely  a  tiny,  undetectable  violation  of 
Kepler’s  law  of  periods. 

Consider  the  sideways  momentum  transfers,  however.  In  figure  e, 
the  part  of  the  asteroid  on  the  right  has  been  illuminated  for  half  a 
spin-period  (half  a  “day”)  by  the  sun,  and  is  hot.  It  radiates  more 
light  than  the  morning  side  on  the  left.  This  imbalance  produces 
a  total  force  in  the  x  direction  which  points  to  the  left.  If  the  aster¬ 
oid’s  orbital  motion  is  to  the  left,  then  this  is  a  force  in  the  same 
direction  as  the  motion,  which  will  do  positive  work,  increasing 


d  /  Two  surfaces  that  could 
be  used  to  extract  energy  from  a 
stream  of  water. 


sunlight 


e  /  An  asteroid  absorbs  visi¬ 
ble  light  from  the  sun,  and  gets 
rid  of  the  energy  by  radiating 
infrared  light. 
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the  asteroid’s  energy  and  boosting  it  into  an  orbit  with  a  greater 
radius.  On  the  other  hand,  if  the  asteroid’s  spin  and  orbital  motion 
are  in  opposite  directions,  the  Yarkovsky  push  brings  the  asteroid 
spiraling  in  closer  to  the  sun. 

Calculations  show  that  it  takes  on  the  order  of  1 07  to  1 08  years  for 
the  Yarkovsky  effect  to  move  an  asteroid  out  of  the  asteroid  belt 
and  into  the  vicinity  of  earth’s  orbit,  and  this  is  about  the  same  as 
the  typical  age  of  a  meteorite  as  estimated  by  its  exposure  to  cos¬ 
mic  rays.  The  Yarkovsky  effect  doesn’t  remove  all  the  asteroids 
from  the  asteroid  belt,  because  many  of  them  have  orbits  that 
are  stabilized  by  gravitational  interactions  with  Jupiter.  However, 
when  collisions  occur,  the  fragments  can  end  up  in  orbits  which 
are  not  stabilized  in  this  way,  and  they  may  then  end  up  reaching 
the  earth  due  to  the  Yarkovsky  effect.  The  cosmic-ray  technique 
is  really  telling  us  how  long  it  has  been  since  the  fragment  was 
broken  out  of  its  parent. 

Discussion  Questions 

A  The  following  is  an  incorrect  explanation  of  a  fact  about  target  shoot¬ 
ing: 

“Shooting  a  high-powered  rifle  with  a  high  muzzle  velocity  is  different  from 
shooting  a  less  powerful  gun.  With  a  less  powerful  gun,  you  have  to  aim 
quite  a  bit  above  your  target,  but  with  a  more  powerful  one  you  don’t  have 
to  aim  so  high  because  the  bullet  doesn’t  drop  as  fast.” 

What  is  the  correct  explanation? 


f  /  Discussion  question  A. 


B  You  have  thrown  a  rock,  and  it  is  flying  through  the  air  in  an  arc.  If 
the  earth’s  gravitational  force  on  it  is  always  straight  down,  why  doesn’t  it 
just  go  straight  down  once  it  leaves  your  hand? 

C  Consider  the  example  of  the  bullet  that  is  dropped  at  the  same 
moment  another  bullet  is  fired  from  a  gun.  What  would  the  motion  of  the 
two  bullets  look  like  to  a  jet  pilot  flying  alongside  in  the  same  direction  as 
the  shot  bullet  and  at  the  same  horizontal  speed? 

3.4.2  Rotational  invariance 

The  Cartesian  approach  requires  that  we  choose  x,  y,  and  z 
axes.  How  do  we  choose  them  correctly?  The  answer  is  that  it  had 
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better  not  matter  which  directions  the  axes  point  (provided  they’re 
perpendicular  to  each  other),  or  where  we  put  the  origin,  because  if 
it  did  matter,  it  would  mean  that  space  was  asymmetric.  If  there 
was  a  certain  point  in  the  universe  that  was  the  right  place  to  put 
the  origin,  where  would  it  be?  The  top  of  Mount  Olympus?  The 
United  Nations  headquarters?  We  find  that  experiments  come  out 
the  same  no  matter  where  we  do  them,  and  regardless  of  which  way 
the  laboratory  is  oriented,  which  indicates  that  no  location  in  space 
or  direction  in  space  is  special  in  any  way.15 

This  is  closely  related  to  the  idea  of  Galilean  relativity  stated  on 
page  62,  from  which  we  already  know  that  the  absolute  motion  of 
a  frame  of  reference  is  irrelevant  and  undetectable.  Observers  using 
frames  of  reference  that  are  in  motion  relative  to  each  other  will  not 
even  agree  on  the  permanent  identity  of  a  particular  point  in  space, 
so  it’s  not  possible  for  the  laws  of  physics  to  depend  on  where  you  are 
in  space.  For  instance,  if  gravitational  energies  were  proportional 
to  mim2  in  one  location  but  to  (77717772)1'00001  in  another,  then  it 
would  be  possible  to  determine  when  you  were  in  a  state  of  absolute 
motion,  because  the  behavior  of  gravitational  interactions  would 
change  as  you  moved  from  one  region  to  the  other. 

Because  of  this  close  relationship,  we  restate  the  principle  of 
Galilean  relativity  in  a  more  general  form.  This  extended  principle 
of  Galilean  relativity  states  that  the  laws  of  physics  are  no  different 
in  one  time  and  place  than  in  another,  and  that  they  also  don’t  de¬ 
pend  on  your  orientation  or  your  motion,  provided  that  your  motion 
is  in  a  straight  line  and  at  constant  speed. 

The  irrelevance  of  time  and  place  could  have  been  stated  in  chap¬ 
ter  1,  but  since  this  section  is  the  first  one  in  which  we’re  dealing 
with  three-dimensional  physics  in  full  generality,  the  irrelevance  of 
orientation  is  what  we  really  care  about  right  now.  This  property  of 
the  laws  of  physics  is  called  rotational  invariance.  The  word  “invari¬ 
ance”  means  a  lack  of  change,  i.e. ,  the  laws  of  physics  don’t  change 
when  we  reorient  our  frame  of  reference. 

Rotational  invariance  of  gravitational  interactions  example  54 
Gravitational  energies  depend  on  the  quantity  Mr,  which  by  the 
Pythagorean  theorem  equals 

1 

y/. Ax2  +  Ay2  +  A  z2 

Rotating  a  line  segment  doesn’t  change  its  length,  so  this  expres¬ 
sion  comes  out  the  same  regardless  of  which  way  we  orient  our 

loOf  course,  you  could  tell  in  a  sealed  laboratory  which  way  was  down,  but 
that’s  because  there  happens  to  be  a  big  planet  nearby,  and  the  planet’s  grav¬ 
itational  field  reaches  into  the  lab,  not  because  space  itself  has  a  special  down 
direction.  Similarly,  if  your  experiment  was  sensitive  to  magnetic  fields,  it  might 
matter  which  way  the  building  was  oriented,  but  that’s  because  the  earth  makes 
a  magnetic  field,  not  because  space  itself  comes  equipped  with  a  north  direction. 
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g  /  Two  balls  roll  down  a  cone  and 
onto  a  plane. 


coordinate  axes.  Even  though  Ax,  Ay,  and  A z  are  different  in 
differently  oriented  coordinate  systems,  r  is  the  same. 

Kinetic  energy  example  55 

Kinetic  energy  equals  (1/2)mv2,  but  what  does  that  mean  in  three 
dimensions,  where  we  have  vx,  vy,  and  vz?  If  you  were  tempted 
to  add  the  components  and  calculate  K  =  (1  /2 )m(vx  +  vy  +  vz)2, 
figure  g  should  convince  you  otherwise.  Using  that  method,  we’d 
have  to  assign  a  kinetic  energy  of  zero  to  ball  number  1,  since 
its  negative  vy  would  exactly  cancel  its  positive  vx,  whereas  ball 
number  2’s  kinetic  energy  wouldn’t  be  zero.  This  would  violate 
rotational  invariance,  since  the  balls  would  behave  differently. 

The  only  possible  way  to  generalize  kinetic  energy  to  three  di¬ 
mensions,  without  violating  rotational  invariance,  is  to  use  an  ex¬ 
pression  that  resembles  the  Pythagorean  theorem, 


v  = 


V2  +  Vy  + 


vf. 


which  results  in 


1 

2' 


/y 


Since  the  velocity  components  are  squared,  the  positive  and  neg¬ 
ative  signs  don’t  matter,  and  the  two  balls  in  the  example  behave 
the  same  way. 
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3.4.3  Vectors 


Remember  the  title  of  this  book?  It  would  have  been  possible  to 
obtain  the  result  of  example  55  by  applying  the  Pythagorean  theo¬ 
rem  to  d.x,  d y,  and  dz,  and  then  dividing  by  dt,  but  the  rotational 
invariance  approach  is  simpler ,  and  is  useful  in  a  much  broader 
context.  Even  with  a  quantity  you  presently  know  nothing  about, 
say  the  magnetic  field,  you  can  infer  that  if  the  components  of  the 
magnetic  field  are  Bx,  By,  and  Bz ,  then  the  physically  useful  way 
to  talk  about  the  strength  of  the  magnetic  field  is  to  define  it  as 
\J B^.  +  B'2  +  Bl.  Nature  knows  your  brain  cells  are  precious,  and 

doesn’t  want  you  to  have  to  waste  them  by  memorizing  mathemat¬ 
ical  rules  that  are  different  for  magnetic  fields  than  for  velocities. 

When  mathematicians  see  that  the  same  set  of  techniques  is 
useful  in  many  different  contexts,  that’s  when  they  start  making 
definitions  that  allow  them  to  stop  reinventing  the  wheel.  The  an¬ 
cient  Greeks,  for  example,  had  no  general  concept  of  fractions.  They 
couldn’t  say  that  a  circle’s  radius  divided  by  its  diameter  was  equal 
to  the  number  1/2.  They  had  to  say  that  the  radius  and  the  diameter 
were  in  the  ratio  of  one  to  two.  With  this  limited  number  concept, 
they  couldn’t  have  said  that  water  was  dripping  out  of  a  tank  at 
a  rate  of  3/4  of  a  barrel  per  day;  instead,  they  would  have  had  to 
say  that  over  four  days,  three  barrels  worth  of  water  would  be  lost. 
Once  enough  of  these  situations  came  up,  some  clever  mathemati¬ 
cian  finally  realized  that  it  would  make  sense  to  define  something 
called  a  fraction,  and  that  one  could  think  of  these  fraction  thingies 
as  numbers  that  lay  in  the  gaps  between  the  traditionally  recognized 
numbers  like  zero,  one,  and  two.  Later  generations  of  mathemati¬ 
cians  introduced  further  subversive  generalizations  of  the  number 
concepts,  inventing  mathematical  creatures  like  negative  numbers, 
and  the  square  root  of  two,  which  can’t  be  expressed  as  a  fraction. 

In  this  spirit,  we  define  a  vector  as  any  quantity  that  has  both 
an  amount  and  a  direction  in  space.  In  contradistinction,  a  scalar 
has  an  amount,  but  no  direction.  Time  and  temperature  are  scalars. 
Velocity,  acceleration,  momentum,  and  force  are  vectors.  In  one  di¬ 
mension,  there  are  only  two  possible  directions,  and  we  can  use  pos¬ 
itive  and  negative  numbers  to  indicate  the  two  directions.  In  more 
than  one  dimension,  there  are  infinitely  many  possible  directions,  so 
we  can’t  use  the  two  symbols  +  and  —  to  indicate  the  direction  of 
a  vector.  Instead,  we  can  specify  the  three  components  of  the  vec¬ 
tor,  each  of  which  can  be  either  negative  or  positive.  We  represent 
vector  quantities  in  handwriting  by  writing  an  arrow  above  them, 
so  for  example  the  momentum  vector  looks  like  this,  p,  but  the  ar¬ 
row  looks  ugly  in  print,  so  in  books  vectors  are  usually  shown  in 
bold-face  type:  p.  A  straightforward  way  of  thinking  about  vectors 
is  that  a  vector  equation  really  represents  three  different  equations. 
For  instance,  conservation  of  momentum  could  be  written  in  terms 
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of  the  three  components, 


A  px  =  0 
Apy  =  0 
Apz  =  0, 

or  as  a  single  vector  equation,16 

Ap  =  0. 


h  /  Example  56. 


The  following  table  summarizes  some  vector  operations. 


operation 

definition 

vector 

vector  +  vector 

vector  —  vector 

vector  •  scalar 
vector  /  scalar 

y/ vector‘d  +  vector %  +  vector f 

Add  component  by  component. 

Subtract  component  by  component. 
Multiply  each  component  by  the  scalar. 
Divide  each  component  by  the  scalar. 

The  first  of  these  is  called  the  magnitude  of  the  vector;  in  one  dimen¬ 
sion,  where  a  vector  only  has  one  component,  it  amounts  to  taking 
the  absolute  value,  hence  the  similar  notation. 

self-check  J 

Translate  the  equations  Fx=max,  Fy=may,  and  Fz=maz  into  a  single 
equation  in  vector  notation.  >  Answer,  p.  963 

An  explosion  example  56 

>  Astronomers  observe  the  planet  Mars  as  the  Martians  fight  a 
nuclear  war.  The  Martian  bombs  are  so  powerful  that  they  rip  the 
planet  into  three  separate  pieces  of  liquefied  rock,  all  having  the 
same  mass.  If  one  fragment  flies  off  with  velocity  components 
Vix  =  0,  i/1y=  1 .0x1 04  km/hr,  and  the  second  with  v2x=1.0x104 
km/hr,  v2y  =  0,  what  is  the  magnitude  of  the  third  one’s  velocity? 

>  We  work  the  problem  in  the  center  of  mass  frame,  in  which 
the  planet  initially  had  zero  momentum.  After  the  explosion,  the 
vector  sum  of  the  momenta  must  still  be  zero.  Vector  addition  can 
be  done  by  adding  components,  so 


m  i/1x  +  mv2x  +  mv3x  =  0 


and 


mv-\y  +  mv2y  +  mv3y  =  0, 

where  we  have  used  the  same  symbol  m  for  all  the  terms,  be¬ 
cause  the  fragments  all  have  the  same  mass.  The  masses  can 

16The  zero  here  is  really  a  zero  vector,  i.e.,  a  vector  whose  components  are 
all  zero,  so  we  should  really  represent  it  with  a  boldface  0.  There’s  usually  not 
much  danger  of  confusion,  however,  so  most  books,  including  this  one,  don’t  use 
boldface  for  the  zero  vector. 
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be  eliminated  by  dividing  each  equation  by  m,  and  we  find 


v3x  =  -1 .0x1 04  km/hr 
vzy  =  -1 .0x1 04  km/hr, 


which  gives  a  magnitude  of 


v3 


1 .4x1 04  km/hr. 


A  toppling  box  example  57 

If  you  place  a  box  on  a  frictionless  surface,  it  will  fall  over  with  a 
very  complicated  motion  that  is  hard  to  predict  in  detail.  We  know, 
however,  that  its  center  of  mass’s  motion  is  related  to  its  momen¬ 
tum,  and  the  rate  at  which  momentum  is  transferred  is  the  force. 
Moreover,  we  know  that  these  relationships  apply  separately  to 
each  component.  Let  x  and  y  be  horizontal,  and  z  vertical.  There 
are  two  forces  on  the  box,  an  upward  force  from  the  table  and  a 
downward  gravitational  force.  Since  both  of  these  are  along  the 
z  axis,  pz  is  the  only  component  of  the  box’s  momentum  that  can 
change.  We  conclude  that  the  center  of  mass  travels  vertically. 
This  is  true  even  if  the  box  bounces  and  tumbles.  [Based  on  an 
example  by  Kleppner  and  Kolenkow.] 

Geometric  representation  of  vectors 

A  vector  in  two  dimensions  can  be  easily  visualized  by  drawing 
an  arrow  whose  length  represents  its  magnitude  and  whose  direction 
represents  its  direction.  The  x  component  of  a  vector  can  then  be 
visualized,  j,  as  the  length  of  the  shadow  it  would  cast  in  a  beam  of 
light  projected  onto  the  x  axis,  and  similarly  for  the  y  component. 
Shadows  with  arrowheads  pointing  back  against  the  direction  of  the 
positive  axis  correspond  to  negative  components. 

In  this  type  of  diagram,  the  negative  of  a  vector  is  the  vector 
with  the  same  magnitude  but  in  the  opposite  direction.  Multiplying 
a  vector  by  a  scalar  is  represented  by  lengthening  the  arrow  by  that 
factor,  and  similarly  for  division. 

self-check  K 

Given  vector  Q  represented  by  an  arrow  below,  draw  arrows  represent¬ 
ing  the  vectors  1.5Q  and  -Q. 


i  /  Example  57. 


x  component 
(positive) 


j/The  geometric  interpreta¬ 
tion  of  a  vector’s  components. 


Q 


>  Answer,  p.  962 
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p 


Q 


(i) 


y 


(2) 


X 


k/Two  vectors,  1,  to  which 
we  apply  the  same  operation  in 
two  different  frames  of  reference, 
2  and  3. 


A  useless  vector  operation  example  58 

The  way  I’ve  defined  the  various  vector  operations  above  aren’t 
as  arbitrary  as  they  seem.  There  are  many  different  vector  oper¬ 
ations  that  we  could  define,  but  only  some  of  the  possible  defini¬ 
tions  are  mathematically  useful.  Consider  the  operation  of  mul¬ 
tiplying  two  vectors  component  by  component  to  produce  a  third 
vector: 


Rx  —  P x  Qx 
Ry  =  PyQy 
Rz  =  P z  Qz 


As  a  simple  example,  we  choose  vectors  P  and  Q  to  have  length 
1 ,  and  make  them  perpendicular  to  each  other,  as  shown  in  figure 
k/1 .  If  we  compute  the  result  of  our  new  vector  operation  using 
the  coordinate  system  shown  in  k/2,  we  find: 

Rx  =  0 
Ry  =  0 
Rz  =  0 

The  x  component  is  zero  because  Px  =  0,  the  y  component  is 
zero  because  Qy  =  0,  and  the  z  component  is  of  course  zero  be¬ 
cause  both  vectors  are  in  the  x-y  plane.  However,  if  we  carry  out 
the  same  operations  in  coordinate  system  k/3,  rotated  45  degrees 
with  respect  to  the  previous  one,  we  find 

Rx  =  - 1/2 
Ry  =  1/2 
Rz  =  0 


The  operation’s  result  depends  on  what  coordinate  system  we 
use,  and  since  the  two  versions  of  R  have  different  lengths  (one 
being  zero  and  the  other  nonzero),  they  don’t  just  represent  the 
same  answer  expressed  in  two  different  coordinate  systems.  Such 
an  operation  will  never  be  useful  in  physics,  because  experiments 
show  physics  works  the  same  regardless  of  which  way  we  orient 
the  laboratory  building!  The  useful  vector  operations,  such  as 
addition  and  scalar  multiplication,  are  rotationally  invariant,  i.e. , 
come  out  the  same  regardless  of  the  orientation  of  the  coordi¬ 
nate  system. 

All  the  vector  techniques  can  be  applied  to  any  kind  of  vector, 
but  the  graphical  representation  of  vectors  as  arrows  is  particularly 
natural  for  vectors  that  represent  lengths  and  distances.  We  define  a 
vector  called  r  whose  components  are  the  coordinates  of  a  particular 
point  in  space,  x ,  y,  and  z.  The  Ar  vector,  whose  components  are 
Ax,  Ay,  and  A z,  can  then  be  used  to  represent  motion  that  starts  at 
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one  point  and  ends  at  another.  Adding  two  Ar  vectors  is  interpreted 
as  a  trip  with  two  legs:  by  computing  the  Ar  vector  going  from  point 
A  to  point  B  plus  the  vector  from  B  to  C,  we  find  the  vector  that 
would  have  taken  us  directly  from  A  to  C. 

Calculations  with  magnitude  and  direction 

If  you  ask  someone  where  Las  Vegas  is  compared  to  Los  Angeles, 
she  is  unlikely  to  say  that  the  Ax  is  290  km  and  the  Ay  is  230  km, 
in  a  coordinate  system  where  the  positive  x  axis  is  east  and  the  y 
axis  points  north.  She  will  probably  say  instead  that  it’s  370  km 
to  the  northeast.  If  she  was  being  precise,  she  might  specify  the 
direction  as  38°  counterclockwise  from  east.  In  two  dimensions,  we 
can  always  specify  a  vector’s  direction  like  this,  using  a  single  angle. 
A  magnitude  plus  an  angle  suffice  to  specify  everything  about  the 
vector.  The  following  two  examples  show  how  we  use  trigonometry 
and  the  Pythagorean  theorem  to  go  back  and  forth  between  the  x-y 
and  magnitude-angle  descriptions  of  vectors. 

Finding  magnitude  and  angle  from  components  example  59 

>  Given  that  the  Ar  vector  from  LA  to  Las  Vegas  has  Ax=290  km 
and  Ay=230  km,  how  would  we  find  the  magnitude  and  direction 
of  Ar? 

>  We  find  the  magnitude  of  Ar  from  the  Pythagorean  theorem: 


We  know  all  three  sides  of  the  triangle,  so  the  angle  0  can  be 
found  using  any  of  the  inverse  trig  functions.  For  example,  we 
know  the  opposite  and  adjacent  sides,  so 


=  38°. 


Finding  the  components  from  the  magnitude  and  angle  example 


60 


>  Given  that  the  straight-line  distance  from  Los  Angeles  to  Las 
Vegas  is  370  km,  and  that  the  angle  0  in  the  figure  is  38°,  how 
can  the  x  and  y  components  of  the  Ar  vector  be  found? 

>  The  sine  and  cosine  of  0  relate  the  given  information  to  the 
information  we  wish  to  find: 


Las  V  egas 


I  /  Example  59. 
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Solving  for  the  unknowns  gives 


Ax  =  |Ar|  cos  0 
=  290  km 
Ay  =  |Ar|  sin  0 
=  230  km 


Los 

Angeles 


The  following  example  shows  the  correct  handling  of  the  plus 
and  minus  signs,  which  is  usually  the  main  cause  of  mistakes  by 
students. 

Negative  components  example  61 

>  San  Diego  is  1 20  km  east  and  1 50  km  south  of  Los  Angeles.  An 
airplane  pilot  is  setting  course  from  San  Diego  to  Los  Angeles.  At 
what  angle  should  she  set  her  course,  measured  counterclock¬ 
wise  from  east,  as  shown  in  the  figure? 

>  If  we  make  the  traditional  choice  of  coordinate  axes,  with  x 
pointing  to  the  right  and  y  pointing  up  on  the  map,  then  her  Ax  is 
negative,  because  her  final  x  value  is  less  than  her  initial  x  value. 
Her  Ay  is  positive,  so  we  have 

Ax  =  - 1 20  km 
Ay  =  150  km. 


If  we  work  by  analogy  with  the  example  59,  we  get 


0  =  tan 


i  Ay 
Ax 


=  tan-1  (-1.25) 


=  -51°. 


According  to  the  usual  way  of  defining  angles  in  trigonometry, 
a  negative  result  means  an  angle  that  lies  clockwise  from  the  x 
axis,  which  would  have  her  heading  for  the  Baja  California.  What 
went  wrong?  The  answer  is  that  when  you  ask  your  calculator  to 
take  the  arctangent  of  a  number,  there  are  always  two  valid  pos¬ 
sibilities  differing  by  180°.  That  is,  there  are  two  possible  angles 
whose  tangents  equal  -1 .25: 

tan  129°  =  -1.25 
tan  (-51°)  =  -1.25 

You  calculator  doesn’t  know  which  is  the  correct  one,  so  it  just 
picks  one.  In  this  case,  the  one  it  picked  was  the  wrong  one,  and 
it  was  up  to  you  to  add  180°  to  it  to  find  the  right  answer. 
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A  shortcut  example  62 

>  A  split  second  after  nine  o’clock,  the  hour  hand  on  a  clock  dial 
has  moved  clockwise  past  the  nine-o’clock  position  by  some  im¬ 
perceptibly  small  angle  4>-  Let  positive  x  be  to  the  right  and  posi¬ 
tive  y  up.  If  the  hand,  with  length  i,  is  represented  by  a  Ar  vector 
going  from  the  dial’s  center  to  the  tip  of  the  hand,  find  this  vector’s 
Ax. 

>  The  following  shortcut  is  the  easiest  way  to  work  out  examples 
like  these,  in  which  a  vector’s  direction  is  known  relative  to  one 
of  the  axes.  We  can  tell  that  Ar  will  have  a  large,  negative  x 
component  and  a  small,  positive  y.  Since  Ax  <  0,  there  are 
really  only  two  logical  possibilities:  either  Ax  =  -^coscth  or  Ax  = 
— £sin  cp.  Because  4>  is  small,  coscf)  is  large  and  sin 4>  is  small. 
We  conclude  that  Ax  =  —l  cos  4>. 

A  typical  application  of  this  technique  to  force  vectors  is  given  in 
example  71  on  p.  205. 

Addition  of  vectors  given  their  components 

The  easiest  type  of  vector  addition  is  when  you  are  in  possession 
of  the  components,  and  want  to  find  the  components  of  their  sum. 

San  Diego  to  Las  Vegas  example  63 

>  Given  the  Ax  and  Ay  values  from  the  previous  examples,  find 
the  Ax  and  Ay  from  San  Diego  to  Las  Vegas. 

o 


A  Xtotai  =  AX-,  +  Ax2 

=  -120  km  +  290  km 
=  1 70  km 

Ayf0fa/  =  Ay!  +  Ay2 

=  1 50  km  +  230  km 
=  380 


n  /  Example  62. 


Las  V  egas 


o  /  Example  63. 


Addition  of  vectors  given  their  magnitudes  and  directions 

In  this  case,  you  must  first  translate  the  magnitudes  and  direc¬ 
tions  into  components,  and  the  add  the  components. 

Graphical  addition  of  vectors 

Often  the  easiest  way  to  add  vectors  is  by  making  a  scale  drawing 
on  a  piece  of  paper.  This  is  known  as  graphical  addition,  as  opposed 
to  the  analytic  techniques  discussed  previously. 

From  San  Diego  to  Las  Vegas,  graphically  example  64 

>  Given  the  magnitudes  and  angles  of  the  Ar  vectors  from  San 
Diego  to  Los  Angeles  and  from  Los  Angeles  to  Las  Vegas,  find 
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p  /  Example  64. 


q  /  Adding  vectors  graphically 
by  placing  them  tip-to-tail,  like  a 
train. 


the  magnitude  and  angle  of  the  Ar  vector  from  San  Diego  to  Las 
Vegas. 

>  Using  a  protractor  and  a  ruler,  we  make  a  careful  scale  drawing, 
as  shown  in  figure  p  on  page  200.  A  scale  of  1  cmolO  km  was 
chosen  for  this  solution.  With  a  ruler,  we  measure  the  distance 
from  San  Diego  to  Las  Vegas  to  be  3.8  cm,  which  corresponds  to 
380  km.  With  a  protractor,  we  measure  the  angle  0  to  be  71°. 

Even  when  we  don’t  intend  to  do  an  actual  graphical  calculation 
with  a  ruler  and  protractor,  it  can  be  convenient  to  diagram  the 
addition  of  vectors  in  this  way,  as  shown  in  figure  q.  With  Ar 
vectors,  it  intuitively  makes  sense  to  lay  the  vectors  tip-to-tail  and 
draw  the  sum  vector  from  the  tail  of  the  first  vector  to  the  tip  of 
the  second  vector.  We  can  do  the  same  when  adding  other  vectors 
such  as  force  vectors. 

Unit  vector  notation 

When  we  want  to  specify  a  vector  by  its  components,  it  can  be 
cumbersome  to  have  to  write  the  algebra  symbol  for  each  compo¬ 
nent: 

Ax  =  290  km,  Ay  =  230  km 
A  more  compact  notation  is  to  write 


Ar  =  (290  km)x  +  (230  km)y, 
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where  the  vectors  x,  y,  and  z,  called  the  unit  vectors,  are  defined 
as  the  vectors  that  have  magnitude  equal  to  1  and  directions  lying 
along  the  x,  y,  and  z  axes.  In  speech,  they  are  referred  to  as  “x-hat,” 
“y-hat,”  and  “z-hat.” 


A  slightly  different,  and  harder  to  remember,  version  of  this 
notation  is  unfortunately  more  prevalent.  In  this  version,  the  unit 
vectors  are  called  i,  j,  and  k: 


Ar  =  (290  km)i  +  (230  km)j. 


Applications  to  relative  motion,  momentum,  and  force 

Vector  addition  is  the  correct  way  to  generalize  the  one-dimensional 
concept  of  adding  velocities  in  relative  motion,  as  shown  in  the  fol¬ 
lowing  example: 

Velocity  vectors  in  relative  motion  example  65 

>  You  wish  to  cross  a  river  and  arrive  at  a  dock  that  is  directly 
across  from  you,  but  the  river’s  current  will  tend  to  carry  you 
downstream.  To  compensate,  you  must  steer  the  boat  at  an  an¬ 
gle.  Find  the  angle  0,  given  the  magnitude,  \vwl\,  of  the  water’s 
velocity  relative  to  the  land,  and  the  maximum  speed,  |vew|,  of 
which  the  boat  is  capable  relative  to  the  water. 

o  The  boat’s  velocity  relative  to  the  land  equals  the  vector  sum  of 
its  velocity  with  respect  to  the  water  and  the  water’s  velocity  with 
respect  to  the  land, 


Vbl  =  Vbw  +  Vwl- 


If  the  boat  is  to  travel  straight  across  the  river,  i.e. ,  along  the  y 
axis,  then  we  need  to  have  Vbl,x  =  0.  This  x  component  equals 
the  sum  of  the  x  components  of  the  other  two  vectors, 


veL,x  =  vBW,x  +  mwL,x, 


or 


0  =  — |veiv|  sin  0  +  \mwl 


Solving  for  0,  we  find 


sin  0  =  |vw/J/Ivbw|, 


so 


0  =  sin 


-1  \Vwl\ 


r  /  The  racing  greyhound’s 
velocity  vector  is  in  the  direction 
of  its  motion,  i.e.,  tangent  to  its 
curved  path. 


VWL — ► 
VBW 
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How  to  generalize  one-dimensional  equations 
>  How  can  the  one-dimensional  relationships 


example  66 


Ptotal  ~  mtota!vcm 


and 


Xcm  — 


Ey  mj 


be  generalized  to  three  dimensions? 

>  Momentum  and  velocity  are  vectors,  since  they  have  directions 
in  space.  Mass  is  a  scalar.  If  we  rewrite  the  first  equation  to  show 
the  appropriate  quantities  notated  as  vectors, 


Ptotal  -  mtotalV cm, 

we  get  a  valid  mathematical  operation,  the  multiplication  of  a  vec¬ 
tor  by  a  scalar.  Similarly,  the  second  equation  becomes 

,  _Ey  miri 

Cm~  Ey  171  j  ’ 

which  is  also  valid.  Each  term  in  the  sum  on  top  contains  a  vector 
multiplied  by  a  scalar,  which  gives  a  vector.  Adding  up  all  these 
vectors  gives  a  vector,  and  dividing  by  the  scalar  sum  on  the  bot¬ 
tom  gives  another  vector. 

This  kind  of  wave-the-magic-wand-and-write-it-all-in-bold-face  tech¬ 
nique  will  always  give  the  right  generalization  from  one  dimension 
to  three,  provided  that  the  result  makes  sense  mathematically  — 
if  you  find  yourself  doing  something  nonsensical,  such  as  adding 
a  scalar  to  a  vector,  then  you  haven’t  found  the  generalization 
correctly. 

Colliding  coins  example  67 

t>  Take  two  identical  coins,  put  one  down  on  a  piece  of  paper,  and 
slide  the  other  across  the  paper,  shooting  it  fairly  rapidly  so  that 
it  hits  the  target  coin  off-center.  If  you  trace  the  initial  and  final 
positions  of  the  coins,  you  can  determine  the  directions  of  their 
momentum  vectors  after  the  collision.  The  angle  between  these 
vectors  is  always  fairly  close  to,  but  a  little  less  than,  90  degrees. 
Why  is  this? 

>  Let  the  velocity  vector  of  the  incoming  coin  be  a,  and  let  the  two 
outgoing  velocity  vectors  be  b  and  c.  Since  the  masses  are  the 
same,  conservation  of  momentum  amounts  to  a  =  b  +  c,  which 
means  that  it  has  to  be  possible  to  assemble  the  three  vectors 
into  a  triangle.  If  we  assume  that  no  energy  is  converted  into 
heat  and  sound,  then  conservation  of  energy  gives  (discarding 
the  common  factor  of  m/2)  a2  =  b2  +  c2  for  the  magnitudes  of  the 
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three  vectors.  This  is  the  Pythagorean  theorem,  which  will  hold 
only  if  the  three  vectors  form  a  right  triangle. 

The  fact  that  we  observe  the  angle  to  be  somewhat  less  than 
90  degrees  shows  that  the  assumption  used  in  the  proof  is  only 
approximately  valid:  a  little  energy  is  converted  into  heat  and 
sound.  The  opposite  case  would  be  a  collision  between  two  blobs 
of  putty,  where  the  maximum  possible  amount  of  energy  is  con¬ 
verted  into  heat  and  sound,  the  two  blobs  fly  off  together,  giving 
an  angle  of  zero  between  their  momentum  vectors.  The  real-life 
experiment  interpolates  between  the  ideal  extremes  of  0  and  90 
degrees,  but  comes  much  closer  to  90. 

Force  is  a  vector,  and  we  add  force  vectors  when  more  than  one 
force  acts  on  the  same  object. 

Pushing  a  block  up  a  ramp  example  68 

>  Figure  t/1  shows  a  block  being  pushed  up  a  frictionless  ramp 
at  constant  speed  by  an  applied  force  Fa.  Flow  much  force  is 
required,  in  terms  of  the  block’s  mass,  m,  and  the  angle  of  the 
ramp,  0? 

o  We  analyzed  this  simple  machine  in  example  38  on  page  168 
using  the  concept  of  work.  Here  we’ll  do  it  using  vector  addition 
of  forces.  Figure  t/2  shows  the  other  two  forces  acting  on  the 
block:  a  normal  force,  Fn,  created  by  the  ramp,  and  the  gravita¬ 
tional  force,  Fg.  Because  the  block  is  being  pushed  up  at  constant 
speed,  it  has  zero  acceleration,  and  the  total  force  on  it  must  be 
zero.  In  figure  t/3,  we  position  all  the  force  vectors  tip-to-tail  for 
addition.  Since  they  have  to  add  up  to  zero,  they  must  join  up 
without  leaving  a  gap,  so  they  form  a  triangle.  Using  trigonometry 
we  find 


Fa  =  Fg  sin  0 
=  mg  sin  0. 


Buoyancy ;  again  example  69 

In  example  10  on  page  85,  we  found  that  the  energy  required  to 
raise  a  cube  immersed  in  a  fluid  is  as  if  the  cube’s  mass  had  been 
reduced  by  an  amount  equal  to  the  mass  of  the  fluid  that  other¬ 
wise  would  have  been  in  the  volume  it  occupies  (Archimedes’ 
principle).  From  the  energy  perspective,  this  effect  occurs  be¬ 
cause  raising  the  cube  allows  a  certain  amount  of  fluid  to  move 
downward,  and  the  decreased  gravitational  energy  of  the  fluid 
tends  to  offset  the  increased  gravitational  energy  of  the  cube.  The 
proof  given  there,  however,  could  not  easily  be  extended  to  other 
shapes. 

Thinking  in  terms  of  force  rather  than  energy,  it  becomes  easier 
to  give  a  proof  that  works  for  any  shape.  A  certain  upward  force  is 


u  /  Archimedes’  principle  works 
regardless  of  whether  the  object 
is  a  cube.  The  fluid  makes  a  force 
on  every  square  millimeter  of  the 
object’s  surface. 
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needed  to  support  the  object  in  figure  u.  If  this  force  was  applied, 
then  the  object  would  be  in  equilibrium:  the  vector  sum  of  all  the 
forces  acting  on  it  would  be  zero.  These  forces  are  Fa,  the  upward 
force  just  mentioned,  Fg,  the  downward  force  of  gravity,  and  Ff, 
the  total  force  from  the  fluid: 


Fa  +  Fg  +  Ff-0 

Since  the  fluid  is  under  more  pressure  at  a  greater  depth,  the  part 
of  the  fluid  underneath  the  object  tends  to  make  more  force  than 
the  part  above,  so  the  fluid  tends  to  help  support  the  object. 


v  /  Example  70. 


w  /  An  artist’s  rendering  of 
what  Cosmos  1  would  have 
looked  like  in  orbit. 


Now  suppose  the  object  was  removed,  and  instantly  replaced 
with  an  equal  volume  of  fluid.  The  new  fluid  would  be  in  equi¬ 
librium  without  any  force  applied  to  hold  it  up,  so 

Fgf+Ff  =  0, 

where  Fgf,  the  weight  of  the  fluid,  is  not  the  same  as  Fg,  the 
weight  of  the  object,  but  Ff  is  the  same  as  before,  since  the  pres¬ 
sure  of  the  surrounding  fluid  is  the  same  as  before  at  any  partic¬ 
ular  depth.  We  therefore  have 

Fa  =  -  (Fg  -  Fg,)  , 

which  is  Archimedes’  principle  in  terms  of  force:  the  force  re¬ 
quired  to  support  the  object  is  lessened  by  an  amount  equal  to 
the  weight  of  the  fluid  that  would  have  occupied  its  volume. 

By  the  way,  the  word  “pressure”  that  I  threw  around  casually 
in  the  preceding  example  has  a  precise  technical  definition:  force 
per  unit  area.  The  SI  units  of  pressure  are  N/m2,  which  can  be 
abbreviated  as  pascals,  1  Pa  =  1  N/m2.  Atmospheric  pressure  is 
about  100  kPa.  By  applying  the  equation  F9  +  F j  =  0  to  the  top 
and  bottom  surfaces  of  a  cubical  volume  of  fluid,  one  can  easily 
prove  that  the  difference  in  pressure  between  two  different  depths 
is  A P  =  pgAy.  (In  physics,  “fluid”  can  refer  to  either  a  gas  or  a 
liquid.)  Pressure  is  discussed  in  more  detail  in  chapter  5. 

A  solar  sail  example  70 

A  solar  sail,  figure  v/1 ,  allows  a  spacecraft  to  get  its  thrust  without 
using  internal  stores  of  energy  or  having  to  carry  along  mass  that 
it  can  shove  out  the  back  like  a  rocket.  Sunlight  strikes  the  sail 
and  bounces  off,  transferring  momentum  to  the  sail.  A  working 
30-meter-diameter  solar  sail,  Cosmos  1 ,  was  built  by  an  American 
company,  and  was  supposed  to  be  launched  into  orbit  aboard  a 
Russian  booster  launched  from  a  submarine,  but  launch  attempts 
in  2001  and  2005  both  failed. 

In  this  example,  we  will  calculate  the  optimal  orientation  of  the 
sail,  assuming  that  “optimal”  means  changing  the  vehicle’s  en¬ 
ergy  as  rapidly  as  possible.  For  simplicity,  we  model  the  compli¬ 
cated  shape  of  the  sail’s  surface  as  a  disk,  seen  edge-on  in  fig¬ 
ure  v/2,  and  we  assume  that  the  craft  is  in  a  nearly  circular  orbit 
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around  the  sun,  hence  the  90-degree  angle  between  the  direction 
of  motion  and  the  incoming  sunlight.  We  assume  that  the  sail  is 
100%  reflective.  The  orientation  of  the  sail  is  specified  using  the 
angle  0  between  the  incoming  rays  of  sunlight  and  the  perpen¬ 
dicular  to  the  sail.  In  other  words,  0=0  if  the  sail  is  catching  the 
sunlight  full-on,  while  0=90°  means  that  the  sail  is  edge-on  to  the 
sun. 

Conservation  of  momentum  gives 


Plight, i  ~  Plight, f  +  ^Psaih 


where  Apsa//  is  the  change  in  momentum  picked  up  by  the  sail. 
Breaking  this  down  into  components,  we  have 

0  =  Piight,f,x  +  Apsa,/X  and 
Plight,  i,y  =  Plight,  f,y  +  ^ Psail,y ■ 

As  in  example  53  on  page  189,  the  component  of  the  force  that 
is  directly  away  from  the  sun  (up  in  figure  v/2)  doesn’t  change  the 
energy  of  the  craft,  so  we  only  care  about  Apsa//x,  which  equals 
-Plight, t,x-  The  outgoing  light  ray  forms  an  angle  of  20  with  the 
negative  y  axis,  or  27O°-20  measured  counterclockwise  from  the 
x  axis,  so  the  useful  thrust  depends  on  -cos(270°  -  20)  =  sin  20. 

However,  this  is  all  assuming  a  given  amount  of  light  strikes  the 
sail.  During  a  certain  time  period,  the  amount  of  sunlight  striking 
the  sail  depends  on  the  cross-sectional  area  the  sail  presents  to 
the  sun,  which  is  proportional  to  cos  0.  For  0=90°,  cos  0  equals 
zero,  since  the  sail  is  edge-on  to  the  sun. 

Putting  together  these  two  factors,  the  useful  thrust  is  proportional 
to  sin  20  cos0,  and  this  quantity  is  maximized  for  0  «  35°.  A  coun¬ 
terintuitive  fact  about  this  maneuver  is  that  as  the  spacecraft  spi¬ 
rals  outward,  its  total  energy  (kinetic  plus  gravitational)  increases, 
but  its  kinetic  energy  actually  decreases! 

A  layback  example  71 

The  figure  shows  a  rock  climber  using  a  technique  called  a  lay- 
back.  He  can  make  the  normal  forces  FWi  and  FW2  large,  which 
has  the  side-effect  of  increasing  the  frictional  forces  FF1  and  FF2, 
so  that  he  doesn’t  slip  down  due  to  the  gravitational  (weight)  force 
Fw.  The  purpose  of  the  problem  is  not  to  analyze  all  of  this  in  de¬ 
tail,  but  simply  to  practice  finding  the  components  of  the  forces 
based  on  their  magnitudes.  To  keep  the  notation  simple,  let’s 
write  F/vi  for  |FW1  |,  etc.  The  crack  overhangs  by  a  small,  positive 
angle  0  «  9°. 

In  this  example,  we  determine  the  x  component  of  FWi .  The  other 
nine  components  are  left  as  an  exercise  to  the  reader  (problem 
81,  p.  236). 
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The  easiest  method  is  the  one  demonstrated  in  example  62  on 
p.  199.  Casting  vector  FWi’s  shadow  on  the  ground,  we  can  tell 
that  it  would  point  to  the  left,  so  its  x  component  is  negative.  The 
only  two  possibilities  for  its  x  component  are  therefore  -FN i  cos  0 
or  -Fa/i  sin  0.  We  expect  this  force  to  have  a  large  x  component 
and  a  much  smaller  y.  Since  0  is  small,  cos  0^1,  while  sin  0  is 
small.  Therefore  the  x  component  must  be  —F^  cos  0. 


x  /  Example  71  and  problem  81  on  p.  236. 


Discussion  Questions 

A  An  object  goes  from  one  point  in  space  to  another.  After  it  arrives  at 
its  destination,  how  does  the  magnitude  of  its  Ar  vector  compare  with  the 
distance  it  traveled? 

B  In  several  examples,  I’ve  dealt  with  vectors  having  negative  com¬ 
ponents.  Does  it  make  sense  as  well  to  talk  about  negative  and  positive 
vectors? 

C  If  you’re  doing  graphical  addition  of  vectors,  does  it  matter  which 
vector  you  start  with  and  which  vector  you  start  from  the  other  vector’s 
tip? 

D  If  you  add  a  vector  with  magnitude  1  to  a  vector  of  magnitude  2, 
what  magnitudes  are  possible  for  the  vector  sum? 

E  Which  of  these  examples  of  vector  addition  are  correct,  and  which 
are  incorrect? 

y  /  Discussion  question  E.  F  Is  it  possible  for  an  airplane  to  maintain  a  constant  velocity  vector 

but  not  a  constant  |v|?  How  about  the  opposite  -  a  constant  |v|  but  not  a 
constant  velocity  vector?  Explain. 
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G  New  York  and  Rome  are  at  about  the  same  latitude,  so  the  earth’s 
rotation  carries  them  both  around  nearly  the  same  circle.  Do  the  two  cities 
have  the  same  velocity  vector  (relative  to  the  center  of  the  earth)?  If  not, 
is  there  any  way  for  two  cities  to  have  the  same  velocity  vector? 

H  The  figure  shows  a  roller  coaster  car  rolling  down  and  then  up  under 
the  influence  of  gravity.  Sketch  the  car’s  velocity  vectors  and  acceleration 
vectors.  Pick  an  interesting  point  in  the  motion  and  sketch  a  set  of  force 
vectors  acting  on  the  car  whose  vector  sum  could  have  resulted  in  the 
right  acceleration  vector. 

I  The  following  is  a  question  commonly  asked  by  students: 

“Why  does  the  force  vector  always  have  to  point  in  the  same  direction 
as  the  acceleration  vector?  What  if  you  suddenly  decide  to  change  your 
force  on  an  object,  so  that  your  force  is  no  longer  pointing  in  the  same 
direction  that  the  object  is  accelerating?” 

What  misunderstanding  is  demonstrated  by  this  question?  Suppose,  for 
example,  a  spacecraft  is  blasting  its  rear  main  engines  while  moving  for¬ 
ward,  then  suddenly  begins  firing  its  sideways  maneuvering  rocket  as 
well.  What  does  the  student  think  Newton’s  laws  are  predicting? 

J  Debug  the  following  incorrect  solutions  to  this  vector  addition  prob¬ 
lem. 

Problem'.  Freddi  Fish™  swims  5.0  km  northeast,  and  then  12.0  km  in  the 
direction  55  degrees  west  of  south.  How  far  does  she  end  up  from  her 
starting  point,  and  in  what  direction  is  she  from  her  starting  point? 

Incorrect  solution  #1: 

5.0  km+12.0  km=17.0  km 

Incorrect  solution  #2: 

i/(5.0  km)2 +  (12.0  km)2=13.0  km 

Incorrect  solution  #3: 

Let  A  and  B  be  her  two  Ar  vectors,  and  let  C  =  A  +  B.  Then 


z  /  Discussion  question  H. 


Ax  =  (5.0  km)  cos  45°  =  3.5  km 
Bx  =  (12.0  km)  cos  55°  =  6.9  km 
Ay  =  (5.0  km)  sin  45°  =  3.5  km 
By  =  (1 2.0  km)  sin  55°  =  9.8  km 
Cx  =  Ax  +  Bx 
=  10.4  km 

Cy  =  Ay  +  By 

=  13.3  km 

|C|  =  y/Ct  +  C$ 

=  16.9  km 

direction  =  tan-1  (13.3/1 0.4) 

=  52°  north  of  east 
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Incorrect  solution  #4: 
(same  notation  as  above) 


Ax  =  (5.0  km)  cos  45°  =  3.5  km 
Bx  =  -( 1 2.0  km)  cos 55°  =  -6.9  km 
Ay  =  (5.0  km)  sin  45°  =  3.5  km 
By  =  -(1 2.0  km)  sin  55°  =  -9.8  km 
Cx  =  Ax  +  Bx 
=  -3.4  km 

Cy  =  Ay  +  By 

=  -6.3  km 

\C\  =  yJC$  +  C$ 

=  7.2  km 

direction  =  tan-1  (-6.3/  -  3.4) 

=  62°  north  of  east 


aa  /  Visualizing  the  accelera¬ 
tion  vector. 


Incorrect  solution  #5: 
(same  notation  as  above) 


Ax  =  (5.0  km)  cos  45°  =  3.5  km 
Bx  =  -(1 2.0  km)  sin  55°  =  -9.8  km 
Ay  =  (5.0  km)  sin  45°  =  3.5  km 
By  =  -( 1 2.0  km)  cos 55°  =  -6.9  km 
Cx  =  Ax  +  Bx 
=  -6.3  km 

Cy  =  Ay  +  By 

=  -3.4  km 
|C|  =  y/C$  +  C$ 

=  7.2  km 

direction  =  tan-1  (-3.4/  -  6.3) 

=  28°  north  of  east 


3.4.4  Calculus  with  vectors 


Differentiation 

In  one  dimension,  we  define  the  velocity  as  the  derivative  of  the 
position  with  respect  to  time,  and  we  can  think  of  the  derivative  as 
what  we  get  when  we  calculate  Ax/ At  for  very  short  time  intervals. 
The  quantity  Ax  =  Xf  —  Xi  is  calculated  by  subtraction.  In  three 
dimensions,  x  becomes  r,  and  the  Ar  vector  is  calculated  by  vector 
subtraction,  Ar  =  rj  —  r,.  Vector  subtraction  is  defined  component 
by  component,  so  when  we  take  the  derivative  of  a  vector,  this  means 
we  end  up  taking  the  derivative  component  by  component, 


Vx 


dx  d  y  d  z 

dt  ’  v  dt  ’  dt 
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or 

dr  dx  „  d |/A  dz  ^ 
dt=d(x+d*y+dt^ 

All  of  this  reasoning  applies  equally  well  to  any  derivative  of  a  vector, 
so  for  instance  we  can  take  the  second  derivative, 


CLqr>  - 


dvx 

At 


or 


dv  dvr 


AVy 

At 
Avv 


az  = 


Avz 


d  vz 
At 


37  =  77x+  37y  +  3fz- 


At  At 


At 


At 


A  counterintuitive  consequence  of  this  is  that  the  acceleration 
vector  does  not  need  to  be  in  the  same  direction  as  the  motion.  The 
velocity  vector  points  in  the  direction  of  motion,  but  by  Newton’s 
second  law,  a  =  F/m,  the  acceleration  vector  points  in  the  same  di¬ 
rection  as  the  force,  not  the  motion.  This  is  easiest  to  understand  if 
we  take  velocity  vectors  from  two  different  moments  in  the  motion, 
and  visualize  subtracting  them  graphically  to  make  a  Av  vector. 
The  direction  of  the  Av  vector  tells  us  the  direction  of  the  accelera¬ 
tion  vector  as  well,  since  the  derivative  dv/  dt  can  be  approximated 
as  Av/A t.  As  shown  in  figure  aa/1,  a  change  in  the  magnitude  of 
the  velocity  vector  implies  an  acceleration  that  is  in  the  direction  of 
motion.  A  change  in  the  direction  of  the  velocity  vector  produces 
an  acceleration  perpendicular  to  the  motion,  aa/2. 

Circular  motion  example  72 

>  An  object  moving  in  a  circle  of  radius  r  in  the  x-y  plane  has 


x  =  r  cos  c vt  and 

y  =  r  sin  t vt, 


where  cu  is  the  number  of  radians  traveled  per  second,  and  the 
positive  or  negative  sign  indicates  whether  the  motion  is  clock¬ 
wise  or  counterclockwise.  What  is  its  acceleration? 

o  The  components  of  the  velocity  are 

vx  =  -cur  sin  cut  and 
vy  =  cur  cos  c vt, 

and  for  the  acceleration  we  have 

ax  =  -cu2r  cos  w t  and 

ay  =  -c u2r  sin  c vt. 

The  acceleration  vector  has  cosines  and  sines  in  the  same  places 
as  the  r  vector,  but  with  minus  signs  in  front,  so  it  points  in  the  op¬ 
posite  direction,  i.e. ,  toward  the  center  of  the  circle.  By  Newton’s 
second  law,  a=F 7m,  this  shows  that  the  force  must  be  inward  as 
well;  without  this  force,  the  object  would  fly  off  straight. 


ft*** 


ab  /  This  figure  shows  an  in¬ 
tuitive  justification  for  the  fact 
proved  mathematically  in  the 
example,  that  the  direction  of  the 
force  and  acceleration  in  circular 
motion  is  inward.  The  heptagon, 
2,  is  a  better  approximation  to 
a  circle  than  the  triangle,  1.  To 
make  an  infinitely  good  approx¬ 
imation  to  circular  motion,  we 
would  need  to  use  an  infinitely 
large  number  of  infinitesimal  taps, 
which  would  amount  to  a  steady 
inward  force. 
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The  magnitude  of  the  acceleration  is 


a 


a2x  +  a2v 


2 

c irr. 


ac  /  The  total  force  in  the 
forward-backward  direction  is 
zero  in  both  cases. 


ad  /  There  is  no  outward  force 
on  the  bowling  ball,  but  in  the 
noninertial  frame  it  seems  like 
one  exists. 


It  makes  sense  that  tu  is  squared,  since  reversing  the  sign  of  cu 
corresponds  to  reversing  the  direction  of  motion,  but  the  acceler¬ 
ation  is  toward  the  center  of  the  circle,  regardless  of  whether  the 
motion  is  clockwise  or  counterclockwise.  This  result  can  also  be 
rewritten  in  the  form 


Although  I’ve  relegated  the  results  a  =  uj2r  =  |v|2/r  to  an  ex¬ 
ample  because  they  are  a  straightforward  corollary  of  more  general 
principles  already  developed,  they  are  important  and  useful  enough 
to  record  for  later  use.  These  results  are  counterintuitive  as  well. 
Until  Newton,  physicists  and  laypeople  alike  had  assumed  that  the 
planets  would  need  a  force  to  push  them  forward  in  their  orbits. 
Figure  ab  may  help  to  make  it  more  plausible  that  only  an  inward 
force  is  required.  A  forward  force  might  be  needed  in  order  to  cancel 
out  a  backward  force  such  as  friction,  ac,  but  the  total  force  in  the 
forward-backward  direction  needs  to  be  exactly  zero  for  constant- 
speed  motion.  When  you  are  in  a  car  undergoing  circular  motion, 
there  is  also  a  strong  illusion  of  an  outward  force.  But  what  object 
could  be  making  such  a  force?  The  car’s  seat  makes  an  inward  force 
on  you,  not  an  outward  one.  There  is  no  object  that  could  be  ex¬ 
erting  an  outward  force  on  your  body.  In  reality,  this  force  is  an 
illusion  that  comes  from  our  brain’s  intuitive  efforts  to  interpret  the 
situation  within  a  noninertial  frame  of  reference.  As  shown  in  figure 
ad,  we  can  describe  everything  perfectly  well  in  an  inertial  frame 
of  reference,  such  as  the  frame  attached  to  the  sidewalk.  In  such  a 
frame,  the  bowling  ball  goes  straight  because  there  is  no  force  on  it. 
The  wall  of  the  truck’s  bed  hits  the  ball,  not  the  other  way  around. 


Integration 

An  integral  is  really  just  a  sum  of  many  infinitesimally  small 
terms.  Since  vector  addition  is  defined  in  terms  of  addition  of  the 
components,  an  integral  of  a  vector  quantity  is  found  by  doing  inte¬ 
grals  component  by  component. 

Projectile  motion  example  73 

t>  Find  the  motion  of  an  object  whose  acceleration  vector  is  con¬ 
stant,  for  instance  a  projectile  moving  under  the  influence  of  grav¬ 
ity. 

>  We  integrate  the  acceleration  to  get  the  velocity,  and  then  inte¬ 
grate  the  velocity  to  get  the  position  as  a  function  of  time.  Doing 
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this  to  the  x  component  of  the  acceleration,  we  find 


ax  dfj  df 

=  J  (axt  +  vxo)  df, 

where  vx0  is  a  constant  of  integration,  and 

1  ? 

x=  -axr  +  vx0t  +  x0. 

Similarly,  y=(M2)ayt2 +  vy0t+y0  and  z  =  (1/2 )azt2+vz0t+z0.  Once 
one  has  gained  a  little  confidence,  it  becomes  natural  to  do  the 
whole  thing  as  a  single  vector  integral, 

adf^j  df 

=  J  (at  +  v0)  df 

1  2 

=  -ar +v0f  +  r0, 

where  now  the  constants  of  integration  are  vectors. 

Discussion  Questions 

A  In  the  game  of  crack  the  whip,  a  line  of  people  stand  holding  hands, 
and  then  they  start  sweeping  out  a  circle.  One  person  is  at  the  center, 
and  rotates  without  changing  location.  At  the  opposite  end  is  the  person 
who  is  running  the  fastest,  in  a  wide  circle.  In  this  game,  someone  always 
ends  up  losing  their  grip  and  flying  off.  Suppose  the  person  on  the  end 
loses  her  grip.  What  path  does  she  follow  as  she  goes  flying  off?  Draw  an 
overhead  view.  (Assume  she  is  going  so  fast  that  she  is  really  just  trying 
to  put  one  foot  in  front  of  the  other  fast  enough  to  keep  from  falling;  she 
is  not  able  to  get  any  significant  horizontal  force  between  her  feet  and  the 
ground.) 

B  Suppose  the  person  on  the  outside  is  still  holding  on,  but  feels  that 
she  may  loose  her  grip  at  any  moment.  What  force  or  forces  are  acting 
on  her,  and  in  what  directions  are  they?  (We  are  not  interested  in  the 
vertical  forces,  which  are  the  earth’s  gravitational  force  pulling  down,  and 
the  ground’s  normal  force  pushing  up.)  Make  a  table  in  the  format  shown 
in  subsection  3.2.6. 

C  Suppose  the  person  on  the  outside  is  still  holding  on,  but  feels  that 
she  may  loose  her  grip  at  any  moment.  What  is  wrong  with  the  following 
analysis  of  the  situation?  “The  person  whose  hand  she’s  holding  exerts 
an  inward  force  on  her,  and  because  of  Newton’s  third  law,  there’s  an 
equal  and  opposite  force  acting  outward.  That  outward  force  is  the  one 
she  feels  throwing  her  outward,  and  the  outward  force  is  what  might  make 
her  go  flying  off,  if  it’s  strong  enough.” 

D  If  the  only  force  felt  by  the  person  on  the  outside  is  an  inward  force, 
why  doesn’t  she  go  straight  in? 
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E  In  the  amusement  park  ride  shown  in  the  figure,  the  cylinder  spins 
faster  and  faster  until  the  customer  can  pick  her  feet  up  off  the  floor  with¬ 
out  falling.  In  the  old  Coney  Island  version  of  the  ride,  the  floor  actually 
dropped  out  like  a  trap  door,  showing  the  ocean  below.  (There  is  also  a 
version  in  which  the  whole  thing  tilts  up  diagonally,  but  we’re  discussing 
the  version  that  stays  flat.)  If  there  is  no  outward  force  acting  on  her,  why 
does  she  stick  to  the  wall?  Analyze  all  the  forces  on  her. 

F  What  is  an  example  of  circular  motion  where  the  inward  force  is  a 
normal  force?  What  is  an  example  of  circular  motion  where  the  inward 
force  is  friction?  What  is  an  example  of  circular  motion  where  the  inward 
force  is  the  sum  of  more  than  one  force? 

G  Does  the  acceleration  vector  always  change  continuously  in  circular 
motion?  The  velocity  vector? 

H  A  certain  amount  of  force  is  needed  to  provide  the  acceleration 
of  circular  motion.  What  if  we  are  exerting  a  force  perpendicular  to  the 
direction  of  motion  in  an  attempt  to  make  an  object  trace  a  circle  of  radius 
r,  but  the  force  isn’t  as  big  as  m\v\2/r ? 

I  Suppose  a  rotating  space  station  is  built  that  gives  its  occupants  the 
illusion  of  ordinary  gravity.  What  happens  when  a  person  in  the  station 
lets  go  of  a  ball?  What  happens  when  she  throws  a  ball  straight  “up”  in 
the  air  (i.e.,  towards  the  center)? 

3.4.5  The  dot  product 

How  would  we  generalize  the  mechanical  work  equation  d E  = 
F  dx  to  three  dimensions?  Energy  is  a  scalar,  but  force  and  distance 
are  vectors,  so  it  might  seem  at  first  that  the  kind  of  “magic-wand” 
generalization  discussed  on  page  202  failed  here,  since  we  don’t  know 
of  any  way  to  multiply  two  vectors  together  to  get  a  scalar.  Actually, 
this  is  Nature  giving  us  a  hint  that  there  is  such  a  multiplication 
operation  waiting  for  us  to  invent  it,  and  since  Nature  is  simple,  we 
can  be  assured  that  this  operation  will  work  just  fine  in  any  situation 
where  a  similar  generalization  is  required. 

How  should  this  operation  be  defined?  Let’s  consider  what  we 
would  get  by  performing  this  operation  on  various  combinations  of 
the  unit  vectors  x,  y,  and  z.  The  conventional  notation  for  the 
operation  is  to  put  a  dot,  •,  between  the  two  vectors,  and  the  op¬ 
eration  is  therefore  called  the  dot  product.  Rotational  invariance 
requires  that  we  handle  the  three  coordinate  axes  in  the  same  way, 
without  giving  special  treatment  to  any  of  them,  so  we  must  have 
x-x  =  y'y  =  z-  z  and  x-y  =  y-  z  =  z-x.  This  is  supposed 
to  be  a  way  of  generalizing  ordinary  multiplication,  so  for  consis¬ 
tency  with  the  property  1  X  1  =  1  of  ordinary  numbers,  the  result 
of  multiplying  a  magnitude-one  vector  by  itself  had  better  be  the 
scalar  1,  sox-x  =  y-  y  =  z-  z  =  1.  Furthermore,  there  is  no  way 
to  satisfy  rotational  invariance  unless  we  define  the  mixed  products 
to  be  zero,  x-y  =  y-  z  =  z-  x  =  0;  for  example,  a  90-degree  ro¬ 
tation  of  our  frame  of  reference  about  the  z  axis  reverses  the  sign 
of  x  •  y,  but  rotational  invariance  requires  that  x  •  y  produce  the 
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same  result  either  way,  and  zero  is  the  only  number  that  stays  the 
same  when  we  reverse  its  sign.  Establishing  these  six  products  of 
unit  vectors  suffices  to  define  the  operation  in  general,  since  any 
two  vectors  that  we  want  to  multiply  can  be  broken  down  into  com¬ 
ponents,  e.g.,  (2x  +  3z)  -  z  =  2x-z  +  3z-z  =  0-|-3  =  3.  Thus 
by  requiring  rotational  invariance  and  consistency  with  multiplica¬ 
tion  of  ordinary  numbers,  we  find  that  there  is  only  one  possible 
way  to  define  a  multiplication  operation  on  two  vectors  that  gives  a 
scalar  as  the  result.1'  The  dot  product  has  all  of  the  properties  we 
normally  associate  with  multiplication,  except  that  there  is  no  “dot 
division.” 

Dot  product  in  terms  of  components  example  74 

If  we  know  the  components  of  any  two  vectors  b  and  c,  we  can 
find  their  dot  product: 

b  c  =  (bxx  +  by<j  +  bz z)  •  (cxx  +  cyy  +  cz z) 

=  b  X  Cx  +  by  Cy  +  b  /  Cz  ■ 


Magnitude  expressed  with  a  dot  product  example  75 

If  we  take  the  dot  product  of  any  vector  b  with  itself,  we  find 

b  b  =  (bxx  +  by<j  +  bz z)  •  [bxx  +  by y  +  bz z) 

=  b2  +  by  +  b2z, 


so  its  magnitude  can  be  expressed  as 


|b|  =  Vb  b. 

We  will  often  write  b2  to  mean  b  •  b,  when  the  context  makes 
it  clear  what  is  intended.  For  example,  we  could  express  kinetic 
energy  as  (1/2)m|v|2,  (1/2)mv-v,  or  (1/2)mv2.  In  the  third  version, 
nothing  but  context  tells  us  that  v  really  stands  for  the  magnitude 
of  some  vector  v. 

Geometric  interpretation  example  76 

In  figure  ag,  vectors  a,  b,  and  c  represent  the  sides  of  a  triangle, 
and  a  =  b  +  c.  The  law  of  cosines  gives 

|c|2  =  |a|2  +  |b|2  -  2ja||b|  cos  0. 

Using  the  result  of  example  75,  we  can  also  write  this  as 

|c|2  =  c  c 

=  (a  -  b)  •  (a  -  b) 

=  aa  +  bb-2ab. 


ag/The  geometric  interpre¬ 
tation  of  the  dot  product. 


17There  is,  however,  a  different  operation,  discussed  in  the  next  chapter,  which 
multiplies  two  vectors  to  give  a  vector. 
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Matching  up  terms  in  these  two  expressions,  we  find 


a  b  =  |a||b|  cos  0, 


ah  /  Breaking  trail,  by  Walter 
E.  Bohl.  The  pack  horse  is  not 
doing  any  work  on  the  pack, 
because  the  pack  is  moving  in  a 
horizontal  line  at  constant  speed, 
and  therefore  there  is  no  kinetic 
or  gravitational  energy  being 
transferred  into  or  out  of  it. 


which  is  a  geometric  interpretation  for  the  dot  product. 

The  result  of  example  76  is  very  useful.  It  gives  us  a  way  to 
find  the  angle  between  two  vectors  if  we  know  their  components.  It 
can  be  used  to  show  that  the  dot  product  of  any  two  perpendicular 
vectors  is  zero.  It  also  leads  to  a  nifty  proof  that  the  dot  product 
is  rotationally  invariant  —  up  until  now  I’ve  only  proved  that  if  a 
rotationally  invariant  product  exists,  the  dot  product  is  it  —  because 
angles  and  lengths  aren’t  affected  by  a  rotation,  so  the  right  side  of 
the  equation  is  rotationally  invariant,  and  therefore  so  is  the  left 
side. 

I  introduced  the  whole  discussion  of  the  dot  product  by  way  of 
generalizing  the  equation  d E  =  F  dx  to  three  dimensions.  In  terms 
of  a  dot  product,  we  have 


d E  =  F  •  dr. 


If  F  is  a  constant,  integrating  both  sides  gives 


AE  =  F  ■  Ar. 


(If  that  step  seemed  like  black  magic,  try  writing  it  out  in  terms 
of  components.)  If  the  force  is  perpendicular  to  the  motion,  as  in 
figure  ah,  then  the  work  done  is  zero.  The  pack  horse  is  doing  work 
within  its  own  body,  but  is  not  doing  work  on  the  pack. 

Pushing  a  lawnmower  example  77 

>  I  push  a  lawnmower  with  a  force  F=(1 1 0  N)x  -  (40  N)y,  and  the 
total  distance  I  travel  is  (100  m)x.  How  much  work  do  I  do? 

>  The  dot  product  is  1 1 000  N  •  m  =  1 1 000  J. 

A  good  application  of  the  dot  product  is  to  allow  us  to  write  a 
simple,  streamlined  proof  of  separate  conservation  of  the  momentum 
components.  (You  can  skip  the  proof  without  losing  the  continuity 
of  the  text.)  The  argument  is  a  generalization  of  the  one-dimensional 
proof  on  page  130,  and  makes  the  same  assumption  about  the  type 
of  system  of  particles  we’re  dealing  with.  The  kinetic  energy  of 
one  of  the  particles  is  (l/2)mv  •  v,  and  when  we  transform  into  a 
different  frame  of  reference  moving  with  velocity  u  relative  to  the 
original  frame,  the  one-dimensional  rule  v  — >  v  +  u  turns  into  vector 
addition,  v  — >  v  +  u.  In  the  new  frame  of  reference,  the  kinetic 
energy  is  (l/2)m(v  +  u)  •  (v  +  u).  For  a  system  of  n  particles,  we 
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have 


K  =  Y1  9mi(vi  +  u)  '  (VJ  +  u) 

3= 1 

n  n  n 

E^-vi-vi+2E  TOjVj  •  u  +  m^u  •  u 
j=i  i=i  i=i 


1 

2 


As  in  the  proof  on  page  130,  the  first  sum  is  simply  the  total  kinetic 
energy  in  the  original  frame  of  reference,  and  the  last  sum  is  a  con¬ 
stant,  which  has  no  effect  on  the  validity  of  the  conservation  law. 
The  middle  sum  can  be  rewritten  as 

n  n 

2  m jVj  •  u  =  2  u  •  ^2  mjVj 

3= 1  3=1 

n 

=  2u  Z]p  j’ 

3= 1 

so  the  only  way  energy  can  be  conserved  for  all  values  of  u  is  if  the 
vector  sum  of  the  momenta  is  conserved  as  well. 


3.4.6  Gradients  and  line  integrals  (optional) 

This  subsection  introduces  a  little  bit  of  vector  calculus.  It  can 
be  omitted  without  loss  of  continuity,  but  the  techniques  will  be 
needed  in  our  study  of  electricity  and  magnetism,  and  it  may  be 
helpful  to  be  exposed  to  them  in  easy-to-visualize  mechanical  con¬ 
texts  before  applying  them  to  invisible  electrical  and  magnetic  phe¬ 
nomena. 

In  physics  we  often  deal  with  fields  of  force,  meaning  situations 
where  the  force  on  an  object  depends  on  its  position.  For  instance, 
figure  ai  could  represent  a  map  of  the  trade  winds  affecting  a  sailing 
ship,  or  a  chart  of  the  gravitational  forces  experienced  by  a  space 
probe  entering  a  double-star  system.  An  object  moving  under  the 
influence  of  this  force  will  not  necessarily  be  moving  in  the  same 
direction  as  the  force  at  every  moment.  The  sailing  ship  can  tack 
against  the  wind,  due  to  the  force  from  the  water  on  the  keel.  The 
space  probe,  if  it  entered  from  the  top  of  the  diagram  at  high  speed, 
would  start  to  curve  around  to  the  right,  but  its  inertia  would  carry 
it  forward,  and  it  wouldn’t  instantly  swerve  to  match  the  direction 
of  the  gravitational  force.  For  convenience,  we’ve  defined  the  gravi¬ 
tational  field,  g,  as  the  force  per  unit  mass,  but  that  trick  only  leads 
to  a  simplification  because  the  gravitational  force  on  an  object  is 
proportional  to  its  mass.  Since  this  subsection  is  meant  to  apply 
to  any  kind  of  force,  we’ll  discuss  everything  in  terms  of  the  actual 
force  vector,  F,  in  units  of  newtons. 

If  an  object  moves  through  the  field  of  force  along  some  curved 
path  from  point  ri  to  point  r2,  the  force  will  do  a  certain  amount 


ai  /  An  object  moves  through 
a  field  of  force. 
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of  work  on  it.  To  calculate  this  work,  we  can  break  the  path  up 
into  infinitesimally  short  segments,  find  the  work  done  along  each 
segment,  and  add  them  all  up.  For  an  object  traveling  along  a  nice 
straight  x  axis,  we  use  the  symbol  dx  to  indicate  the  length  of  any 
infinitesimally  short  segment.  In  three  dimensions,  moving  along  a 
curve,  each  segment  is  a  tiny  vector  dr  =  xdx  +  y  dy  +  zd z.  The 
work  theorem  can  be  expressed  as  a  dot  product,  so  the  work  done 
along  a  segment  is  F  •  dr.  We  want  to  integrate  this,  but  we  don’t 
know  how  to  integrate  with  respect  to  a  variable  that’s  a  vector, 
so  let’s  define  a  variable  s  that  indicates  the  distance  traveled  so 
far  along  the  curve,  and  integrate  with  respect  to  it  instead.  The 
expression  F  •  dr  can  be  rewritten  as  |F|  |  dr|  cos  6,  where  6  is  the 
angle  between  F  and  dr.  But  |  dr|  is  simply  ds,  so  the  amount  of 
work  done  becomes 

rr  2 

A E  =  /  |F|  cos  9  ds. 

J  ri 

Both  F  and  9  are  functions  of  s.  As  a  matter  of  notation,  it’s 
cumbersome  to  have  to  write  the  integral  like  this.  Vector  notation 
was  designed  to  eliminate  this  kind  of  drudgery.  We  therefore  define 
the  line  integral 


as  a  way  of  notating  this  type  of  integral.  The  ‘C’  refers  to  the  curve 
along  which  the  object  travels.  If  we  don’t  know  this  curve  then  we 
typically  can’t  evaluate  the  line  integral  just  by  knowing  the  initial 
and  final  positions  ri  and  r2- 

The  basic  idea  of  calculus  is  that  integration  undoes  differen¬ 
tiation,  and  vice-versa.  In  one  dimension,  we  could  describe  an 
interaction  either  in  terms  of  a  force  or  in  terms  of  an  interaction 
energy.  We  could  integrate  force  with  respect  to  position  to  find 
minus  the  energy,  or  we  could  find  the  force  by  taking  minus  the 
derivative  of  the  energy.  In  the  line  integral,  position  is  represented 
by  a  vector.  What  would  it  mean  to  take  a  derivative  with  respect 
to  a  vector?  The  correct  way  to  generalize  the  derivative  d U /  dx  to 
three  dimensions  is  to  replace  it  with  the  following  vector, 

dU  „  dU  „  dU „ 

d^x+d?y+d7z’ 

called  the  gradient  of  U,  and  written  with  an  upside-down  delta18 
like  this,  W.  Each  of  these  three  derivatives  is  really  what’s  known 
as  a  partial  derivative.  What  that  means  is  that  when  you’re  differ¬ 
entiating  U  with  respect  to  x,  you’re  supposed  to  treat  y  and  z  and 
constants,  and  similarly  when  you  do  the  other  two  derivatives.  To 
emphasize  that  a  derivative  is  a  partial  derivative,  it’s  customary  to 

18The  symbol  V  is  called  a  “nabla.”  Cool  word! 
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write  it  using  the  symbol  d  in  place  of  the  differential  d’s.  Putting 
all  this  notation  together,  we  have 

TT  dU  „  dU  A  dU  <  ri  c  . 

VU  =  — — x  +  — — y  +  — —  z  [definition  of  the  gradient  . 

dx  dydz 

The  gradient  looks  scary,  but  it  has  a  very  simple  physical  inter¬ 
pretation.  It’s  a  vector  that  points  in  the  direction  in  which  U  is 
increasing  most  rapidly,  and  it  tells  you  how  rapidly  U  is  increasing 
in  that  direction.  For  instance,  sperm  cells  in  plants  and  animals 
find  the  egg  cells  by  traveling  in  the  direction  of  the  gradient  of  the 
concentration  of  certain  hormones.  When  they  reach  the  location 
of  the  strongest  hormone  concentration,  they  find  their  destiny.  In 
terms  of  the  gradient,  the  force  corresponding  to  a  given  interaction 
energy  is  F  =  —'VU. 

Force  exerted  by  a  spring  example  78 

In  one  dimension,  Hooke’s  law  is  U  =  (1/2 )kx2.  Suppose  we 
tether  one  end  of  a  spring  to  a  post,  but  it’s  free  to  stretch  and 
swing  around  in  a  plane.  Let’s  say  its  equilibrium  length  is  zero, 
and  let’s  choose  the  origin  of  our  coordinate  system  to  be  at  the 
post.  Rotational  invariance  requires  that  its  energy  only  depend 
on  the  magnitude  of  the  r  vector,  not  its  direction,  so  in  two  di¬ 
mensions  we  have  U  =  (1/2)/cjr|2  =  (V2)k  (x2  +  y2).  The  force 
exerted  by  the  spring  is  then 

F  =  —VU 

dU  „  dll  „ 

=  -^x~a7y 

=  -kxx  -  kyy. 

The  magnitude  of  this  force  vector  is  k |r|,  and  its  direction  is  to¬ 
ward  the  origin. 

This  chapter  is  summarized  on  page  991 .  Notation  and  terminology 
are  tabulated  on  pages  984-985. 
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The  symbols  \/,  ^ ,  etc.  are  explained  on  page  238. 

1  Derive  a  formula  expressing  the  kinetic  energy  of  an  object  in 

terms  of  its  momentum  and  mass.  V 

2  Two  people  in  a  rowboat  wish  to  move  around  without  causing 

the  boat  to  move.  What  should  be  true  about  their  total  momen¬ 
tum?  Explain.  ^ 

3  A  bullet  leaves  the  barrel  of  a  gun  with  a  kinetic  energy  of  90 
J.  The  gun  barrel  is  50  cm  long.  The  gun  has  a  mass  of  4  kg,  the 
bullet  10  g. 

(a)  Find  the  bullet’s  final  velocity.  V 

(b)  Find  the  bullet’s  final  momentum.  V 

(c)  Find  the  momentum  of  the  recoiling  gun. 

(d)  Find  the  kinetic  energy  of  the  recoiling  gun,  and  explain  why 

the  recoiling  gun  does  not  kill  the  shooter.  V  ^ 

4  The  big  difference  between  the  equations  for  momentum  and 
kinetic  energy  is  that  one  is  proportional  to  v  and  one  to  v2.  Both, 
however,  are  proportional  to  m.  Suppose  someone  tells  you  that 
there’s  a  third  quantity,  funkosity,  defined  as  /  =  m2v ,  and  that 
funkosity  is  conserved.  How  do  you  know  your  leg  is  being  pulled? 

>  Solution,  p.  973  - 

5  A  ball  of  mass  2 m  collides  head-on  with  an  initially  stationary 
ball  of  mass  m.  No  kinetic  energy  is  transformed  into  heat  or  sound. 
In  what  direction  is  the  mass-2m  ball  moving  after  the  collision,  and 
how  fast  is  it  going  compared  to  its  original  velocity? 

>  Answer,  p.  969 

6  A  very  massive  object  with  velocity  v  collides  head-on  with 
an  object  at  rest  whose  mass  is  very  small.  No  kinetic  energy  is 
converted  into  other  forms.  Prove  that  the  low-mass  object  recoils 
with  velocity  2v.  [Hint:  Use  the  center-of-mass  frame  of  reference.] 

■ 

7  A  mass  m  moving  at  velocity  v  collides  with  a  stationary  target 

having  the  same  mass  m.  Find  the  maximum  amount  of  energy  that 
can  be  released  as  heat  and  sound.  V  ■ 

8  A  rocket  ejects  exhaust  with  an  exhaust  velocity  it.  The  rate 
at  which  the  exhaust  mass  is  used  (mass  per  unit  time)  is  b.  We 
assume  that  the  rocket  accelerates  in  a  straight  line  starting  from 
rest,  and  that  no  external  forces  act  on  it.  Let  the  rocket’s  initial 
mass  (fuel  plus  the  body  and  payload)  be  m, ,  and  mj  be  its  final 
mass,  after  all  the  fuel  is  used  up.  (a)  Find  the  rocket’s  final  velocity, 
v,  in  terms  of  u.  rrii,  and  my.  Neglect  the  effects  of  special  relativity, 
(b)  A  typical  exhaust  velocity  for  chemical  rocket  engines  is  4000 
m/s.  Estimate  the  initial  mass  of  a  rocket  that  could  accelerate  a 
one-ton  payload  to  10%  of  the  speed  of  light,  and  show  that  this 
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design  won’t  work.  (For  the  sake  of  the  estimate,  ignore  the  mass  of 
the  fuel  tanks.  The  speed  is  fairly  small  compared  to  c,  so  it’s  not 
an  unreasonable  approximation  to  ignore  relativity.)  V  Si 

9  An  object  is  observed  to  be  moving  at  constant  speed  along 
a  line.  Can  you  conclude  that  no  forces  are  acting  on  it?  Explain. 
[Based  on  a  problem  by  Serway  and  Faughn.] 

10  At  low  speeds,  every  car’s  acceleration  is  limited  by  traction, 
not  by  the  engine’s  power.  Suppose  that  at  low  speeds,  a  certain 
car  is  normally  capable  of  an  acceleration  of  3  m/s2.  If  it  is  towing 
a  trailer  with  half  as  much  mass  as  the  car  itself,  what  acceleration 
can  it  achieve?  [Based  on  a  problem  from  PSSC  Physics.] 

11  (a)  Let  T  be  the  maximum  tension  that  an  elevator’s  cable 

can  withstand  without  breaking,  i.e. ,  the  maximum  force  it  can 
exert.  If  the  motor  is  programmed  to  give  the  car  an  acceleration 
a  (a  >  0  is  upward),  what  is  the  maximum  mass  that  the  car  can 
have,  including  passengers,  if  the  cable  is  not  to  break?  v 

(b)  Interpret  the  equation  you  derived  in  the  special  cases  of  a  =  0 
and  of  a  downward  acceleration  of  magnitude  g. 


12  When  the  contents  of  a  refrigerator  cool  down,  the  changed 

molecular  speeds  imply  changes  in  both  momentum  and  energy. 
Why,  then,  does  a  fridge  transfer  power  through  its  radiator  coils, 
but  not  force ?  >  Solution,  p.  973  ■ 

13  A  helicopter  of  mass  m  is  taking  off  vertically.  The  only 
forces  acting  on  it  are  the  earth’s  gravitational  force  and  the  force, 
Fair ,  of  the  air  pushing  up  on  the  propeller  blades. 

(a)  If  the  helicopter  lifts  off  at  t  =  0,  what  is  its  vertical  speed  at 
time  tl 

(b)  Check  that  the  units  of  your  answer  to  part  a  make  sense. 

(c)  Discuss  how  your  answer  to  part  a  depends  on  all  three  variables, 
and  show  that  it  makes  sense.  That  is,  for  each  variable,  discuss 
what  would  happen  to  the  result  if  you  changed  it  while  keeping  the 
other  two  variables  constant.  Would  a  bigger  value  give  a  smaller 
result,  or  a  bigger  result?  Once  you’ve  figured  out  this  mathematical 
relationship,  show  that  it  makes  sense  physically. 

(d)  Plug  numbers  into  your  equation  from  part  a,  using  m  =  2300 

kg,  Fair  =  27000  N,  and  f  =  4.0  s.  V  ■ 

14  A  blimp  is  initially  at  rest,  hovering,  when  at  t  =  0  the  pilot 
turns  on  the  engine  driving  the  propeller.  The  engine  cannot  in¬ 
stantly  get  the  propeller  going,  but  the  propeller  speeds  up  steadily. 
The  steadily  increasing  force  between  the  air  and  the  propeller  is 
given  by  the  equation  F  =  kt,  where  A:  is  a  constant.  If  the  mass 
of  the  blimp  is  m,  find  its  position  as  a  function  of  time.  (Assume 
that  during  the  period  of  time  you’re  dealing  with,  the  blimp  is  not 
yet  moving  fast  enough  to  cause  a  significant  backward  force  due  to 
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Problem  16. 


rubber  wheel  — 

— \ - 

car 

□•Cl 


steel  rail 

Problem  19. 
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air  resistance.)  V  - 

15  A  car  is  accelerating  forward  along  a  straight  road.  If  the  force 

of  the  road  on  the  car’s  wheels,  pushing  it  forward,  is  a  constant  3.0 
kN,  and  the  car’s  mass  is  1000  kg,  then  how  long  will  the  car  take 
to  go  from  20  m/s  to  50  m/s?  >  Solution,  p.  973  »- 

16  A  little  old  lady  and  a  pro  football  player  collide  head-on. 

Compare  their  forces  on  each  other,  and  compare  their  accelerations. 
Explain.  ^ 

17  The  earth  is  attracted  to  an  object  with  a  force  equal  and 
opposite  to  the  force  of  the  earth  on  the  object.  If  this  is  true, 
why  is  it  that  when  you  drop  an  object,  the  earth  does  not  have  an 
acceleration  equal  and  opposite  to  that  of  the  object? 

18  When  you  stand  still,  there  are  two  forces  acting  on  you, 
the  force  of  gravity  (your  weight)  and  the  normal  force  of  the  floor 
pushing  up  on  your  feet.  Are  these  forces  equal  and  opposite?  Does 
Newton’s  third  law  relate  them  to  each  other?  Explain. 

19  Today’s  tallest  buildings  are  really  not  that  much  taller  than 
the  tallest  buildings  of  the  1940’s.  One  big  problem  with  making  an 
even  taller  skyscraper  is  that  every  elevator  needs  its  own  shaft  run¬ 
ning  the  whole  height  of  the  building.  So  many  elevators  are  needed 
to  serve  the  building’s  thousands  of  occupants  that  the  elevator 
shafts  start  taking  up  too  much  of  the  space  within  the  building. 
An  alternative  is  to  have  elevators  that  can  move  both  horizontally 
and  vertically:  with  such  a  design,  many  elevator  cars  can  share  a 
few  shafts,  and  they  don’t  get  in  each  other’s  way  too  much  because 
they  can  detour  around  each  other.  In  this  design,  it  becomes  im¬ 
possible  to  hang  the  cars  from  cables,  so  they  would  instead  have  to 
ride  on  rails  which  they  grab  onto  with  wheels.  Friction  would  keep 
them  from  slipping.  The  figure  shows  such  a  frictional  elevator  in 
its  vertical  travel  mode.  (The  wheels  on  the  bottom  are  for  when  it 
needs  to  switch  to  horizontal  motion.) 

(a)  If  the  coefficient  of  static  friction  between  rubber  and  steel  is 

jis .  and  the  maximum  mass  of  the  car  plus  its  passengers  is  M, 
how  much  force  must  there  be  pressing  each  wheel  against  the  rail 
in  order  to  keep  the  car  from  slipping?  (Assume  the  car  is  not 
accelerating.)  V 

(b)  Show  that  your  result  has  physically  reasonable  behavior  with 

respect  to  jis.  In  other  words,  if  there  was  less  friction,  would  the 
wheels  need  to  be  pressed  more  firmly  or  less  firmly?  Does  your 
equation  behave  that  way?  ■ 
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20  A  tugboat  of  mass  m  pulls  a  ship  of  mass  M,  accelerating 
it.  Ignore  fluid  friction  acting  on  their  hulls,  although  there  will  of 
course  need  to  be  fluid  friction  acting  on  the  tug’s  propellers. 

(a)  If  the  force  acting  on  the  tug’s  propeller  is  F,  what  is  the  tension, 

T,  in  the  cable  connecting  the  two  ships?  >  Hint,  p.  959  V 

(b)  Interpret  your  answer  in  the  special  cases  of  M  =  0  and  M  =  oo. 

■ 

21  Someone  tells  you  she  knows  of  a  certain  type  of  Central 

American  earthworm  whose  skin,  when  rubbed  on  polished  dia¬ 
mond,  has  fik  >  Ms-  Why  is  this  not  just  empirically  unlikely  but 
logically  suspect?  - 

22  A  uranium  atom  deep  in  the  earth  spits  out  an  alpha  particle. 
An  alpha  particle  is  a  fragment  of  an  atom.  This  alpha  particle  has 
initial  speed  v,  and  travels  a  distance  d  before  stopping  in  the  earth. 

(a)  Find  the  force,  F,  from  the  dirt  that  stopped  the  particle,  in 

terms  of  v,d,  and  its  mass,  m.  Don’t  plug  in  any  numbers  yet. 
Assume  that  the  force  was  constant.  V 

(b)  Show  that  your  answer  has  the  right  units. 

(c)  Discuss  how  your  answer  to  part  a  depends  on  all  three  variables, 
and  show  that  it  makes  sense.  That  is,  for  each  variable,  discuss 
what  would  happen  to  the  result  if  you  changed  it  while  keeping  the 
other  two  variables  constant.  Would  a  bigger  value  give  a  smaller 
result,  or  a  bigger  result?  Once  you’ve  figured  out  this  mathematical 
relationship,  show  that  it  makes  sense  physically. 

(d)  Evaluate  your  result  for  m  =  6.7  x  10-27  kg,  v  =  2.0  x  104  km/s, 

and  d  =  0.71  mm.  v 


23  You  are  given  a  large  sealed  box,  and  are  not  allowed  to  open 
it.  Which  of  the  following  experiments  measure  its  mass,  and  which 
measure  its  weight?  [Hint:  Which  experiments  would  give  different 
results  on  the  moon?] 

(a)  Put  it  on  a  frozen  lake,  throw  a  rock  at  it,  and  see  how  fast  it 
scoots  away  after  being  hit. 

(b)  Drop  it  from  a  third-floor  balcony,  and  measure  how  loud  the 
sound  is  when  it  hits  the  ground. 

(c)  As  shown  in  the  figure,  connect  it  with  a  spring  to  the  wall,  and 
watch  it  vibrate. 

>  Solution,  p.  973  - 

24  While  escaping  from  the  palace  of  the  evil  Martian  em¬ 
peror,  Sally  Spacehound  jumps  from  a  tower  of  height  h  down  to 
the  ground.  Ordinarily  the  fall  would  be  fatal,  but  she  fires  her 
blaster  rifle  straight  down,  producing  an  upward  force  of  magnitude 
Fb ■  This  force  is  insufficient  to  levitate  her,  but  it  does  cancel  out 
some  of  the  force  of  gravity.  During  the  time  t  that  she  is  falling, 
Sally  is  unfortunately  exposed  to  fire  from  the  emperor’s  minions, 
and  can’t  dodge  their  shots.  Let  m  be  her  mass,  and  g  the  strength 
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of  gravity  on  Mars. 

(a)  Find  the  time  t  in  terms  of  the  other  variables. 

(b)  Check  the  units  of  your  answer  to  part  a. 

(c)  For  sufficiently  large  values  of  Fb,  your  answer  to  part  a  becomes 

nonsense  —  explain  what’s  going  on.  V 

■ 

25  When  I  cook  rice,  some  of  the  dry  grains  always  stick  to  the 
measuring  cup.  To  get  them  out,  I  turn  the  measuring  cup  upside- 
down  and  hit  the  “roof”  with  my  hand  so  that  the  grains  come  off  of 
the  “ceiling.”  (a)  Explain  why  static  friction  is  irrelevant  here,  (b) 
Explain  why  gravity  is  negligible,  (c)  Explain  why  hitting  the  cup 
works,  and  why  its  success  depends  on  hitting  the  cup  hard  enough. 

■ 

26  A  flexible  rope  of  mass  m  and  length  L  slides  without  friction 
over  the  edge  of  a  table.  Let  x  be  the  length  of  the  rope  that  is 
hanging  over  the  edge  at  a  given  moment  in  time. 

(a)  Show  that  x  satisfies  the  equation  of  motion  &  x/  df2  =  gx/L. 
[Hint:  Use  F  =  dp/  df,  which  allows  you  to  handle  the  two  parts  of 
the  rope  separately  even  though  mass  is  moving  out  of  one  part  and 
into  the  other.] 

(b)  Give  a  physical  explanation  for  the  fact  that  a  larger  value  of 
x  on  the  right-hand  side  of  the  equation  leads  to  a  greater  value  of 
the  acceleration  on  the  left  side. 

(c)  When  we  take  the  second  derivative  of  the  function  x(t)  we  are 
supposed  to  get  essentially  the  same  function  back  again,  except 
for  a  constant  out  in  front.  The  function  ex  has  the  property  that 
it  is  unchanged  by  differentiation,  so  it  is  reasonable  to  look  for 
solutions  to  this  problem  that  are  of  the  form  x  =  bect,  where  b  and 
c  are  constants.  Show  that  this  does  indeed  provide  a  solution  for 
two  specific  values  of  c  (and  for  any  value  of  b). 

(d)  Show  that  the  sum  of  any  two  solutions  to  the  equation  of  motion 
is  also  a  solution. 

(e)  Find  the  solution  for  the  case  where  the  rope  starts  at  rest  at 

t  =  0  with  some  nonzero  value  of  x.  ^ 

In  problems  27-31,  analyze  the  forces  using  a  table  in  the  format 
shown  in  section  3.2.6.  Analyze  the  forces  in  which  the  italicized 
object  participates. 

27  Some  people  put  a  spare  car  key  in  a  little  magnetic  box 
that  they  stick  under  the  chassis  of  their  car.  Let’s  say  that  the 
box  is  stuck  directly  underneath  a  horizontal  surface,  and  the  car  is 
parked.  (See  instructions  above.) 

28  Analyze  two  examples  of  objects  at  rest  relative  to  the  earth 
that  are  being  kept  from  falling  by  forces  other  than  the  normal 
force.  Do  not  use  objects  in  outer  space,  and  do  not  duplicate 
problem  27  or  31.  (See  instructions  above.) 
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29  A  person  is  rowing  a  boat,  with  her  feet  braced.  She  is  doing 
the  part  of  the  stroke  that  propels  the  boat,  with  the  ends  of  the 
oars  in  the  water  (not  the  part  where  the  oars  are  out  of  the  water) . 
(See  instructions  above.) 

30  A  farmer  is  in  a  stall  with  a  cow  when  the  cow  decides 

to  press  him  against  the  wall,  pinning  him  with  his  feet  off  the 
ground.  Analyze  the  forces  in  which  the  farmer  participates.  (See 
instructions  above.)  - 

31  A  propeller  plane  is  cruising  east  at  constant  speed  and 

altitude.  (See  instructions  above.)  - 

32  The  figure  shows  a  stack  of  two  blocks,  sitting  on  top  of  a  table 
that  is  bolted  to  the  floor.  All  three  objects  are  made  from  iden¬ 
tical  wood,  with  their  surfaces  finished  identically  using  the  same 
sandpaper.  We  tap  the  middle  block,  giving  it  an  initial  velocity  v 
to  the  right.  The  tap  is  executed  so  rapidly  that  almost  no  initial 
velocity  is  imparted  to  the  top  block. 

(a)  Find  the  time  that  will  elapse  until  the  slipping  between  the  top 

and  middle  blocks  stops.  Express  your  answer  in  terms  of  v,  m,  M, 
g ,  and  the  relevant  coefficient  of  friction.  V 

(b)  Show  that  your  answer  makes  sense  in  terms  of  units. 

(c)  Check  that  your  result  has  the  correct  behavior  when  you  make 
m  bigger  or  smaller.  Explain.  This  means  that  you  should  discuss 
the  mathematical  behavior  of  the  result,  and  then  explain  how  this 
corresponds  to  what  would  really  happen  physically. 

(d)  Similarly,  discuss  what  happens  when  you  make  M  bigger  or 
smaller. 

(e)  Similarly,  discuss  what  happens  when  you  make  g  bigger  or 

smaller.  - 

33  Ginny  has  a  plan.  She  is  going  to  ride  her  sled  while  her  dog 
Foo  pulls  her,  and  she  holds  on  to  his  leash.  However,  Ginny  hasn’t 
taken  physics,  so  there  may  be  a  problem:  she  may  slide  right  off 
the  sled  when  Foo  starts  pulling. 

(a)  Analyze  all  the  forces  in  which  Ginny  participates,  making  a 
table  as  in  subsection  3.2.6. 

(b)  Analyze  all  the  forces  in  which  the  sled  participates. 

(c)  The  sled  has  mass  m,  and  Ginny  has  mass  M .  The  coefficient 

of  static  friction  between  the  sled  and  the  snow  is  /j  i ,  and  is 
the  corresponding  quantity  for  static  friction  between  the  sled  and 
her  snow  pants.  Ginny  must  have  a  certain  minimum  mass  so  that 
she  will  not  slip  off  the  sled.  Find  this  in  terms  of  the  other  three 
variables.  V 

(d)  Interpreting  your  equation  from  part  c,  under  what  conditions 

will  there  be  no  physically  realistic  solution  for  Ml  Discuss  what 
this  means  physically.  - 

34  In  each  case,  identify  the  force  that  causes  the  acceleration, 
and  give  its  Newton’s-third-law  partner.  Describe  the  effect  of  the 


Problem  31 . 


Problem  32 


Problem  33. 


Problems 


partner  force,  (a)  A  swimmer  speeds  up.  (b)  A  golfer  hits  the  ball 
off  of  the  tee.  (c)  An  archer  fires  an  arrow,  (d)  A  locomotive  slows 
down.  >  Solution,  p.  973  3 

35  A  cop  investigating  the  scene  of  an  accident  measures  the 

length  L  of  a  car’s  skid  marks  in  order  to  find  out  its  speed  v  at 
the  beginning  of  the  skid.  Express  v  in  terms  of  L  and  any  other 
relevant  variables.  V  ■ 

36  An  ice  skater  builds  up  some  speed,  and  then  coasts  across 
the  ice  passively  in  a  straight  line,  (a)  Analyze  the  forces,  using  a 
table  in  the  format  shown  in  subsection  3.2.6. 

(b)  If  his  initial  speed  is  v,  and  the  coefficient  of  kinetic  friction  is  //*,, 

find  the  maximum  theoretical  distance  he  can  glide  before  coming 
to  a  stop.  Ignore  air  resistance.  V 

(c)  Show  that  your  answer  to  part  b  has  the  right  units. 

(d)  Show  that  your  answer  to  part  b  depends  on  the  variables  in  a 
way  that  makes  sense  physically. 

(e)  Evaluate  your  answer  numerically  for  =  0.0046,  and  a  world- 
record  speed  of  14.58  m/s.  (The  coefficient  of  friction  was  measured 
by  De  Koning  et  ah,  using  special  skates  worn  by  real  speed  skaters.) 

V 

(f)  Comment  on  whether  your  answer  in  part  e  seems  realistic.  If  it 
doesn’t,  suggest  possible  reasons  why. 


Problem  38 


37  (a)  Using  the  solution  of  problem  37  on  page  124,  predict 
how  the  spring  constant  of  a  fiber  will  depend  on  its  length  and 
cross-sectional  area. 

(b)  The  constant  of  proportionality  is  called  the  Young’s  modulus, 
E,  and  typical  values  of  the  Young’s  modulus  are  about  1010  to 
1011.  What  units  would  the  Young’s  modulus  have  in  the  SI  system? 

>  Solution,  p.  974  - 

38  This  problem  depends  on  the  results  of  problems  problem  37 
on  page  124  and  problem  37  from  this  chapter.  When  atoms  form 
chemical  bonds,  it  makes  sense  to  talk  about  the  spring  constant  of 
the  bond  as  a  measure  of  how  “stiff”  it  is.  Of  course,  there  aren’t 
really  little  springs  —  this  is  just  a  mechanical  model.  The  purpose 
of  this  problem  is  to  estimate  the  spring  constant,  k,  for  a  single 
bond  in  a  typical  piece  of  solid  matter.  Suppose  we  have  a  fiber, 
like  a  hair  or  a  piece  of  fishing  line,  and  imagine  for  simplicity  that 
it  is  made  of  atoms  of  a  single  element  stacked  in  a  cubical  manner, 
as  shown  in  the  figure,  with  a  center-to-center  spacing  b.  A  typical 
value  for  b  would  be  about  10“ 10  m. 

(a)  Find  an  equation  for  k  in  terms  of  b,  and  in  terms  of  the  Young’s 
modulus,  E,  defined  in  problem  37  and  its  solution. 

(b)  Estimate  k  using  the  numerical  data  given  in  problem  37. 

(c)  Suppose  you  could  grab  one  of  the  atoms  in  a  diatomic  molecule 
like  H2  or  O2,  and  let  the  other  atom  hang  vertically  below  it.  Does 
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the  bond  stretch  by  any  appreciable  fraction  due  to  gravity? 

39  This  problem  has  been  deleted.  ■ 

40  Many  fish  have  an  organ  known  as  a  swim  bladder,  an  air- 
filled  cavity  whose  main  purpose  is  to  control  the  fish’s  buoyancy 
and  allow  it  to  keep  from  rising  or  sinking  without  having  to  use  its 
muscles.  In  some  fish,  however,  the  swim  bladder  (or  a  small  exten¬ 
sion  of  it)  is  linked  to  the  ear  and  serves  the  additional  purpose  of 
amplifying  sound  waves.  For  a  typical  fish  having  such  an  anatomy, 
the  bladder  has  a  resonant  frequency  of  300  Hz,  the  bladder’s  Q  is  3, 
and  the  maximum  amplification  is  about  a  factor  of  100  in  energy. 
Over  what  range  of  frequencies  would  the  amplification  be  at  least 
a  factor  of  50? 

41  An  oscillator  with  sufficiently  strong  damping  has  its  maxi¬ 
mum  response  at  uj  =  0.  Using  the  result  derived  on  page  952  ,  find 
the  value  of  Q  at  which  this  behavior  sets  in. 

>  Hint,  p.  959  >  Answer,  p.  970  - 


42  An  oscillator  has  (5=6.00,  and,  for  convenience,  let’s  assume 
Frn  =  1.00,  ui0  =  1.00,  and  m  =  1.00.  The  usual  approximations 
would  give 

Mres  = 

Ares  =  6.00,  and 
Au  =  1/6.00. 

Determine  these  three  quantities  numerically  using  the  result  de¬ 
rived  on  page  952  ,  and  compare  with  the  approximations. 


43  The  apparatus  in  figure  d  on  page  61  had  a  natural  period 
of  oscillation  of  5  hours  and  20  minutes.  The  authors  estimated, 
based  on  calculations  of  internal  friction  in  the  tungsten  wire,  that 
its  Q  was  on  the  order  of  106,  but  they  were  unable  to  measure  it 
empirically  because  it  would  have  taken  years  for  the  amplitude  to 
die  down  by  any  measurable  amount.  Although  each  aluminum  or 
platinum  mass  was  really  moving  along  an  arc  of  a  circle,  any  actual 
oscillations  caused  by  a  violation  of  the  equivalence  of  gravitational 
and  inertial  mass  would  have  been  measured  in  millions  of  a  degree, 
so  it’s  a  good  approximation  to  say  that  each  mass’s  motion  was 
along  a  (very  short!)  straight  line  segment.  We  can  also  treat  each 
mass  as  if  it  was  oscillating  separately  from  the  others.  If  the  prin¬ 
ciple  of  equivalence  had  been  violated  at  the  10“ 12  level,  the  limit  of 
their  experiment’s  sensitivity,  the  sun’s  gravitational  force  on  one  of 
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the  0.4-gram  masses  would  have  been  about  3  x  10~19  N,  oscillating 
with  a  period  of  24  hours  due  to  the  rotation  of  the  earth.  (We  ig¬ 
nore  the  inertia  of  the  arms,  whose  total  mass  was  only  about  25% 
of  the  total  mass  of  the  rotating  assembly.) 

(a)  Find  the  amplitude  of  the  resulting  oscillations,  and  determine 

the  angle  to  which  they  would  have  corresponded,  given  that  the 
radius  of  the  balance  arms  was  10  cm.  >  Answer,  p.  970 

(b)  Show  that  even  if  their  estimate  of  Q  was  wildly  wrong,  it 

wouldn’t  have  affected  this  result.  ■ 

44  A  firework  shoots  up  into  the  air,  and  just  before  it  explodes 
it  has  a  certain  momentum  and  kinetic  energy.  What  can  you  say 
about  the  momenta  and  kinetic  energies  of  the  pieces  immediately 
after  the  explosion?  [Based  on  a  problem  from  PSSC  Physics.] 

>  Solution,  p.  974  ^ 

45  The  figure  shows  a  view  from  above  of  a  collision  about 

to  happen  between  two  air  hockey  pucks  sliding  without  friction. 
They  have  the  same  speed,  Vi,  before  the  collision,  but  the  big  puck 
is  2.3  times  more  massive  than  the  small  one.  Their  sides  have 
sticky  stuff  on  them,  so  when  they  collide,  they  will  stick  together. 
At  what  angle  will  they  emerge  from  the  collision?  In  addition  to 
giving  a  numerical  answer,  please  indicate  by  drawing  on  the  figure 
how  your  angle  is  defined.  >  Solution,  p.  974  - 

Problem  45 

46  A  learjet  traveling  due  east  at  300  mi/hr  collides  with  a 

jumbo  jet  which  was  heading  southwest  at  150  mi/hr.  The  jumbo 
jet’s  mass  is  five  times  greater  than  that  of  the  learjet.  When  they 
collide,  the  learjet  sticks  into  the  fuselage  of  the  jumbo  jet,  and  they 
fall  to  earth  together.  Their  engines  stop  functioning  immediately 
after  the  collision.  On  a  map,  what  will  be  the  direction  from  the 
location  of  the  collision  to  the  place  where  the  wreckage  hits  the 
ground?  (Give  an  angle.)  V  ■ 

47  (a)  A  ball  is  thrown  straight  up  with  velocity  v.  Find  an 

equation  for  the  height  to  which  it  rises.  v 

(b)  Generalize  your  equation  for  a  ball  thrown  at  an  angle  9  above 
horizontal,  in  which  case  its  initial  velocity  components  are  vx  = 
v  cos  9  and  vy  =  v  sin  9.  V 

■ 

48  At  the  2010  Salinas  Lettuce  Festival  Parade,  the  Lettuce 

Queen  drops  her  bouquet  while  riding  on  a  float  moving  toward  the 
right.  Sketch  the  shape  of  its  trajectory  in  her  frame  of  reference, 
and  compare  with  the  shape  seen  by  one  of  her  admirers  standing 
on  the  sidewalk.  - 

49  Two  daredevils,  Wendy  and  Bill,  go  over  Niagara  Falls. 
Wendy  sits  in  an  inner  tube,  and  lets  the  30  km/hr  velocity  of  the 
river  throw  her  out  horizontally  over  the  falls.  Bill  paddles  a  kayak, 
adding  an  extra  10  km/hr  to  his  velocity.  They  go  over  the  edge 


226 


Chapter  3 


Conservation  of  Momentum 


of  the  falls  at  the  same  moment,  side  by  side.  Ignore  air  friction. 
Explain  your  reasoning. 

(a)  Who  hits  the  bottom  first? 

(b)  What  is  the  horizontal  component  of  Wendy’s  velocity  on  im¬ 
pact? 

(c)  What  is  the  horizontal  component  of  Bill’s  velocity  on  impact? 

(d)  Who  is  going  faster  on  impact?  ■ 

50  A  baseball  pitcher  throws  a  pitch  clocked  at  vx  =  73.3  miles/hour. 
He  throws  horizontally.  By  what  amount,  d,  does  the  ball  drop  by 
the  time  it  reaches  home  plate,  L  =  60.0  feet  away? 

(a)  First  find  a  symbolic  answer  in  terms  of  L,  vx,  and  g.  V 

(b)  Plug  in  and  find  a  numerical  answer.  Express  your  answer 

in  units  of  ft.  (Note:  1  foot=12  inches,  1  mile=5280  feet,  and  1 
inch=2.54  cm)  V  ■ 


Problem  50. 


L=60.0  ft 


51  A  batter  hits  a  baseball  at  speed  v,  at  an  angle  9  above 
horizontal. 

(a)  Find  an  equation  for  the  range  (horizontal  distance  to  where 
the  ball  falls),  R,  in  terms  of  the  relevant  variables.  Neglect  air 
friction  and  the  height  of  the  ball  above  the  ground  when  it  is  hit. 

>  Answer,  p.  970 

(b)  Interpret  your  equation  in  the  cases  of  9=0  and  9  =  90°. 

(c)  Find  the  angle  that  gives  the  maximum  range. 

>  Answer,  p.  970  - 

52  In  this  problem  you’ll  extend  the  analysis  in  problem  51 

to  include  air  friction  by  writing  a  computer  program.  For  a 
game  played  at  sea  level,  the  force  due  to  air  friction  is  approxi¬ 
mately  (7  x  10~4  N-s2/m2)u2,  in  the  direction  opposite  to  the  motion 
of  the  ball. The  mass  of  a  baseball  is  0.146  kg. 

(a)  For  a  ball  hit  at  a  speed  of  45.0  m/s  from  a  height  of  1.0  m,  find 

the  optimal  angle  and  the  resulting  range.  i>  Answer,  p.  970 

(b)  How  much  farther  would  the  ball  fly  at  the  Colorado  Rockies’ 
stadium,  where  the  thinner  air  gives  18  percent  less  air  friction? 

>  Answer,  p.  970  £ 
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53  If  you  walk  35  km  at  an  angle  25°  counterclockwise  from 
east,  and  then  22  km  at  230°counterclockwise  from  east,  find  the 
distance  and  direction  from  your  starting  point  to  your  destination. 

V  ^ 

54  The  figure  shows  vectors  A  and  B.  Graphically  calculate  the 
following,  as  in  figure  q  on  p.  200. 

A  +  B,  A  -  B,  B  -  A,  — 2B,  A  -  2B 

No  numbers  are  involved. 

55  Phnom  Penh  is  470  km  east  and  250  km  south  of  Bangkok. 
Hanoi  is  60  km  east  and  1030  km  north  of  Phnom  Penh. 

(a)  Choose  a  coordinate  system,  and  translate  these  data  into  Ax 
and  Ay  values  with  the  proper  plus  and  minus  signs. 

(b)  Find  the  components  of  the  Ar  vector  pointing  from  Bangkok 

to  Hanoi.  V 

■ 

56  Is  it  possible  for  a  helicopter  to  have  an  acceleration  due  east 

and  a  velocity  due  west?  If  so,  what  would  be  going  on?  If  not,  why 
not?  ■ 


north 


direction  of  motion 
of  fossil  relative  to 
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Problem  57. 


57  As  shown  in  the  diagram,  a  dinosaur  fossil  is  slowly  moving 

down  the  slope  of  a  glacier  under  the  influence  of  wind,  rain  and 
gravity.  At  the  same  time,  the  glacier  is  moving  relative  to  the 
continent  underneath.  The  dashed  lines  represent  the  directions  but 
not  the  magnitudes  of  the  velocities.  Pick  a  scale,  and  use  graphical 
addition  of  vectors  to  find  the  magnitude  and  the  direction  of  the 
fossil’s  velocity  relative  to  the  continent.  You  will  need  a  ruler  and 
protractor.  V  ^ 

58  A  bird  is  initially  flying  horizontally  east  at  21.1  m/s,  but  one 
second  later  it  has  changed  direction  so  that  it  is  flying  horizontally 
and  7°  north  of  east,  at  the  same  speed.  What  are  the  magnitude 
and  direction  of  its  acceleration  vector  during  that  one  second  time 
interval?  (Assume  its  acceleration  was  roughly  constant.)  V  ^ 
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59  Your  hand  presses  a  block  of  mass  m  against  a  wall  with 

a  force  F u  acting  at  an  angle  9,  as  shown  in  the  figure.  Find  the 
minimum  and  maximum  possible  values  of  |F#|  that  can  keep  the 
block  stationary,  in  terms  of  m,  g,  9 ,  and  gs ,  the  coefficient  of  static 
friction  between  the  block  and  the  wall.  Check  both  your  answers 
in  the  case  of  9  =  90°,  and  interpret  the  case  where  the  maximum 
force  is  infinite.  Z  H 

60  A  skier  of  mass  m  is  coasting  down  a  slope  inclined  at 
an  angle  9  compared  to  horizontal.  Assume  for  simplicity  that  the 
treatment  of  kinetic  friction  given  in  chapter  5  is  appropriate  here, 
although  a  soft  and  wet  surface  actually  behaves  a  little  differently. 
The  coefficient  of  kinetic  friction  acting  between  the  skis  and  the 
snow  is  /I k,  and  in  addition  the  skier  experiences  an  air  friction 
force  of  magnitude  bv 2,  where  b  is  a  constant. 

(a)  Find  the  maximum  speed  that  the  skier  will  attain,  in  terms  of 

the  variables  m,  g ,  9 ,  and  b.  V 

(b)  For  angles  below  a  certain  minimum  angle  9min ,  the  equation 

gives  a  result  that  is  not  mathematically  meaningful.  Find  an  equa¬ 
tion  for  9min,  and  give  a  physical  explanation  of  what  is  happening 
for  9  <  9 min .  V 

■ 

61  A  gun  is  aimed  horizontally  to  the  west.  The  gun  is  fired,  and 
the  bullet  leaves  the  muzzle  at  t  =  0.  The  bullet’s  position  vector 
as  a  function  of  time  is  r  =  6x  +  ct.y  +  dt2 z,  where  b ,  c,  and  d  are 
positive  constants. 

(a)  What  units  would  b,  c,  and  d  need  to  have  for  the  equation  to 
make  sense? 

(b)  Find  the  bullet’s  velocity  and  acceleration  as  functions  of  time. 

(c)  Give  physical  interpretations  of  b,  c,  d,  x,  y,  and  z. 

62  Annie  Oakley,  riding  north  on  horseback  at  30  mi/hr,  shoots 

her  rifle,  aiming  horizontally  and  to  the  northeast.  The  muzzle  speed 
of  the  rifle  is  140  mi/hr.  When  the  bullet  hits  a  defenseless  fuzzy 
animal,  what  is  its  speed  of  impact?  Neglect  air  resistance,  and 
ignore  the  vertical  motion  of  the  bullet.  >  Solution,  p.  974  - 

63  A  cargo  plane  has  taken  off  from  a  tiny  airstrip  in  the  Andes, 

and  is  climbing  at  constant  speed,  at  an  angle  of  9  =  17°  with  respect 
to  horizontal.  Its  engines  supply  a  thrust  of  Ftj irust  =  200  kN,  and 
the  lift  from  its  wings  is  Fuft  =  654  kN.  Assume  that  air  resistance 
(drag)  is  negligible,  so  the  only  forces  acting  are  thrust,  lift,  and 
weight.  What  is  its  mass,  in  kg?  >  Solution,  p.  975  - 

64  A  wagon  is  being  pulled  at  constant  speed  up  a  slope  9 
by  a  rope  that  makes  an  angle  (j)  with  the  vertical,  (a)  Assuming 
negligible  friction,  show  that  the  tension  in  the  rope  is  given  by  the 
equation 

sin# 

T  =  ■  ia  i 

sm(#  +  (p) 
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(b)  Interpret  this  equation  in  the  special  cases  of  <f>  =  0  and  </>  = 
180°  —  6.  >  Solution,  p.  975  - 

65  The  angle  of  repose  is  the  maximum  slope  on  which  an  object 
will  not  slide.  On  airless,  geologically  inert  bodies  like  the  moon  or 
an  asteroid,  the  only  thing  that  determines  whether  dust  or  rubble 
will  stay  on  a  slope  is  whether  the  slope  is  less  steep  than  the  angle 
of  repose. 

(a)  Find  an  equation  for  the  angle  of  repose,  deciding  for  yourself 
what  are  the  relevant  variables. 

(b)  On  an  asteroid,  where  g  can  be  thousands  of  times  lower  than 
on  Earth,  would  rubble  be  able  to  lie  at  a  steeper  angle  of  repose? 

>  Solution,  p.  975  - 

66  When  you’re  done  using  an  electric  mixer,  you  can  get  most 
of  the  batter  off  of  the  beaters  by  lifting  them  out  of  the  batter  with 
the  motor  running  at  a  high  enough  speed.  Let’s  imagine,  to  make 
things  easier  to  visualize,  that  we  instead  have  a  piece  of  tape  stuck 
to  one  of  the  beaters. 

(a)  Explain  why  static  friction  has  no  effect  on  whether  or  not  the 
tape  flies  off. 

(b)  Analyze  the  forces  in  which  the  tape  participates,  using  a  table 
in  the  format  shown  in  subsection  3.2.6. 

(c)  Suppose  you  find  that  the  tape  doesn’t  fly  off  when  the  motor  is 
on  a  low  speed,  but  at  a  greater  speed,  the  tape  won’t  stay  on.  Why 
would  the  greater  speed  change  things?  [Hint:  If  you  don’t  invoke 
any  law  of  physics,  you  haven’t  explained  it.] 

67  Show  that  the  expression  |v|2/r  has  the  units  of  acceleration. 


68  A  plane  is  flown  in  a  loop-the-loop  of  radius  1.00  km.  The 

plane  starts  out  flying  upside-down,  straight  and  level,  then  begins 
curving  up  along  the  circular  loop,  and  is  right-side  up  when  it 
reaches  the  top.  (The  plane  may  slow  down  somewhat  on  the  way 
up.)  How  fast  must  the  plane  be  going  at  the  top  if  the  pilot  is  to 
experience  no  force  from  the  seat  or  the  seatbelt  while  at  the  top  of 
the  loop?  V 

69  Find  the  angle  between  the  following  two  vectors: 

x  +  2y  +  3z 
4x  +  5y  +  6z 


>  Hint,  p.  959  V 

70  The  two  blocks  shown  in  the  figure  have  equal  mass,  m,  and 
the  surface  is  frictionless,  (a)  What  is  the  tension  in  the  massless 
rope?  >  Hint,  p.  959  V 

(b)  Show  that  the  units  of  your  answer  make  sense. 

(c)  Check  the  physical  behavior  of  your  answer  in  the  special  cases 
of  4>  <  0  and  6  =  0,  (j)  =  90°. 
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71  (a)  We  observe  that  the  amplitude  of  a  certain  free  oscillation 

decreases  from  A0  to  A0/Z  after  n  oscillations.  Find  its  Q.  V 
(b)  The  figure  is  from  Shape  memory  in  Spider  draglines ,  Emile, 
Le  Floch,  and  Vollrath,  Nature  440:621  (2006).  Panel  1  shows  an 
electron  microscope’s  image  of  a  thread  of  spider  silk.  In  2,  a  spi¬ 
der  is  hanging  from  such  a  thread.  From  an  evolutionary  point  of 
view,  it’s  probably  a  bad  thing  for  the  spider  if  it  twists  back  and 
forth  while  hanging  like  this.  (We’re  referring  to  a  back-and-forth 
rotation  about  the  axis  of  the  thread,  not  a  swinging  motion  like  a 
pendulum.)  The  authors  speculate  that  such  a  vibration  could  make 
the  spider  easier  for  predators  to  see,  and  it  also  seems  to  me  that 
it  would  be  a  bad  thing  just  because  the  spider  wouldn’t  be  able 
to  control  its  orientation  and  do  what  it  was  trying  to  do.  Panel  3 
shows  a  graph  of  such  an  oscillation,  which  the  authors  measured 
using  a  video  camera  and  a  computer,  with  a  0.1  g  mass  hung  from  it 
in  place  of  a  spider.  Compared  to  human-made  fibers  such  as  kevlar 
or  copper  wire,  the  spider  thread  has  an  unusual  set  of  properties: 


1.  It  has  a  low  Q,  so  the  vibrations  damp  out  quickly. 


2.  It  doesn’t  become  brittle  with  repeated  twisting  as  a  copper 
wire  would. 


3.  When  twisted,  it  tends  to  settle  in  to  a  new  equilibrium  angle, 
rather  than  insisting  on  returning  to  its  original  angle.  You 
can  see  this  in  panel  2,  because  although  the  experimenters 
initially  twisted  the  wire  by  35  degrees,  the  thread  only  per¬ 
formed  oscillations  with  an  amplitude  much  smaller  than  ±35 
degrees,  settling  down  to  a  new  equilibrium  at  27  degrees. 


4.  Over  much  longer  time  scales  (hours),  the  thread  eventually 
resets  itself  to  its  original  equilbrium  angle  (shown  as  zero 
degrees  on  the  graph).  (The  graph  reproduced  here  only  shows 
the  motion  over  a  much  shorter  time  scale.)  Some  human- 
made  materials  have  this  “memory”  property  as  well,  but  they 
typically  need  to  be  heated  in  order  to  make  them  go  back  to 
their  original  shapes. 


Focusing  on  property  number  1,  estimate  the  Q  of  spider  silk  from 
the  graph.  V  ^ 
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72  A  cross-country  skier  is  gliding  on  a  level  trail,  with  negligi¬ 
ble  friction.  Then,  when  he  is  at  position  x  =  0,  the  tip  of  his  skis 
enters  a  patch  of  dirt.  As  he  rides  onto  the  dirt,  more  and  more  of 
his  weight  is  being  supported  by  the  dirt.  The  skis  have  length  £, 
so  if  he  reached  x  =  £  without  stopping,  his  weight  would  be  com¬ 
pletely  on  the  dirt.  This  problem  deals  with  the  motion  for  x  <  £. 

(a)  Find  the  acceleration  in  terms  of  x,  as  well  as  any  other  relevant 
constants. 

(b)  This  is  a  second-order  differential  equation.  You  should  be  able 
to  find  the  solution  simply  by  thinking  about  some  commonly  oc- 
curing  functions  that  you  know  about,  and  finding  two  that  have 
the  right  properties.  If  these  functions  are  x  =  f(t)  and  x  =  g(t ), 
then  the  most  general  solution  to  the  equations  of  motion  will  be  of 
the  form  x  =  af  +  bg,  where  a  and  b  are  constants  to  be  determined 
from  the  initial  conditions. 

(c)  Suppose  that  the  initial  velocity  vQ  at  x  =  0  is  such  that  he  stops 

at  x  <  £.  Find  the  time  until  he  stops,  and  show  that,  counterintu¬ 
itively,  this  time  is  independent  of  vQ.  Explain  physically  why  this 
is  true.  V 


73  A  microwave  oven  works  by  twisting  molecules  one  way  and 
then  the  other,  counterclockwise  and  then  clockwise  about  their  own 
centers,  millions  of  times  a  second.  If  you  put  an  ice  cube  or  a  stick 
of  butter  in  a  microwave,  you’ll  observe  that  the  solid  doesn’t  heat 
very  quickly,  although  eventually  melting  begins  in  one  small  spot. 
Once  this  spot  forms,  it  grows  rapidly,  while  the  rest  of  the  solid 
remains  solid;  it  appears  that  a  microwave  oven  heats  a  liquid  much 
more  rapidly  than  a  solid.  Explain  why  this  should  happen,  based 
on  the  atomic-level  description  of  heat,  solids,  and  liquids.  (See, 
e.g.,  figure  b  on  page  110.) 

Don’t  repeat  the  following  common  mistakes: 

In  a  solid,  the  atoms  are  packed  more  tightly  and  have  less  space 
between  them.  Not  true.  Ice  floats  because  it’s  less  dense  than 
water. 

In  a  liquid,  the  atoms  are  moving  much  faster.  No,  the  difference  in 
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average  speed  between  ice  at  —  1°C  and  water  at  1°C  is  only  0.4%. 


74  Problem  2-16  on  page  120  was  intended  to  be  solved  using 

conservation  of  energy.  Solve  the  same  problem  using  Newton’s 
laws.  ■ 

75  A  bead  slides  down  along  a  piece  of  wire  that  is  in  the  shape 
of  a  helix.  The  helix  lies  on  the  surface  of  a  vertical  cylinder  of 
radius  r,  and  the  vertical  distance  between  turns  is  d. 

(a)  Ordinarily  when  an  object  slides  downhill  under  the  influence  of 
kinetic  friction,  the  velocity-independence  of  kinetic  friction  implies 
that  the  acceleration  is  constant,  and  therefore  there  is  no  limit  to 
the  object’s  velocity.  Explain  the  physical  reason  why  this  argument 
fails  here,  so  that  the  bead  will  in  fact  have  some  limiting  velocity. 

(b)  Find  the  limiting  velocity. 

(c)  Show  that  your  result  has  the  correct  behavior  in  the  limit  of 

r  — >  oo.  [Problem  by  B.  Korsunsky.]  P  Si 

76  A  person  on  a  bicycle  is  to  coast  down  a  ramp  of  height  h  and 

then  pass  through  a  circular  loop  of  radius  r.  What  is  the  small¬ 
est  value  of  h  for  which  the  cyclist  will  complete  the  loop  without 
falling?  (Ignore  the  kinetic  energy  of  the  spinning  wheels.)  % 


77  A  car  accelerates  from  rest.  At  low  speeds,  its  acceleration 
is  limited  by  static  friction,  so  that  if  we  press  too  hard  on  the 
gas,  we  will  “burn  rubber”  (or,  for  many  newer  cars,  a  computer¬ 
ized  traction-control  system  will  override  the  gas  pedal).  At  higher 
speeds,  the  limit  on  acceleration  comes  from  the  power  of  the  engine, 
which  puts  a  limit  on  how  fast  kinetic  energy  can  be  developed. 

(a)  Show  that  if  a  force  F  is  applied  to  an  object  moving  at  speed 
v,  the  power  required  is  given  by  P  =  vF. 

(b)  Find  the  speed  v  at  which  we  cross  over  from  the  first  regime  de¬ 

scribed  above  to  the  second.  At  speeds  higher  than  this,  the  engine 
does  not  have  enough  power  to  burn  rubber.  Express  your  result 
in  terms  of  the  car’s  power  P ,  its  mass  m.  the  coefficient  of  static 
friction  fj,s,  and  g.  V 

(c)  Show  that  your  answer  to  part  b  has  units  that  make  sense. 

(d)  Show  that  the  dependence  of  your  answer  on  each  of  the  four 
variables  makes  sense  physically. 

(e)  The  2010  Maserati  Gran  Turismo  Convertible  has  a  maximum 
power  of  3.23  x  105  W  (433  horsepower)  and  a  mass  (including  a  50- 
kg  driver)  of  2.03  x  103  kg.  (This  power  is  the  maximum  the  engine 
can  supply  at  its  optimum  frequency  of  7600  r.p.rn.  Presumably  the 
automatic  transmission  is  designed  so  a  gear  is  available  in  which 
the  engine  will  be  running  at  very  nearly  this  frequency  when  the 
car  is  moving  at  r.)  Rubber  on  asphalt  has  g,s  ~  0.9.  Find  v  for 
this  car.  Answer:  18  m/s,  or  about  40  miles  per  hour. 

(f)  Our  analysis  has  neglected  air  friction,  which  can  probably  be 
approximated  as  a  force  proportional  to  v2 .  The  existence  of  this 
force  is  the  reason  that  the  car  has  a  maximum  speed,  which  is  176 
miles  per  hour.  To  get  a  feeling  for  how  good  an  approximation 
it  is  to  ignore  air  friction,  find  what  fraction  of  the  engine’s  maxi¬ 
mum  power  is  being  used  to  overcome  air  resistance  when  the  car  is 
moving  at  the  speed  v  found  in  part  e.  Answer:  1% 

78  Two  wheels  of  radius  r  rotate  in  the  same  vertical  plane 

with  angular  velocities  +12  and  — fl  (rates  of  rotation  in  radians 
per  second)  about  axes  that  are  parallel  and  at  the  same  height. 
The  wheels  touch  one  another  at  a  point  on  their  circumferences, 
so  that  their  rotations  mesh  like  gears  in  a  gear  train.  A  board  is 
laid  on  top  of  the  wheels,  so  that  two  friction  forces  act  upon  it, 
one  from  each  wheel.  Characterize  the  three  qualitatively  different 
types  of  motion  that  the  board  can  exhibit,  depending  on  the  initial 
conditions.  ■ 


Problem  78. 
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79  For  safety,  mountain  climbers  often  wear  a  climbing  harness 
and  tie  in  to  other  climbers  on  a  rope  team  or  to  anchors  such  as 
pitons  or  snow  anchors.  When  using  anchors,  the  climber  usually 
wants  to  tie  in  to  more  than  one,  both  for  extra  strength  and  for 
redundancy  in  case  one  fails.  The  figure  shows  such  an  arrangement, 
with  the  climber  hanging  from  a  pair  of  anchors  forming  a  “Y”  at 
an  angle  9.  The  metal  piece  at  the  center  is  called  a  carabiner.  The 
usual  advice  is  to  make  6  <  90°;  for  large  values  of  9,  the  stress 
placed  on  the  anchors  can  be  many  times  greater  than  the  actual 
load  L,  so  that  two  anchors  are  actually  less  safe  than  one. 

(a)  Find  the  force  S  at  each  anchor  in  terms  of  L  and  9.  V 

(b)  Verify  that  your  answer  makes  sense  in  the  case  of  9  =  0. 

(c)  Interpret  your  answer  in  the  case  of  9  =  180°. 

(d)  What  is  the  smallest  value  of  9  for  which  S  equals  or  exceeds 

L ,  so  that  for  larger  angles  a  failure  of  at  least  one  anchor  is  more 
likely  than  it  would  have  been  with  a  single  anchor?  V 


Problem  79. 


80  Mountain  climbers  with  masses  m  and  M  are  roped  together 
while  crossing  a  horizontal  glacier  when  a  vertical  crevasse  opens  up 
under  the  climber  with  mass  M.  The  climber  with  mass  m  drops 
down  on  the  snow  and  tries  to  stop  by  digging  into  the  snow  with 
the  pick  of  an  ice  ax.  Alas,  this  story  does  not  have  a  happy  ending, 
because  this  doesn’t  provide  enough  friction  to  stop.  Both  m  and  M 
continue  accelerating,  with  M  dropping  down  into  the  crevasse  and 
m  being  dragged  across  the  snow,  slowed  only  by  the  kinetic  friction 
with  coefficient  fik  acting  between  the  ax  and  the  snow.  There  is  no 
significant  friction  between  the  rope  and  the  lip  of  the  crevasse. 

(a)  Find  the  acceleration  a.  v 

(b)  Check  the  units  of  your  result. 

(c)  Check  the  dependence  of  your  equation  on  the  variables.  That 
means  that  for  each  variable,  you  should  determine  what  its  effect 
on  a  should  be  physically,  and  then  what  your  answer  from  part  a 
says  its  effect  would  be  mathematically. 


m 


Problem  80. 
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Problem  81 . 


81  Complete  example  71  on  p.  205  by  expressing  the  remaining 
nine  x  and  y  components  of  the  forces  in  terms  of  the  five  magnitudes 
and  the  small,  positive  angle  0  ~  9°  by  which  the  crack  overhangs. 

V  ■ 

82  In  a  well  known  stunt  from  circuses  and  carnivals,  a  motor¬ 
cyclist  rides  around  inside  a  big  bowl,  gradually  speeding  up  and 
rising  higher.  Eventually  the  cyclist  can  get  up  to  where  the  walls 
of  the  bowl  are  vertical.  Let’s  estimate  the  conditions  under  which 
a  running  human  could  do  the  same  thing. 

(a)  If  the  runner  can  run  at  speed  v,  and  her  shoes  have  a  coefficient 
of  static  friction  /js ,  what  is  the  maximum  radius  of  the  circle?  v 

(b)  Show  that  the  units  of  your  answer  make  sense. 

(c)  Check  that  its  dependence  on  the  variables  makes  sense. 

(d)  Evaluate  your  result  numerically  for  v  =  10  m/s  (the  speed  of 

an  Olympic  sprinter)  and  fis  =  5.  (This  is  roughly  the  highest  coeffi¬ 
cient  of  static  friction  ever  achieved  for  surfaces  that  are  not  sticky. 
The  surface  has  an  array  of  microscopic  fibers  like  a  hair  brush,  and 
is  inspired  by  the  hairs  on  the  feet  of  a  gecko.  These  assumptions 
are  not  necessarily  realistic,  since  the  person  would  have  to  run  at 
an  angle,  which  would  be  physically  awkward.)  v 
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83  Problem  79  discussed  a  possible  correct  way  of  setting  up 
a  redundant  anchor  for  mountaineering.  The  figure  for  this  prob¬ 
lem  shows  an  incorrect  way  of  doing  it,  by  arranging  the  rope  in 
a  triangle  (which  we’ll  take  to  be  isoceles).  One  of  the  bad  things 
about  the  triangular  arrangement  is  that  it  requires  more  force  from 
the  anchors,  making  them  more  likely  to  fail,  (a)  Using  the  same 
notation  as  in  problem  79,  find  S  in  terms  of  L  and  9.  v 

(b)  Verify  that  your  answer  makes  sense  in  the  case  of  9  =  0,  and 
compare  with  the  correct  setup. 


Problem  83. 


84  At  a  picnic,  someone  hands  you  a  can  of  beer.  The  ground  is 
uneven,  and  you  don’t  want  to  spill  your  drink.  You  reason  that  it 
will  be  more  stable  if  you  drink  some  of  it  first  in  order  to  lower  its 
center  of  mass.  How  much  should  you  drink  in  order  to  make  the 
center  of  mass  as  low  as  possible?  [Based  on  a  problem  by  Walter 
van  B.  Roberts  and  Martin  Gardner.] 


85  “Big  wall”  climbing  is  a  specialized  type  of  rock  climbing  that 
involves  going  up  tall  cliffs  such  as  the  ones  in  Yosemite,  usually  with 
the  climbers  spending  at  least  one  night  sleeping  on  a  natural  ledge 
or  an  artificial  “portaledge.”  In  this  style  of  climbing,  each  pitch  of 
the  climb  involves  strenuously  hauling  up  several  heavy  bags  of  gear 
—  a  fact  that  has  caused  these  climbs  to  be  referred  to  as  “vertical 
ditch  digging.”  (a)  If  an  80  kg  haul  bag  has  to  be  pulled  up  the  full 
length  of  a  60  m  rope,  how  much  work  is  done?  (b)  Since  it  can  be 
difficult  to  lift  80  kg,  a  2:1  pulley  is  often  used.  The  hauler  then 
lifts  the  equivalent  of  40  kg,  but  has  to  pull  in  120  m  of  rope.  How 
much  work  is  done  in  this  case?  V  - 
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86  The  figure  shows  an  arcade  game  called  skee  ball  that  is 
similar  to  bowling.  The  player  rolls  the  ball  down  a  horizontal  alley. 
The  ball  then  rides  up  a  curved  lip  and  is  launched  at  an  initial 
speed  u,  at  an  angle  a  above  horizontal.  Suppose  we  want  the  ball 
to  go  into  a  hole  that  is  at  horizontal  distance  t  and  height  h.  as 
shown  in  the  figure. 

(a)  Find  the  initial  speed  u  that  is  required,  in  terms  of  the  other 

variables  and  g.  V 

(b)  Check  that  your  answer  to  part  a  has  units  that  make  sense. 

(c)  Check  that  your  answer  to  part  a  depends  on  g  in  a  way  that 
makes  sense.  This  means  that  you  should  first  determine  on  physical 
grounds  whether  increasing  g  should  increase  u,  or  decrease  it.  Then 
see  whether  your  answer  to  part  a  has  this  mathematical  behavior. 

(d)  Do  the  same  for  the  dependence  on  h. 

(e)  Interpret  your  equation  in  the  case  where  a  =  90°. 

(f)  Interpret  your  equation  in  the  case  where  tana  =  h/£. 

(g)  Find  u  numerically  if  h  =  70  cm,  £  =  60  cm,  and  a  =  65°.  V 


Problem  86. 


Problem  87. 


87  The  figure  shows  the  International  Space  Station  (ISS).  One  of 
the  purposes  of  the  ISS  is  supposed  to  be  to  carry  out  experiments  in 
microgravity.  However,  the  following  factor  limits  this  application. 
The  ISS  orbits  the  earth  once  every  92.6  minutes.  It  is  desirable  to 
keep  the  same  side  of  the  station  always  oriented  toward  the  earth, 
which  means  that  the  station  has  to  rotate  with  the  same  period. 
In  the  photo,  the  direction  of  orbital  motion  is  left  or  right  on  the 
page,  so  the  rotation  is  about  the  axis  shown  as  up  and  down  on  the 
page.  The  greatest  distance  of  any  pressurized  compartment  from 
the  axis  of  rotation  is  36.5  meters.  Find  the  acceleration  due  to  the 
rotation  at  this  point,  and  the  apparent  weight  of  a  60  kg  astronaut 
at  that  location.  V  ^ 


Key  to  symbols: 

easy  -  typical  B  challenging  B  difficult  B  very  difficult 
V  An  answer  check  is  available  at  www.lightandmatter.com. 
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Exercises 

Exercise  3A:  Force  and  Motion 

Equipment: 

2-meter  pieces  of  butcher  paper 
wood  blocks  with  hooks 
string 

masses  to  put  on  top  of  the  blocks  to  increase  friction 

spring  scales  (preferably  calibrated  in  Newtons) 

Suppose  a  person  pushes  a  crate,  sliding  it  across  the  floor  at  a  certain  speed,  and  then  repeats 
the  same  thing  but  at  a  higher  speed.  This  is  essentially  the  situation  you  will  act  out  in  this 
exercise.  What  do  you  think  is  different  about  her  force  on  the  crate  in  the  two  situations? 
Discuss  this  with  your  group  and  write  down  your  hypothesis: 


1.  First  you  will  measure  the  amount  of  friction  between  the  wood  block  and  the  butcher  paper 
when  the  wood  and  paper  surfaces  are  slipping  over  each  other.  The  idea  is  to  attach  a  spring 
scale  to  the  block  and  then  slide  the  butcher  paper  under  the  block  while  using  the  scale  to 
keep  the  block  from  moving  with  it.  Depending  on  the  amount  of  force  your  spring  scale  was 
designed  to  measure,  you  may  need  to  put  an  extra  mass  on  top  of  the  block  in  order  to  increase 
the  amount  of  friction.  It  is  a  good  idea  to  use  long  piece  of  string  to  attach  the  block  to  the 
spring  scale,  since  otherwise  one  tends  to  pull  at  an  angle  instead  of  directly  horizontally. 

First  measure  the  amount  of  friction  force  when  sliding  the  butcher  paper  as  slowly  as  possi¬ 
ble: _ 

Now  measure  the  amount  of  friction  force  at  a  significantly  higher  speed,  say  1  meter  per  second. 
(If  you  try  to  go  too  fast,  the  motion  is  jerky,  and  it  is  impossible  to  get  an  accurate  reading.) 


Discuss  your  results.  Why  are  we  justified  in  assuming  that  the  string’s  force  on  the  block  (i.e., 
the  scale  reading)  is  the  same  amount  as  the  paper’s  frictional  force  on  the  block? 

2.  Now  try  the  same  thing  but  with  the  block  moving  and  the  paper  standing  still.  Try  two 
different  speeds. 

Do  your  results  agree  with  your  original  hypothesis?  If  not,  discuss  what’s  going  on.  How  does 
the  block  “know”  how  fast  to  go? 
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Exercise  3B:  Vibrations 

Equipment: 

•  air  track  and  carts  of  two  different  masses 

•  springs 

•  spring  scales 


spring 


Place  the  cart  on  the  air  track  and  attach  springs  so  that  it  can  vibrate. 

1.  Test  whether  the  period  of  vibration  depends  on  amplitude.  Try  at  least  two  moderate 
amplitudes,  for  which  the  springs  do  not  go  slack,  and  at  least  one  amplitude  that  is  large 
enough  so  that  they  do  go  slack. 

2.  Try  a  cart  with  a  different  mass.  Does  the  period  change  by  the  expected  factor,  based  on 
the  equation  T  =  2TT^/m/k? 

3.  Use  a  spring  scale  to  pull  the  cart  away  from  equilibrium,  and  make  a  graph  of  force  versus 
position.  Is  it  linear?  If  so,  what  is  its  slope? 

4.  Test  the  equation  T  =  2-K^/m/k  numerically. 
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Exercise  3C:  Worksheet  on  Resonance 

1.  Compare  the  oscillator’s  energies  at  A,  B,  C,  and  D. 


A  B  CD 


2.  Compare  the  Q  values  of  the  two  oscillators. 


3.  Match  the  x-t  graphs  in  ^2  with  the  amplitude-frequency  graphs  below. 


response 


response 
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Exercise  D  is  on  the  following  two  pages. 
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Exercise  3D:  Vectors  and  Motion 

Each  diagram  on  page  244  shows  the  motion  of  an  object  in  an  x  —  y  plane.  Each  dot  is  one 
location  of  the  object  at  one  moment  in  time.  The  time  interval  from  one  dot  to  the  next  is 
always  the  same,  so  you  can  think  of  the  vector  that  connects  one  dot  to  the  next  as  a  v  vector, 
and  subtract  to  find  Av  vectors. 

1.  Suppose  the  object  in  diagram  1  is  moving  from  the  top  left  to  the  bottom  right.  Deduce 
whatever  you  can  about  the  force  acting  on  it.  Does  the  force  always  have  the  same  magnitude? 
The  same  direction? 

Invent  a  physical  situation  that  this  diagram  could  represent. 

What  if  you  reinterpret  the  diagram,  and  reverse  the  object’s  direction  of  motion? 

2.  What  can  you  deduce  about  the  force  that  is  acting  in  diagram  2? 

Invent  a  physical  situation  that  diagram  2  could  represent. 

3.  What  can  you  deduce  about  the  force  that  is  acting  in  diagram  3? 

Invent  a  physical  situation. 
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Chapter  4 

Conservation  of  Angular  Mo¬ 
mentum 

4.1  Angular  Momentum  In  Two  Dimensions 

4.1.1  Angular  momentum 

“Sure,  and  maybe  the  sun  won’t  come  up  tomorrow.”  Of  course, 
the  sun  only  appears  to  go  up  and  down  because  the  earth  spins, 
so  the  cliche  should  really  refer  to  the  unlikelihood  of  the  earth’s 
stopping  its  rotation  abruptly  during  the  night.  Why  can’t  it  stop? 

It  wouldn’t  violate  conservation  of  momentum,  because  the  earth’s 
rotation  doesn’t  add  anything  to  its  momentum.  While  California 
spins  in  one  direction,  some  equally  massive  part  of  India  goes  the 
opposite  way,  canceling  its  momentum.  A  halt  to  Earth’s  rotation 
would  entail  a  drop  in  kinetic  energy,  but  that  energy  could  simply 
be  converted  into  some  other  form,  such  as  heat. 


a /The  jumper  can’t  move  his 
legs  counterclockwise  without 
moving  his  arms  clockwise. 
( Thomas  Eakins.) 


Other  examples  along  these  lines  are  not  hard  to  find.  An  atom 
spins  at  the  same  rate  for  billions  of  years.  A  high-diver  who  is 
rotating  when  he  comes  off  the  board  does  not  need  to  make  any 
physical  effort  to  continue  rotating,  and  indeed  would  be  unable  to 
stop  rotating  before  he  hit  the  water. 

These  observations  have  the  hallmarks  of  a  conservation  law: 

A  closed  system  is  involved.  Nothing  is  making  an  effort  to  twist 
the  earth,  the  hydrogen  atom,  or  the  high-diver.  They  are  isolated 
from  rotation-changing  influences,  i.e. ,  they  are  closed  systems. 

Something  remains  unchanged.  There  appears  to  be  a  numerical 
quantity  for  measuring  rotational  motion  such  that  the  total  amount 
of  that  quantity  remains  constant  in  a  closed  system. 
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b  /  An  overhead  view  of  a 
piece  of  putty  being  thrown  at 
a  door.  Even  though  the  putty 
is  neither  spinning  nor  traveling 
along  a  curve,  we  must  define  it 
has  having  some  kind  of  “rota¬ 
tion”  because  it  is  able  to  make 
the  door  rotate. 


Something  can  he  transferred  back  and  forth  without  changing 
the  total  amount.  In  the  photo  of  the  old-fashioned  high  jump,  a, 
the  jumper  wants  to  get  his  feet  out  in  front  of  him  so  he  can  keep 
from  doing  a  “face  plant”  when  he  lands.  Bringing  his  feet  forward 
would  involve  a  certain  quantity  of  counterclockwise  rotation,  but  he 
didn’t  start  out  with  any  rotation  when  he  left  the  ground.  Suppose 
we  consider  counterclockwise  as  positive  and  clockwise  as  negative. 
The  only  way  his  legs  can  acquire  some  positive  rotation  is  if  some 
other  part  of  his  body  picks  up  an  equal  amount  of  negative  rotation. 
This  is  why  he  swings  his  arms  up  behind  him,  clockwise. 

What  numerical  measure  of  rotational  motion  is  conserved?  Car 
engines  and  old-fashioned  LP  records  have  speeds  of  rotation  mea¬ 
sured  in  rotations  per  minute  (r.p.rn.),  but  the  number  of  rota¬ 
tions  per  minute  (or  per  second)  is  not  a  conserved  quantity.  A 
twirling  figure  skater,  for  instance,  can  pull  her  arms  in  to  increase 
her  r.p.rn. ’s.  The  first  section  of  this  chapter  deals  with  the  nu¬ 
merical  definition  of  the  quantity  of  rotation  that  results  in  a  valid 
conservation  law. 

When  most  people  think  of  rotation,  they  think  of  a  solid  object 
like  a  wheel  rotating  in  a  circle  around  a  fixed  point.  Examples  of 
this  type  of  rotation,  called  rigid  rotation  or  rigid-body  rotation,  in¬ 
clude  a  spinning  top,  a  seated  child’s  swinging  leg,  and  a  helicopter’s 
spinning  propeller.  Rotation,  however,  is  a  much  more  general  phe¬ 
nomenon,  and  includes  noncircular  examples  such  as  a  comet  in 
an  elliptical  orbit  around  the  sun,  or  a  cyclone,  in  which  the  core 
completes  a  circle  more  quickly  than  the  outer  parts. 

If  there  is  a  numerical  measure  of  rotational  motion  that  is  a 
conserved  quantity,  then  it  must  include  nonrigid  cases  like  these, 
since  nonrigid  rotation  can  be  traded  back  and  forth  with  rigid  ro¬ 
tation.  For  instance,  there  is  a  trick  for  finding  out  if  an  egg  is 
raw  or  hardboiled.  If  you  spin  a  hardboiled  egg  and  then  stop  it 
briefly  with  your  finger,  it  stops  dead.  But  if  you  do  the  same  with 
a  raw  egg,  it  springs  back  into  rotation  because  the  soft  interior  was 
still  swirling  around  within  the  momentarily  motionless  shell.  The 
pattern  of  flow  of  the  liquid  part  is  presumably  very  complex  and 
nonuniform  due  to  the  asymmetric  shape  of  the  egg  and  the  differ¬ 
ent  consistencies  of  the  yolk  and  the  white,  but  there  is  apparently 
some  way  to  describe  the  liquid’s  total  amount  of  rotation  with  a 
single  number,  of  which  some  percentage  is  given  back  to  the  shell 
when  you  release  it. 

The  best  strategy  is  to  devise  a  way  of  defining  the  amount  of 
rotation  of  a  single  small  part  of  a  system.  The  amount  of  rotation 
of  a  system  such  as  a  cyclone  will  then  be  defined  as  the  total  of  all 
the  contributions  from  its  many  small  parts. 

The  quest  for  a  conserved  quantity  of  rotation  even  requires  us 
to  broaden  the  rotation  concept  to  include  cases  where  the  motion 
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doesn’t  repeat  or  even  curve  around.  If  you  throw  a  piece  of  putty 
at  a  door,  b,  the  door  will  recoil  and  start  rotating.  The  putty  was 
traveling  straight,  not  in  a  circle,  but  if  there  is  to  be  a  general 
conservation  law  that  can  cover  this  situation,  it  appears  that  we 
must  describe  the  putty  as  having  had  some  “rotation,”  which  it 
then  gave  up  to  the  door.  The  best  way  of  thinking  about  it  is  to 
attribute  rotation  to  any  moving  object  or  part  of  an  object  that 
changes  its  angle  in  relation  to  the  axis  of  rotation.  In  the  putty- 
and-door  example,  the  hinge  of  the  door  is  the  natural  point  to  think 
of  as  an  axis,  and  the  putty  changes  its  angle  as  seen  by  someone 
standing  at  the  hinge,  c.  For  this  reason,  the  conserved  quantity 
we  are  investigating  is  called  angular  momentum.  The  symbol  for 
angular  momentum  can’t  be  “a”  or  “m,”  since  those  are  used  for 
acceleration  and  mass,  so  the  letter  L  is  arbitrarily  chosen  instead. 

Imagine  a  1  kg  blob  of  putty,  thrown  at  the  door  at  a  speed  of 
1  m/s,  which  hits  the  door  at  a  distance  of  1  m  from  the  hinge. 
We  define  this  blob  to  have  1  unit  of  angular  momentum.  When 
it  hits  the  door,  the  door  will  recoil  and  start  rotating.  We  can 
use  the  speed  at  which  the  door  recoils  as  a  measure  of  the  angular 
momentum  the  blob  brought  in.1 

Experiments  show,  not  surprisingly,  that  a  2  kg  blob  thrown  in 
the  same  way  makes  the  door  rotate  twice  as  fast,  so  the  angular 
momentum  of  the  putty  blob  must  be  proportional  to  mass, 

L  oc  m. 

Similarly,  experiments  show  that  doubling  the  velocity  of  the 
blob  will  have  a  doubling  effect  on  the  result,  so  its  angular  momen¬ 
tum  must  be  proportional  to  its  velocity  as  well, 

L  oc  mv. 

You  have  undoubtedly  had  the  experience  of  approaching  a  closed 
door  with  one  of  those  bar-shaped  handles  on  it  and  pushing  on  the 
wrong  side,  the  side  close  to  the  hinges.  You  feel  like  an  idiot,  be¬ 
cause  you  have  so  little  leverage  that  you  can  hardly  budge  the  door. 
The  same  would  be  true  with  the  putty  blob.  Experiments  would 
show  that  the  amount  of  rotation  the  blob  can  give  to  the  door  is 
proportional  to  the  distance,  r,  from  the  axis  of  rotation,  so  angular 
momentum  must  be  proportional  to  r  as  well, 

L  oc  mvr. 

We  are  almost  done,  but  there  is  one  missing  ingredient.  We 
know  on  grounds  of  symmetry  that  a  putty  ball  thrown  directly 

1We  assume  that  the  door  is  much  more  massive  than  the  blob.  Under  this 
assumption,  the  speed  at  which  the  door  recoils  is  much  less  than  the  original 
speed  of  the  blob,  so  the  blob  has  lost  essentially  all  its  angular  momentum,  and 
given  it  to  the  door. 


''O 


c  /  As  seen  by  someone  standing 
at  the  axis,  the  putty  changes 
its  angular  position.  We  there¬ 
fore  define  it  as  having  angular 
momentum. 


O 

O 

O 


d  /  A  putty  blob  thrown  di¬ 
rectly  at  the  axis  has  no  angular 
motion,  and  therefore  no  angular 
momentum.  It  will  not  cause  the 
door  to  rotate. 
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e/Only  the  component  of 
the  velocity  vector  perpendicular 
to  the  line  connecting  the  object 
to  the  axis  should  be  counted 
into  the  definition  of  angular 
momentum. 


f/A  figure  skater  pulls  in  her 
arms  so  that  she  can  execute  a 
spin  more  rapidly. 


inward  toward  the  hinge  will  have  no  angular  momentum  to  give 
to  the  door.  After  all,  there  would  not  even  be  any  way  to  de¬ 
cide  whether  the  ball’s  rotation  was  clockwise  or  counterclockwise 
in  this  situation.  It  is  therefore  only  the  component  of  the  blob’s 
velocity  vector  perpendicular  to  the  door  that  should  be  counted  in 
its  angular  momentum, 

L  =  7Tiv±r. 

More  generally,  v±  should  be  thought  of  as  the  component  of  the 
object’s  velocity  vector  that  is  perpendicular  to  the  line  joining  the 
object  to  the  axis  of  rotation. 

We  find  that  this  equation  agrees  with  the  definition  of  the  orig¬ 
inal  putty  blob  as  having  one  unit  of  angular  momentum,  and  we 
can  now  see  that  the  units  of  angular  momentum  are  (kg-m/s)-m, 
i.e.,  kg-m2/s.  Summarizing,  we  have 

L  =  mv±r  [angular  momentum  of  a  particle  in  two  dimensions], 

where  m  is  the  particle’s  mass,  is  the  component  of  its  velocity 
vector  perpendicular  to  the  line  joining  it  to  the  axis  of  rotation, 
and  r  is  its  distance  from  the  axis.  (Note  that  r  is  not  necessarily 
the  radius  of  a  circle.)  Positive  and  negative  signs  of  angular  mo¬ 
mentum  are  used  to  describe  opposite  directions  of  rotation.  The 
angular  momentum  of  a  finite-sized  object  or  a  system  of  many  ob¬ 
jects  is  found  by  dividing  it  up  into  many  small  parts,  applying  the 
equation  to  each  part,  and  adding  to  find  the  total  amount  of  an¬ 
gular  momentum.  (As  implied  by  the  word  “particle,”  matter  isn’t 
the  only  thing  that  can  have  angular  momentum.  Light  can  also 
have  angular  momentum,  and  the  above  equation  would  not  apply 
to  light.) 

Conservation  of  angular  momentum  has  been  verified  over  and 
over  again  by  experiment,  and  is  now  believed  to  be  one  of  the  most 
fundamental  principles  of  physics,  along  with  conservation  of  mass, 
energy,  and  momentum. 

A  figure  skater  pulls  her  arms  in.  example  1 

When  a  figure  skater  is  twirling,  there  is  very  little  friction  between 
her  and  the  ice,  so  she  is  essentially  a  closed  system,  and  her 
angular  momentum  is  conserved.  If  she  pulls  her  arms  in,  she  is 
decreasing  r  for  all  the  atoms  in  her  arms.  It  would  violate  con¬ 
servation  of  angular  momentum  if  she  then  continued  rotating  at 
the  same  speed,  i.e.,  taking  the  same  amount  of  time  for  each 
revolution,  because  her  arms’  contributions  to  her  angular  mo¬ 
mentum  would  have  decreased,  and  no  other  part  of  her  would 
have  increased  its  angular  momentum.  This  is  impossible  be¬ 
cause  it  would  violate  conservation  of  angular  momentum.  If  her 
total  angular  momentum  is  to  remain  constant,  the  decrease  in  r 
for  her  arms  must  be  compensated  for  by  an  overall  increase  in 
her  rate  of  rotation.  That  is,  by  pulling  her  arms  in,  she  substan¬ 
tially  reduces  the  time  for  each  rotation. 
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Earth’s  slowing  rotation  and  the  receding  moon  example  2 
The  earth’s  rotation  is  actually  slowing  down  very  gradually,  with 
the  kinetic  energy  being  dissipated  as  heat  by  friction  between  the 
land  and  the  tidal  bulges  raised  in  the  seas  by  the  earth’s  gravity. 
Does  this  mean  that  angular  momentum  is  not  really  perfectly 
conserved?  No,  it  just  means  that  the  earth  is  not  quite  a  closed 
system  by  itself.  If  we  consider  the  earth  and  moon  as  a  system, 
then  the  angular  momentum  lost  by  the  earth  must  be  gained  by 
the  moon  somehow.  In  fact  very  precise  measurements  of  the 
distance  between  the  earth  and  the  moon  have  been  carried  out 
by  bouncing  laser  beams  off  of  a  mirror  left  there  by  astronauts, 
and  these  measurements  show  that  the  moon  is  receding  from 
the  earth  at  a  rate  of  4  centimeters  per  year!  The  moon’s  greater 
value  of  r  means  that  it  has  a  greater  angular  momentum,  and 
the  increase  turns  out  to  be  exactly  the  amount  lost  by  the  earth. 
In  the  days  of  the  dinosaurs,  the  days  were  significantly  shorter, 
and  the  moon  was  closer  and  appeared  bigger  in  the  sky. 

But  what  force  is  causing  the  moon  to  speed  up,  drawing  it  out 
into  a  larger  orbit?  It  is  the  gravitational  forces  of  the  earth’s  tidal 
bulges.  In  figure  g,  the  earth’s  rotation  is  counterclockwise  (ar¬ 
row).  The  moon’s  gravity  creates  a  bulge  on  the  side  near  it,  be¬ 
cause  its  gravitational  pull  is  stronger  there,  and  an  “anti-bulge” 
on  the  far  side,  since  its  gravity  there  is  weaker.  For  simplicity,  let’s 
focus  on  the  tidal  bulge  closer  to  the  moon.  Its  frictional  force  is 
trying  to  slow  down  the  earth’s  rotation,  so  its  force  on  the  earth’s 
solid  crust  is  toward  the  bottom  of  the  figure.  By  Newton’s  third 
law,  the  crust  must  thus  make  a  force  on  the  bulge  which  is  to¬ 
ward  the  top  of  the  figure.  This  causes  the  bulge  to  be  pulled 
forward  at  a  slight  angle,  and  the  bulge’s  gravity  therefore  pulls 
the  moon  forward,  accelerating  its  orbital  motion  about  the  earth 
and  flinging  it  outward. 

The  result  would  obviously  be  extremely  difficult  to  calculate  di¬ 
rectly,  and  this  is  one  of  those  situations  where  a  conservation 
law  allows  us  to  make  precise  quantitative  statements  about  the 
outcome  of  a  process  when  the  calculation  of  the  process  itself 
would  be  prohibitively  complex. 


g/A  view  of  the  earth-moon 
system  from  above  the  north 
pole.  All  distances  have  been 
highly  distorted  for  legibility. 


Restriction  to  rotation  in  a  plane 

Is  angular  momentum  a  vector,  or  a  scalar?  It  does  have  a  di¬ 
rection  in  space,  but  it’s  a  direction  of  rotation,  not  a  straight-line 
direction  like  the  directions  of  vectors  such  as  velocity  or  force.  It 
turns  out  that  there  is  a  way  of  defining  angular  momentum  as  a 
vector,  but  in  this  section  the  examples  will  be  confined  to  a  single 
plane  of  rotation,  i.e.,  effectively  two-dimensional  situations.  In  this 
special  case,  we  can  choose  to  visualize  the  plane  of  rotation  from 
one  side  or  the  other,  and  to  define  clockwise  and  counterclockwise 
rotation  as  having  opposite  signs  of  angular  momentum.  “Effec- 
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h  /  The  area  swept  out  by  a 
planet  in  its  orbit. 


tively”  two-dimensional  means  that  we  can  deal  with  objects  that 
aren’t  flat,  as  long  as  the  velocity  vectors  of  all  their  parts  lie  in  a 
plane. 

Discussion  Questions 

A  Conservation  of  plain  old  momentum,  p,  can  be  thought  of  as  the 
greatly  expanded  and  modified  descendant  of  Galileo’s  original  principle 
of  inertia,  that  no  force  is  required  to  keep  an  object  in  motion.  The  princi¬ 
ple  of  inertia  is  counterintuitive,  and  there  are  many  situations  in  which  it 
appears  superficially  that  a  force  is  needed  to  maintain  motion,  as  main¬ 
tained  by  Aristotle.  Think  of  a  situation  in  which  conservation  of  angular 
momentum,  L,  also  seems  to  be  violated,  making  it  seem  incorrectly  that 
something  external  must  act  on  a  closed  system  to  keep  its  angular  mo¬ 
mentum  from  “running  down.” 

4.1.2  Application  to  planetary  motion 

We  now  discuss  the  application  of  conservation  of  angular  mo¬ 
mentum  to  planetary  motion,  both  because  of  its  intrinsic  impor¬ 
tance  and  because  it  is  a  good  way  to  develop  a  visual  intuition  for 
angular  momentum. 

Kepler’s  law  of  equal  areas  states  that  the  area  swept  out  by 
a  planet  in  a  certain  length  of  time  is  always  the  same.  Angular 
momentum  had  not  been  invented  in  Kepler’s  time,  and  he  did  not 
even  know  the  most  basic  physical  facts  about  the  forces  at  work.  He 
thought  of  this  law  as  an  entirely  empirical  and  unexpectedly  simple 
way  of  summarizing  his  data,  a  rule  that  succeeded  in  describing 
and  predicting  how  the  planets  sped  up  and  slowed  down  in  their 
elliptical  paths.  It  is  now  fairly  simple,  however,  to  show  that  the 
equal  area  law  amounts  to  a  statement  that  the  planet’s  angular 
momentum  stays  constant. 

There  is  no  simple  geometrical  rule  for  the  area  of  a  pie  wedge 
cut  out  of  an  ellipse,  but  if  we  consider  a  very  short  time  interval, 
as  shown  in  figure  h,  the  shaded  shape  swept  out  by  the  planet  is 
very  nearly  a  triangle.  We  do  know  how  to  compute  the  area  of  a 
triangle.  It  is  one  half  the  product  of  the  base  and  the  height: 

area  =  -bh. 

2 

We  wish  to  relate  this  to  angular  momentum,  which  contains  the 
variables  r  and  u_l  .  If  we  consider  the  sun  to  be  the  axis  of  rotation, 
then  the  variable  r  is  identical  to  the  base  of  the  triangle,  r  =  b. 
Referring  to  the  magnified  portion  of  the  figure,  v±  can  be  related 
to  h,  because  the  two  right  triangles  are  similar: 

h  v± 

distance  traveled  |v| 

The  area  can  thus  be  rewritten  as 

1  v±  (distance  traveled) 

area  =  -r - : — ; - . 

2  v 
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The  distance  traveled  equals  |v|At,  so  this  simplifies  to 

area  =  -rv±At. 


We  have  found  the  following  relationship  between  angular  momen¬ 
tum  and  the  rate  at  which  area  is  swept  out: 


L  =  2m 


area 

~At' 


The  factor  of  2  in  front  is  simply  a  matter  of  convention,  since  any 
conserved  quantity  would  be  an  equally  valid  conserved  quantity  if 
you  multiplied  it  by  a  constant.  The  factor  of  m  was  not  relevant 
to  Kepler,  who  did  not  know  the  planets’  masses,  and  who  was  only 
describing  the  motion  of  one  planet  at  a  time. 

We  thus  find  that  Kepler’s  equal-area  law  is  equivalent  to  a  state¬ 
ment  that  the  planet’s  angular  momentum  remains  constant.  But 
wait,  why  should  it  remain  constant?  —  the  planet  is  not  a  closed 
system,  since  it  is  being  acted  on  by  the  sun’s  gravitational  force. 
There  are  two  valid  answers.  The  first  is  that  it  is  actually  the  to¬ 
tal  angular  momentum  of  the  sun  plus  the  planet  that  is  conserved. 
The  sun,  however,  is  millions  of  times  more  massive  than  the  typical 
planet,  so  it  accelerates  very  little  in  response  to  the  planet’s  gravi¬ 
tational  force.  It  is  thus  a  good  approximation  to  say  that  the  sun 
doesn’t  move  at  all,  so  that  no  angular  momentum  is  transferred 
between  it  and  the  planet. 

The  second  answer  is  that  to  change  the  planet’s  angular  mo¬ 
mentum  requires  not  just  a  force  but  a  force  applied  in  a  certain 
way.  Later  in  this  section  (starting  on  page  254)  we  discuss  the 
transfer  of  angular  momentum  by  a  force,  but  the  basic  idea  here  is 
that  a  force  directly  in  toward  the  axis  does  not  change  the  angular 
momentum. 

Discussion  Questions 

A  Suppose  an  object  is  simply  traveling  in  a  straight  line  at  constant 
speed.  If  we  pick  some  point  not  on  the  line  and  call  it  the  axis  of  rotation, 
is  area  swept  out  by  the  object  at  a  constant  rate? 

B  The  figure  is  a  strobe  photo  of  a  pendulum  bob,  taken  from  under¬ 
neath  the  pendulum  looking  straight  up.  The  black  string  can’t  be  seen  in 
the  photograph.  The  bob  was  given  a  slight  sideways  push  when  it  was 
released,  so  it  did  not  swing  in  a  plane.  The  bright  spot  marks  the  center, 
i.e.,  the  position  the  bob  would  have  if  it  hung  straight  down  at  us.  Does 
the  bob’s  angular  momentum  appear  to  remain  constant  if  we  consider 
the  center  to  be  the  axis  of  rotation? 


4.1.3  Two  theorems  about  angular  momentum 

With  plain  old  momentum,  p,  we  had  the  freedom  to  work  in 
any  inertial  frame  of  reference  we  liked.  The  same  object  could 
have  different  values  of  momentum  in  two  different  frames,  if  the 


i  /  Discussion  question  B. 
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j  /  Two  asteroids  collide. 


k  /  Everyone  has  a  strong 
tendency  to  think  of  the  diver  as 
rotating  about  his  own  center  of 
mass.  However,  he  is  flying  in  an 
arc,  and  he  also  has  angular  mo¬ 
mentum  because  of  this  motion. 


I /This  rigid  object  has  angu¬ 
lar  momentum  both  because  it  is 
spinning  about  its  center  of  mass 
and  because  it  is  moving  through 
space. 


frames  were  not  at  rest  with  respect  to  each  other.  Conservation 
of  momentum,  however,  would  be  true  in  either  frame.  As  long  as 
we  employed  a  single  frame  consistently  throughout  a  calculation, 
everything  would  work. 

The  same  is  true  for  angular  momentum,  and  in  addition  there 
is  an  ambiguity  that  arises  from  the  definition  of  an  axis  of  rotation. 
For  a  wheel,  the  natural  choice  of  an  axis  of  rotation  is  obviously  the 
axle,  but  what  about  an  egg  rotating  on  its  side?  The  egg  has  an 
asymmetric  shape,  and  thus  no  clearly  defined  geometric  center.  A 
similar  issue  arises  for  a  cyclone,  which  does  not  even  have  a  sharply 
defined  shape,  or  for  a  complicated  machine  with  many  gears.  The 
following  theorem,  the  first  of  two  presented  in  this  section,  explains 
how  to  deal  with  this  issue.  Although  I  have  put  descriptive  titles 
above  both  theorems,  they  have  no  generally  accepted  names.  The 
proofs,  given  on  page  953,  use  the  vector  cross-product  technique 
introduced  in  section  4.3,  which  greatly  simplifies  them. 

The  choice  of  axis  theorem:  It  is  entirely  arbitrary  what  point 
one  defines  as  the  axis  for  purposes  of  calculating  angular  momen¬ 
tum.  If  a  closed  system’s  angular  momentum  is  conserved  when 
calculated  with  one  choice  of  axis,  then  it  will  be  conserved  for  any 
other  choice  of  axis.  Likewise,  any  inertial  frame  of  reference  may 
be  used.  The  theorem  also  holds  in  the  case  where  the  system  is  not 
closed,  but  the  total  external  force  is  zero. 

Colliding  asteroids  described  with  different  axes  example  3 
Observers  on  planets  A  and  B  both  see  the  two  asteroids  collid¬ 
ing.  The  asteroids  are  of  equal  mass  and  their  impact  speeds  are 
the  same.  Astronomers  on  each  planet  decide  to  define  their  own 
planet  as  the  axis  of  rotation.  Planet  A  is  twice  as  far  from  the  col¬ 
lision  as  planet  B.  The  asteroids  collide  and  stick.  For  simplicity, 
assume  planets  A  and  B  are  both  at  rest. 

With  planet  A  as  the  axis,  the  two  asteroids  have  the  same  amount 
of  angular  momentum,  but  one  has  positive  angular  momentum 
and  the  other  has  negative.  Before  the  collision,  the  total  angular 
momentum  is  therefore  zero.  After  the  collision,  the  two  asteroids 
will  have  stopped  moving,  and  again  the  total  angular  momen¬ 
tum  is  zero.  The  total  angular  momentum  both  before  and  after 
the  collision  is  zero,  so  angular  momentum  is  conserved  if  you 
choose  planet  A  as  the  axis. 

The  only  difference  with  planet  B  as  axis  is  that  r  is  smaller  by  a 
factor  of  two,  so  all  the  angular  momenta  are  halved.  Even  though 
the  angular  momenta  are  different  than  the  ones  calculated  by 
planet  A,  angular  momentum  is  still  conserved. 

The  earth  spins  on  its  own  axis  once  a  day,  but  simultaneously 
travels  in  its  circular  one-year  orbit  around  the  sun,  so  any  given 
part  of  it  traces  out  a  complicated  loopy  path.  It  would  seem  difficult 
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to  calculate  the  earth’s  angular  momentum,  but  it  turns  out  that 
there  is  an  intuitively  appealing  shortcut:  we  can  simply  add  up  the 
angular  momentum  due  to  its  spin  plus  that  arising  from  its  center 
of  mass’s  circular  motion  around  the  sun.  This  is  a  special  case  of 
the  following  general  theorem: 

The  spin  theorem:  An  object’s  angular  momentum  with  respect 
to  some  outside  axis  A  can  be  found  by  adding  up  two  parts: 

(1)  The  first  part  is  the  object’s  angular  momentum  found  by  using 
its  own  center  of  mass  as  the  axis,  i.e.,  the  angular  momentum  the 
object  has  because  it  is  spinning. 

(2)  The  other  part  equals  the  angular  momentum  that  the  object 
would  have  with  respect  to  the  axis  A  if  it  had  all  its  mass  concen¬ 
trated  at  and  moving  with  its  center  of  mass. 

A  system  with  its  center  of  mass  at  rest  example  4 

In  the  special  case  of  an  object  whose  center  of  mass  is  at  rest, 
the  spin  theorem  implies  that  the  object’s  angular  momentum  is 
the  same  regardless  of  what  axis  we  choose.  (This  is  an  even 
stronger  statement  than  the  choice  of  axis  theorem,  which  only 
guarantees  that  angular  momentum  is  conserved  for  any  given 
choice  of  axis,  without  specifying  that  it  is  the  same  for  all  such 
choices.) 

Angular  momentum  of  a  rigid  object  example  5 

>  A  motorcycle  wheel  has  almost  all  its  mass  concentrated  at 
the  outside.  If  the  wheel  has  mass  m  and  radius  r,  and  the  time 
required  for  one  revolution  is  T,  what  is  the  spin  part  of  its  angular 
momentum? 

o  This  is  an  example  of  the  commonly  encountered  special  case 
of  rigid  motion,  as  opposed  to  the  rotation  of  a  system  like  a  hur¬ 
ricane  in  which  the  different  parts  take  different  amounts  of  time 
to  go  around.  We  don’t  really  have  to  go  through  a  laborious 
process  of  adding  up  contributions  from  all  the  many  parts  of  a 
wheel,  because  they  are  all  at  about  the  same  distance  from  the 
axis,  and  are  all  moving  around  the  axis  at  about  the  same  speed. 
The  velocity  is  all  perpendicular  to  the  spokes, 

v±  =  (circumference)/ T 
=  2nr/T 

and  the  angular  momentum  of  the  wheel  about  its  center  is 

L  =  mv±r 
=  m(2nr/T)r 
=  2nmr2  /T. 


Note  that  although  the  factors  of  27r  in  this  expression  is  peculiar 
to  a  wheel  with  its  mass  concentrated  on  the  rim,  the  proportional¬ 
ity  to  m/T  would  have  been  the  same  for  any  other  rigidly  rotating 
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m/The  plane’s  four  engines 
produce  zero  total  torque  but  not 
zero  total  force. 


axis 

X  • 


X  • 

1 

F 

n/The  simple  physical  situa¬ 
tion  we  use  to  derive  an  equation 
for  torque.  A  force  that  points 
directly  in  at  or  out  away  from  the 
axis  produces  neither  clockwise 
nor  counterclockwise  angular 
momentum.  A  force  in  the  per¬ 
pendicular  direction  does  transfer 
angular  momentum. 


object.  Although  an  object  with  a  noncircular  shape  does  not  have 
a  radius,  it  is  also  true  in  general  that  angular  momentum  is  pro¬ 
portional  to  the  square  of  the  object’s  size  for  fixed  values  of  m  and 
T.  For  instance  doubling  an  object’s  size  doubles  both  the  v±  and 
r  factors  in  the  contribution  of  each  of  its  parts  to  the  total  angular 
momentum,  resulting  in  an  overall  factor  of  four  increase. 

4.1.4  Torque 

Force  is  the  rate  of  transfer  of  momentum.  The  corresponding 
quantity  in  the  case  of  angular  momentum  is  called  torque  (rhymes 
with  “fork”).  Where  force  tells  us  how  hard  we  are  pushing  or 
pulling  on  something,  torque  indicates  how  hard  we  are  twisting  on 
it.  Torque  is  represented  by  the  Greek  letter  tau,  r,  and  the  rate 
of  change  of  an  object’s  angular  momentum  equals  the  total  torque 
acting  on  it, 

Ttotal  =  d  L/  d  t. 


As  with  force  and  momentum,  it  often  happens  that  angular 
momentum  recedes  into  the  background  and  we  focus  our  interest  on 
the  torques.  The  torque-focused  point  of  view  is  exemplified  by  the 
fact  that  many  scientifically  untrained  but  mechanically  apt  people 
know  all  about  torque,  but  none  of  them  have  heard  of  angular 
momentum.  Car  enthusiasts  eagerly  compare  engines’  torques,  and 
there  is  a  tool  called  a  torque  wrench  which  allows  one  to  apply  a 
desired  amount  of  torque  to  a  screw  and  avoid  overtightening  it. 

Torque  distinguished  from  force 

Of  course  a  force  is  necessary  in  order  to  create  a  torque  —  you 
can’t  twist  a  screw  without  pushing  on  the  wrench  —  but  force  and 
torque  are  two  different  things.  One  distinction  between  them  is 
direction.  We  use  positive  and  negative  signs  to  represent  forces  in 
the  two  possible  directions  along  a  line.  The  direction  of  a  torque, 
however,  is  clockwise  or  counterclockwise,  not  a  linear  direction. 

The  other  difference  between  torque  and  force  is  a  matter  of 
leverage.  A  given  force  applied  at  a  door’s  knob  will  change  the 
door’s  angular  momentum  twice  as  rapidly  as  the  same  force  applied 
halfway  between  the  knob  and  the  hinge.  The  same  amount  of  force 
produces  different  amounts  of  torque  in  these  two  cases. 

It’s  possible  to  have  a  zero  total  torque  with  a  nonzero  total 
force.  An  airplane  with  four  jet  engines  would  be  designed  so  that 
their  forces  are  balanced  on  the  left  and  right.  Their  forces  are  all  in 
the  same  direction,  but  the  clockwise  torques  of  two  of  the  engines 
are  canceled  by  the  counterclockwise  torques  of  the  other  two,  giving 
zero  total  torque. 

Conversely  we  can  have  zero  total  force  and  nonzero  total  torque. 
A  merry-go-round’s  engine  needs  to  supply  a  nonzero  torque  on  it 
to  bring  it  up  to  speed,  but  there  is  zero  total  force  on  it.  If  there 
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was  not  zero  total  force  on  it,  its  center  of  mass  would  accelerate! 

Relationship  between  force  and  torque 

How  do  we  calculate  the  amount  of  torque  produced  by  a  given 
force?  Since  it  depends  on  leverage,  we  should  expect  it  to  depend 
on  the  distance  between  the  axis  and  the  point  of  application  of 
the  force.  I’ll  work  out  an  equation  relating  torque  to  force  for  a 
particular  very  simple  situation,  and  give  a  more  rigorous  derivation 
on  page  284,  after  developing  some  mathematical  techniques  that 
dramatically  shorten  and  simplify  the  proof. 

Consider  a  pointlike  object  which  is  initially  at  rest  at  a  distance 
r  from  the  axis  we  have  chosen  for  defining  angular  momentum. 
We  first  observe  that  a  force  directly  inward  or  outward,  along  the 
line  connecting  the  axis  to  the  object,  does  not  impart  any  angular 
momentum  to  the  object. 

A  force  perpendicular  to  the  line  connecting  the  axis  and  the 
object  does,  however,  make  the  object  pick  up  angular  momentum. 
Newton’s  second  law  gives 

a  =  F/m, 

and  using  a  =  dv/  dt  we  find  the  velocity  the  object  acquires  after 
a  time  dt, 

du  =  Fdt/m. 

We’re  trying  to  relate  force  to  a  change  in  angular  momentum,  so 
we  multiply  both  sides  of  the  equation  by  rnr  to  give 

to  dvr  =  F  dtr 
dL  =  F  dt  r. 

Dividing  by  dt  gives  the  torque: 

dL 
dt 

T 

If  a  force  acts  at  an  angle  other  than  0  or  90°  with  respect  to  the 
line  joining  the  object  and  the  axis,  it  would  be  only  the  component 
of  the  force  perpendicular  to  the  line  that  would  produce  a  torque, 

r  =  F±r. 

Although  this  result  was  proved  under  a  simplified  set  of  circum¬ 
stances,  it  is  more  generally  valid:2 

Relationship  between  force  and  torque:  The  rate  at  which 
a  force  transfers  angular  momentum  to  an  object,  i.e.,  the  torque 
produced  by  the  force,  is  given  by 

M  =  r\F±\, 

2  A  proof  is  given  in  example  28  on  page  284 


=  Fr 
=  Fr. 


F 


o/The  geometric  relationships 
referred  to  in  the  relationship 
between  force  and  torque. 
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where  r  is  the  distance  from  the  axis  to  the  point  of  application  of 
the  force,  and  Fj_  is  the  component  of  the  force  that  is  perpendicular 
to  the  line  joining  the  axis  to  the  point  of  application. 

The  equation  is  stated  with  absolute  value  signs  because  the 
positive  and  negative  signs  of  force  and  torque  indicate  different 
things,  so  there  is  no  useful  relationship  between  them.  The  sign 
of  the  torque  must  be  found  by  physical  inspection  of  the  case  at 
hand. 

From  the  equation,  we  see  that  the  units  of  torque  can  be  writ¬ 
ten  as  newtons  multiplied  by  meters.  Metric  torque  wrenches  are 
calibrated  in  N-m,  but  American  ones  use  foot-pounds,  which  is  also 
a  unit  of  distance  multiplied  by  a  unit  of  force.  We  know  from  our 
study  of  mechanical  work  that  newtons  multiplied  by  meters  equal 
joules,  but  torque  is  a  completely  different  quantity  from  work,  and 
nobody  writes  torques  with  units  of  joules,  even  though  it  would  be 
technically  correct. 


p  /  Self-check. 


self-check  A 

Compare  the  magnitudes  and  signs  of  the  four  torques  shown  in  figure 
p.  >  Answer,  p.  963 

How  torque  depends  on  the  direction  of  the  force  example  6 

>  How  can  the  torque  applied  to  the  wrench  in  the  figure  be  ex¬ 
pressed  in  terms  of  r,  |Fj,  and  the  angle  0? 

>  The  force  vector  and  its  F±  component  form  the  hypotenuse 
and  one  leg  of  a  right  triangle, 


and  the  interior  angle  opposite  to 
of  F±  can  thus  be  expressed  as 

F±  =  |F| 


Fj_  equals  0.  The  absolute  value 
sin  0, 


leading  to 


t|  =  r|F|  sin  0. 
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Sometimes  torque  can  be  more  neatly  visualized  in  terms  of  the 
quantity  rj_  shown  in  the  figure  on  the  left,  which  gives  us  a  third 
way  of  expressing  the  relationship  between  torque  and  force: 

|T|  =  r_i_|.F|. 

Of  course  you  wouldn’t  want  to  go  and  memorize  all  three  equa¬ 
tions  for  torque.  Starting  from  any  one  of  them  you  could  easily 
derive  the  other  two  using  trigonometry.  Familiarizing  yourself  with 
them  can  however  clue  you  in  to  easier  avenues  of  attack  on  certain 
problems. 

The  torque  due  to  gravity 

Up  until  now  we’ve  been  thinking  in  terms  of  a  force  that  acts 
at  a  single  point  on  an  object,  such  as  the  force  of  your  hand  on  the 
wrench.  This  is  of  course  an  approximation,  and  for  an  extremely 
realistic  calculation  of  your  hand’s  torque  on  the  wrench  you  might 
need  to  add  up  the  torques  exerted  by  each  square  millimeter  where 
your  skin  touches  the  wrench.  This  is  seldom  necessary.  But  in 
the  case  of  a  gravitational  force,  there  is  never  any  single  point  at 
which  the  force  is  applied.  Our  planet  is  exerting  a  separate  tug  on 
every  brick  in  the  Leaning  Tower  of  Pisa,  and  the  total  gravitational 
torque  on  the  tower  is  the  sum  of  the  torques  contributed  by  all  the 
little  forces.  Luckily  there  is  a  trick  that  allows  us  to  avoid  such 
a  massive  calculation.  It  turns  out  that  for  purposes  of  computing 
the  total  gravitational  torque  on  an  object,  you  can  get  the  right 
answer  by  just  pretending  that  the  whole  gravitational  force  acts  at 
the  object’s  center  of  mass. 

Gravitational  torque  on  an  outstretched  arm  example  7 

>  Your  arm  has  a  mass  of  3.0  kg,  and  its  center  of  mass  is  30 
cm  from  your  shoulder.  What  is  the  gravitational  torque  on  your 
arm  when  it  is  stretched  out  horizontally  to  one  side,  taking  the 
shoulder  to  be  the  axis? 

o  The  total  gravitational  force  acting  on  your  arm  is 
|F|  =  (3.0  kg)(9.8  m/s2)  =  29  N. 

For  the  purpose  of  calculating  the  gravitational  torque,  we  can 
treat  the  force  as  if  it  acted  at  the  arm’s  center  of  mass.  The  force 
is  straight  down,  which  is  perpendicular  to  the  line  connecting  the 
shoulder  to  the  center  of  mass,  so 

F±  =  |F|  =  29  N. 

Continuing  to  pretend  that  the  force  acts  at  the  center  of  the  arm, 
r  equals  30  cm  =  0.30  m,  so  the  torque  is 

x  =  r  Fj_  =  9  N  •  m. 


q  /  Visualizing  torque  in  terms  of 
r±. 


r  /  Example  7. 
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t  /  Discussion  question  E. 


Discussion  Questions 

A  This  series  of  discussion  questions  deals  with  past  students’  incorrect 
reasoning  about  the  following  problem. 

Suppose  a  comet  is  at  the  point  in  its  orbit  shown  in  the  figure.  The 
only  force  on  the  comet  is  the  sun’s  gravitational  force.  Throughout  the 
question,  define  all  torques  and  angular  momenta  using  the  sun  as  the 
axis. 

(1)  Is  the  sun  producing  a  nonzero  torque  on  the  comet?  Explain. 

(2)  Is  the  comet’s  angular  momentum  increasing,  decreasing,  or  staying 
the  same?  Explain. 

Explain  what  is  wrong  with  the  following  answers.  In  some  cases,  the 
answer  is  correct,  but  the  reasoning  leading  up  to  it  is  wrong. 

(a)  Incorrect  answer  to  part  (1):  “Yes,  because  the  sun  is  exerting  a  force 
on  the  comet,  and  the  comet  is  a  certain  distance  from  the  sun.” 

(b)  Incorrect  answer  to  part  (1):  “No,  because  the  torques  cancel  out.” 

(c)  Incorrect  answer  to  part  (2):  “Increasing,  because  the  comet  is  speed¬ 
ing  up.” 


comet 


u  /  Discussion  question  A. 


B  You  whirl  a  rock  over  your  head  on  the  end  of  a  string,  and  gradually 
pull  in  the  string,  eventually  cutting  the  radius  in  half.  What  happens  to 
the  rock’s  angular  momentum?  What  changes  occur  in  its  speed,  the  time 
required  for  one  revolution,  and  its  acceleration?  Why  might  the  string 
break? 

C  A  helicopter  has,  in  addition  to  the  huge  fan  blades  on  top,  a  smaller 
propeller  mounted  on  the  tail  that  rotates  in  a  vertical  plane.  Why? 

D  Which  claw  hammer  would  make  it  easier  to  get  the  nail  out  of  the 
wood  if  the  same  force  was  applied  in  the  same  direction? 

E  The  photo  shows  an  amusement  park  ride  whose  two  cars  rotate  in 
opposite  directions.  Why  is  this  a  good  design? 

4.1.5  Applications  to  statics 

In  chapter  2  I  defined  equilibrium  as  a  situation  where  the  inter¬ 
action  energy  is  minimized.  This  is  the  same  as  a  condition  of  zero 
total  force,  or  constant  momentum.  Thus  a  car  is  in  equilibrium  not 
just  when  it  is  parked  but  also  when  it  is  cruising  down  a  straight 
road  with  constant  momentum. 

Likewise  there  are  many  cases  where  a  system  is  not  closed  but 
maintains  constant  angular  momentum.  When  a  merry-go-round 
is  running  at  constant  angular  momentum,  the  engine’s  torque  is 
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being  canceled  by  the  torque  due  to  friction. 

It’s  not  enough  for  a  boat  not  to  sink  —  we’d  also  like  to  avoid 
having  it  capsize.  For  this  reason,  we  now  redefine  equilibrium  as 
follows. 

When  an  object  has  constant  momentum  and  constant  angular 
momentum,  we  say  that  it  is  in  equilibrium.  Again,  this  is  a  scientific 
redefinition  of  the  common  English  word,  since  in  ordinary  speech 
nobody  would  describe  a  car  spinning  out  on  an  icy  road  as  being 
in  equilibrium. 

Very  commonly,  however,  we  are  interested  in  cases  where  an 
object  is  not  only  in  equilibrium  but  also  at  rest,  and  this  corre¬ 
sponds  more  closely  to  the  usual  meaning  of  the  word.  Statics  is  the 
branch  of  physics  concerned  with  problems  such  as  these. 

Solving  statics  problems  is  now  simply  a  matter  of  applying  and 
combining  some  things  you  already  know: 

-  You  know  the  behaviors  of  the  various  types  of  forces,  for  exam¬ 
ple  that  a  frictional  force  is  always  parallel  to  the  surface  of  contact. 

-  You  know  about  vector  addition  of  forces.  It  is  the  vector  sum 
of  the  forces  that  must  equal  zero  to  produce  equilibrium. 

-  You  know  about  torque.  The  total  torque  acting  on  an  object 
must  be  zero  if  it  is  to  be  in  equilibrium. 

-  You  know  that  the  choice  of  axis  is  arbitrary,  so  you  can  make 
a  choice  of  axis  that  makes  the  problem  easy  to  solve. 

In  general,  this  type  of  problem  could  involve  four  equations  in 
four  unknowns:  three  equations  that  say  the  force  components  add 
up  to  zero,  and  one  equation  that  says  the  total  torque  is  zero.  Most 
cases  you’ll  encounter  will  not  be  this  complicated.  In  the  example 
below,  only  the  equation  for  zero  total  torque  is  required  in  order 
to  get  an  answer. 

A  flagpole  example  8 

>  A  10-kg  flagpole  is  being  held  up  by  a  lightweight  horizontal 
cable,  and  is  propped  against  the  foot  of  a  wall  as  shown  in  the 
figure.  If  the  cable  is  only  capable  of  supporting  a  tension  of  70 
N,  how  great  can  the  angle  a  be  without  breaking  the  cable? 

o  All  three  objects  in  the  figure  are  supposed  to  be  in  equilibrium: 
the  pole,  the  cable,  and  the  wall.  Whichever  of  the  three  objects 
we  pick  to  investigate,  all  the  forces  and  torques  on  it  have  to 
cancel  out.  It  is  not  particularly  helpful  to  analyze  the  forces  and 
torques  on  the  wall,  since  it  has  forces  on  it  from  the  ground  that 
are  not  given  and  that  we  don’t  want  to  find.  We  could  study  the 
forces  and  torques  on  the  cable,  but  that  doesn’t  let  us  use  the 
given  information  about  the  pole.  The  object  we  need  to  analyze 
is  the  pole. 


v/The  windmills  are  not  closed 
systems,  but  angular  momentum 
is  being  transferred  out  of  them 
at  the  same  rate  it  is  transferred 
in,  resulting  in  constant  angular 
momentum.  To  get  an  idea  of 
the  huge  scale  of  the  modern 
windmill  farm,  note  the  sizes  of 
the  trucks  and  trailers. 


w  /  Example  8. 
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The  pole  has  three  forces  on  it,  each  of  which  may  also  result  in 
a  torque:  (1)  the  gravitational  force,  (2)  the  cable’s  force,  and  (3) 
the  wall’s  force. 

We  are  free  to  define  an  axis  of  rotation  at  any  point  we  wish,  and 
it  is  helpful  to  define  it  to  lie  at  the  bottom  end  of  the  pole,  since 
by  that  definition  the  wall’s  force  on  the  pole  is  applied  at  r  =  0 
and  thus  makes  no  torque  on  the  pole.  This  is  good,  because  we 
don’t  know  what  the  wall’s  force  on  the  pole  is,  and  we  are  not 
trying  to  find  it. 

With  this  choice  of  axis,  there  are  two  nonzero  torques  on  the 
pole,  a  counterclockwise  torque  from  the  cable  and  a  clockwise 
torque  from  gravity.  Choosing  to  represent  counterclockwise  torques 
as  positive  numbers,  and  using  the  equation  |t|  =  rjFjsine,  we 
have 


l~cable \F cable  sin  &cable  ~  f grav\F grav\  Sin  Qgrav  —  0. 

A  little  geometry  gives  Q cable  =  90°  -  a  and  Qgrav  =  a,  so 

rcable\F cable  Sin(90  —  a)  —  fgrav\F grav\  Sin  a  =  0. 

The  gravitational  force  can  be  considered  as  acting  at  the  pole’s 
center  of  mass,  i.e.,  at  its  geometrical  center,  so  rcabte  is  twice 
rgrav,  and  we  can  simplify  the  equation  to  read 

2|FcaWe|  sin(90°  -  a)  -  \Fgrav\  sin  a  =  0. 

These  are  all  quantities  we  were  given,  except  for  a,  which  is  the 
angle  we  want  to  find.  To  solve  for  a  we  need  to  use  the  trig 
identity  sin(90°  -  x)  =  cosx, 


9 1 F cable  COS  Ot  —  \  F grav  \  Sin  (X  —  0, 


which  allows  us  to  find 


tari  a  =  2^4 

\'  grav\ 


a  =  tan  1 

(  2 

\Fcable\ 

\ 

1 F grav  \  , 

=  tan-1 

(2 

70  N 
X  98  N 

=  55°. 

260 


Chapter  4  Conservation  of  Angular  Momentum 


b 


Art!  example  9 

>  The  abstract  sculpture  shown  in  figure  x  contains  a  cube  of 
mass  m  and  sides  of  length  b.  The  cube  rests  on  top  of  a  cylinder, 
which  is  off-center  by  a  distance  a.  Find  the  tension  in  the  cable. 

>  There  are  four  forces  on  the  cube:  a  gravitational  force  mg,  the 
force  Ft  from  the  cable,  the  upward  normal  force  from  the  cylin¬ 
der,  Fn,  and  the  horizontal  static  frictional  force  from  the  cylinder, 
Fs- 

The  total  force  on  the  cube  in  the  vertical  direction  is  zero: 

Fn  —  mg  =  0. 


◄ - ► 


As  our  axis  for  defining  torques,  it’s  convenient  to  choose  the  point 
of  contact  between  the  cube  and  the  cylinder,  because  then  nei¬ 
ther  Fs  nor  Fn  makes  any  torque.  The  cable’s  torque  is  coun¬ 
terclockwise,  and  the  torque  due  to  gravity  is  clockwise,  and  the 
cylinder’s  torque  is  clockwise.  Letting  counterclockwise  torques 
be  positive,  and  using  the  convenient  equation  r  =  r±F,  we  find 
the  equation  for  the  total  torque: 

bFj  —  FNa  =  0. 

We  could  also  write  down  the  equation  saying  that  the  total  hori¬ 
zontal  force  is  zero,  but  that  would  bring  in  the  cylinder’s  frictional 
force  on  the  cube,  which  we  don’t  know  and  don’t  need  to  find.  We 
already  have  two  equations  in  the  two  unknowns  FT  and  FN,  so 
there’s  no  need  to  make  it  into  three  equations  in  three  unknowns. 
Solving  the  first  equation  for  FN  =  mg,  we  then  substitute  into  the 
second  equation  to  eliminate  FN,  and  solve  for  FT  =  (a/b)mg. 

Why  is  one  equilibrium  stable  and  another  unstable?  Try  push¬ 
ing  your  own  nose  to  the  left  or  the  right.  If  you  push  it  a  millimeter 
to  the  left,  it  responds  with  a  gentle  force  to  the  right.  If  you  push 
it  a  centimeter  to  the  left,  its  force  on  your  finger  becomes  much 
stronger.  The  defining  characteristic  of  a  stable  equilibrium  is  that 
the  farther  the  object  is  moved  away  from  equilibrium,  the  stronger 
the  force  is  that  tries  to  bring  it  back. 

The  opposite  is  true  for  an  unstable  equilibrium.  In  the  top 
figure,  the  ball  resting  on  the  round  hill  theoretically  has  zero  total 
force  on  it  when  it  is  exactly  at  the  top.  But  in  reality  the  total 
force  will  not  be  exactly  zero,  and  the  ball  will  begin  to  move  off  to 
one  side.  Once  it  has  moved,  the  net  force  on  the  ball  is  greater  than 
it  was,  and  it  accelerates  more  rapidly.  In  an  unstable  equilibrium, 
the  farther  the  object  gets  from  equilibrium,  the  stronger  the  force 
that  pushes  it  farther  from  equilibrium. 

This  idea  can  be  rephrased  in  terms  of  energy.  The  difference 
between  the  stable  and  unstable  equilibria  shown  in  figure  y  is  that 
in  the  stable  equilibrium,  the  energy  is  at  a  minimum,  and  moving 


x  /  Example  9. 


unstable 


y  /  Stable  and  unstable  equi¬ 
libria. 


z/The  dancer’s  equilibrium 
is  unstable.  If  she  didn’t  con¬ 
stantly  make  tiny  adjustments, 
she  would  tip  over. 
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aa  /  Example  10. 


to  either  side  of  equilibrium  will  increase  it,  whereas  the  unstable 
equilibrium  represents  a  maximum. 

Note  that  we  are  using  the  term  “stable”  in  a  weaker  sense  than 
in  ordinary  speech.  A  domino  standing  upright  is  stable  in  the  sense 
we  are  using,  since  it  will  not  spontaneously  fall  over  in  response  to 
a  sneeze  from  across  the  room  or  the  vibration  from  a  passing  truck. 
We  would  only  call  it  unstable  in  the  technical  sense  if  it  could  be 
toppled  by  any  force,  no  matter  how  small.  In  everyday  usage,  of 
course,  it  would  be  considered  unstable,  since  the  force  required  to 
topple  it  is  so  small. 

An  application  of  calculus  example  1 0 

t>  Nancy  Neutron  is  living  in  a  uranium  nucleus  that  is  undergoing 
fission.  Nancy’s  nuclear  energy  as  a  function  of  position  can  be 
approximated  by  U  =  x4  -  x2,  where  all  the  units  and  numeri¬ 
cal  constants  have  been  suppressed  for  simplicity.  Use  calculus 
to  locate  the  equilibrium  points,  and  determine  whether  they  are 
stable  or  unstable. 

>  The  equilibrium  points  occur  where  the  U  is  at  a  minimum  or 
maximum,  and  minima  and  maxima  occur  where  the  derivative 
(which  equals  minus  the  force  on  Nancy)  is  zero.  This  derivative 
is  dU/dx  =  4x3  -  2x,  and  setting  it  equal  to  zero,  we  have  x  = 
0,  ±1  )V2.  Minima  occur  where  the  second  derivative  is  positive, 
and  maxima  where  it  is  negative.  The  second  derivative  is  1 2x2  - 
2,  which  is  negative  at  x  =  0  (unstable)  and  positive  at  x  =  ±1  /V2 
(stable).  Interpretation:  the  graph  of  U  is  shaped  like  a  rounded 
letter  ‘W,’  with  the  two  troughs  representing  the  two  halves  of  the 
splitting  nucleus.  Nancy  is  going  to  have  to  decide  which  half  she 
wants  to  go  with. 


4.1.6  Proof  of  Kepler’s  elliptical  orbit  law 

Kepler  determined  purely  empirically  that  the  planets’  orbits 
were  ellipses,  without  understanding  the  underlying  reason  in  terms 
of  physical  law.  Newton’s  proof  of  this  fact  based  on  his  laws  of 
motion  and  law  of  gravity  was  considered  his  crowning  achievement 
both  by  him  and  by  his  contemporaries,  because  it  showed  that  the 
same  physical  laws  could  be  used  to  analyze  both  the  heavens  and 
the  earth.  Newton’s  proof  was  very  lengthy,  but  by  applying  the 
more  recent  concepts  of  conservation  of  energy  and  angular  momen¬ 
tum  we  can  carry  out  the  proof  quite  simply  and  succinctly.  This 
subsection  can  be  skipped  without  losing  the  continuity  of  the  text. 

The  basic  idea  of  the  proof  is  that  we  want  to  describe  the  shape 
of  the  planet’s  orbit  with  an  equation,  and  then  show  that  this  equa¬ 
tion  is  exactly  the  one  that  represents  an  ellipse.  Newton’s  original 
proof  had  to  be  very  complicated  because  it  was  based  directly  on 
his  laws  of  motion,  which  include  time  as  a  variable.  To  make  any 
statement  about  the  shape  of  the  orbit,  he  had  to  eliminate  time 
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from  his  equations,  leaving  only  space  variables.  But  conservation 
laws  tell  us  that  certain  things  don’t  change  over  time,  so  they  have 
already  had  time  eliminated  from  them. 

There  are  many  ways  of  representing  a  curve  by  an  equation,  of 
which  the  most  familiar  is  y  =  ax  +  b  for  a  line  in  two  dimensions.  It 
would  be  perfectly  possible  to  describe  a  planet’s  orbit  using  an  x-y 
equation  like  this,  but  remember  that  we  are  applying  conservation 
of  angular  momentum,  and  the  space  variables  that  occur  in  the 
equation  for  angular  momentum  are  the  distance  from  the  axis,  r, 
and  the  angle  between  the  velocity  vector  and  the  r  vector,  which 
we  will  call  p.  The  planet  will  have  p  =  90°  when  it  is  moving 
perpendicular  to  the  r  vector,  i.e.,  at  the  moments  when  it  is  at  its 
smallest  or  greatest  distances  from  the  sun.  When  <p  is  less  than  90° 
the  planet  is  approaching  the  sun,  and  when  it  is  greater  than  90°  it 
is  receding  from  it.  Describing  a  curve  with  an  r-p  equation  is  like 
telling  a  driver  in  a  parking  lot  a  certain  rule  for  what  direction  to 
steer  based  on  the  distance  from  a  certain  streetlight  in  the  middle 
of  the  lot. 

The  proof  is  broken  into  the  three  parts  for  easier  digestion. 
The  first  part  is  a  simple  and  intuitively  reasonable  geometrical  fact 
about  ellipses,  whose  proof  we  relegate  to  the  caption  of  figure  ac; 
you  will  not  be  missing  much  if  you  merely  absorb  the  result  without 
reading  the  proof. 

(1)  If  we  use  one  of  the  two  foci  of  an  ellipse  as  an  axis  for 
defining  the  variables  r  and  p,  then  the  angle  between  the  tangent 
line  and  the  line  drawn  to  the  other  focus  is  the  same  as  p ,  i.e.,  the 
two  angles  labeled  p  in  the  figure  are  in  fact  equal. 

The  other  two  parts  form  the  meat  of  our  proof.  We  state  the 
results  first  and  then  prove  them. 

(2)  A  planet,  moving  under  the  influence  of  the  sun’s  gravity 
with  less  than  the  energy  required  to  escape,  obeys  an  equation  of 
the  form 

1 

sm(^=  /  > 

V  —  prz  +  qr 

where  p  and  q  are  positive  constants  that  depend  on  the  planet’s 
energy  and  angular  momentum  and  p  is  greater  than  zero. 

(3)  A  curve  is  an  ellipse  if  and  only  if  its  r-p  equation  is  of  the 
form 

1 

sin  p  =  : 

y—pr 2  +  qr 

where  p  and  q  are  positive  constants  that  depend  on  the  size  and 
shape  of  the  ellipse. 

Proof  of  part  (2) 

The  component  of  the  planet’s  velocity  vector  that  is  perpen¬ 
dicular  to  the  r  vector  is  v±  =  usiny?,  so  conservation  of  angular 


ab  /  Describing  a  curve  by 
giving  cp  as  a  function  of  r. 


ac  /  Proof  that  the  two  an¬ 
gles  labeled  cp  are  in  fact  equal: 
The  definition  of  an  ellipse  is  that 
the  sum  of  the  distances  from 
the  two  foci  stays  constant.  If  we 
move  a  small  distance  £  along  the 
ellipse,  then  one  distance  shrinks 
by  an  amount  f  cos  <pi,  while  the 
other  grows  by  £  cos  cp2.  These 
are  equal,  so  (pi  =  cp2. 
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ad  /  Quantities  referred  to 
the  proof  of  part  (3). 


momentum  tells  us  that  L  =  rrirv  sin  p  is  a  constant.  Since  the 
planet’s  mass  is  a  constant,  this  is  the  same  as  the  condition 

rvsiiup  =  constant. 


Conservation  of  energy  gives 


1  2 

-mv 


Mm 

Gr - 

r 


=  constant. 


We  solve  the  first  equation  for  v  and  plug  into  the  second  equa¬ 
tion  to  eliminate  v .  Straightforward  algebra  then  leads  to  the  equa¬ 
tion  claimed  above,  with  the  constant  p  being  positive  because  of 
our  assumption  that  the  planet’s  energy  is  insufficient  to  escape  from 
the  sun,  i.e.,  its  total  energy  is  negative. 

Proof  of  part  (3) 

We  define  the  quantities  a,  d,  and  s  as  shown  in  figure  ad.  The 
law  of  cosines  gives 


d2  =  r2  +  s2  —  2 rs  cos  a. 

Using  a  =  180°  — 2 p  and  the  trigonometric  identities  cos(180°  —  x)  = 
—  cos  x  and  cos  2x  =  1  —  2  sin2  x,  we  can  rewrite  this  as 

d2  =  r2  +  s2  -  2 rs  (2  sin2  ip  -  l)  . 

Straightforward  algebra  transforms  this  into 


sin  p  = 


(r  +  s)2 
4rs 


d2 


Since  r  +  s  is  constant,  the  top  of  the  fraction  is  constant,  and  the 
denominator  can  be  rewritten  as  4 rs  =  4r  (constant  —  r),  which  is 
equivalent  to  the  desired  form. 
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4.2  Rigid-Body  Rotation 

4.2.1  Kinematics 

When  a  rigid  object  rotates,  every  part  of  it  (every  atom)  moves 
in  a  circle,  covering  the  same  angle  in  the  same  amount  of  time, 
a.  Every  atom  has  a  different  velocity  vector,  b.  Since  all  the 
velocities  are  different,  we  can’t  measure  the  speed  of  rotation  of 
the  top  by  giving  a  single  velocity.  We  can,  however,  specify  its 
speed  of  rotation  consistently  in  terms  of  angle  per  unit  time.  Let 
the  position  of  some  reference  point  on  the  top  be  denoted  by  its 
angle  8,  measured  in  a  circle  around  the  axis.  For  reasons  that  will 
become  more  apparent  shortly,  we  measure  all  our  angles  in  radians. 
Then  the  change  in  the  angular  position  of  any  point  on  the  top  can 
be  written  as  d 0,  and  all  parts  of  the  top  have  the  same  value  of  d 8 
over  a  certain  time  interval  dt.  We  define  the  angular  velocity,  oj 
(Greek  omega), 

dfl 

#  “  dt, 

[definition  of  angular  velocity;  8  in  units  of  radians] 

which  is  similar  to,  but  not  the  same  as,  the  quantity  oj  we  defined 
earlier  to  describe  vibrations.  The  relationship  between  oj  and  t 
is  exactly  analogous  to  that  between  x  and  t  for  the  motion  of  a 
particle  through  space. 

self-check  B 

If  two  different  people  chose  two  different  reference  points  on  the  top 
in  order  to  define  0=0,  how  would  their  Q-t  graphs  differ?  What  effect 
would  this  have  on  the  angular  velocities?  >  Answer,  p.  963 

The  angular  velocity  has  units  of  radians  per  second,  rad/s. 
However,  radians  are  not  really  units  at  all.  The  radian  measure 
of  an  angle  is  defined,  as  the  length  of  the  circular  arc  it  makes, 
divided  by  the  radius  of  the  circle.  Dividing  one  length  by  another 
gives  a  unitless  quantity,  so  anything  with  units  of  radians  is  really 
unitless.  We  can  therefore  simplify  the  units  of  angular  velocity,  and 
call  them  inverse  seconds,  s-1. 

A  78-rpm  record  example  1 1 

o  In  the  early  20th  century,  the  standard  format  for  music  record¬ 
ings  was  a  plastic  disk  that  held  a  single  song  and  rotated  at  78 
rpm  (revolutions  per  minute).  What  was  the  angular  velocity  of 
such  a  disk? 

>  If  we  measure  angles  in  units  of  revolutions  and  time  in  units 
of  minutes,  then  78  rpm  is  the  angular  velocity.  Using  standard 
physics  units  of  radians/second,  however,  we  have 

78  revolutions  27r  radians  1  minute  „  „  ■, 

_ x _ x _ _  8  2  s~ 

1  minute  1  revolution  60  seconds 


a  /  The  two  atoms  cover  the 
same  angle  in  a  given  time 
interval. 


b  /  Their  velocity  vectors,  how¬ 
ever,  differ  in  both  magnitude  and 
direction. 
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x  « — ►  e 

v  < — >  CO 


a  < — >  a 

c  /  Analogies  between  rota¬ 
tional  and  linear  quantities. 


d  /  We  construct  a  coordi¬ 
nate  system  that  coincides  with 
the  location  and  motion  of  the 
moving  point  of  interest  at  a 
certain  moment. 


In  the  absence  of  any  torque,  a  rigid  body  will  rotate  indefinitely 
with  the  same  angular  velocity.  If  the  angular  velocity  is  changing 
because  of  a  torque,  we  define  an  angular  acceleration, 


a  = 


da; 
d  t  ’ 


[definition  of  angular  acceleration] 


The  symbol  is  the  Greek  letter  alpha.  The  units  of  this  quantity  are 
rad/s2,  or  simply  s-2. 

The  mathematical  relationship  between  oj  and  6  is  the  same  as 
the  one  between  v  and  x,  and  similarly  for  a  and  a.  We  can  thus 
make  a  system  of  analogies,  c,  and  recycle  all  the  familiar  kinematic 
equations  for  constant-acceleration  motion. 

The  synodic  period  example  12 

Mars  takes  nearly  twice  as  long  as  the  Earth  to  complete  an  orbit. 

If  the  two  planets  are  alongside  one  another  on  a  certain  day, 
then  one  year  later,  Earth  will  be  back  at  the  same  place,  but 
Mars  will  have  moved  on,  and  it  will  take  more  time  for  Earth  to 
finish  catching  up.  Angular  velocities  add  and  subtract,  just  as 
velocity  vectors  do.  If  the  two  planets’  angular  velocities  are  uq 
and  (x>2 ,  then  the  angular  velocity  of  one  relative  to  the  other  is 
uq  -  cu2.  The  corresponding  period,  1  /(1/7i  -  1/T2)  is  known  as 
the  synodic  period. 

A  neutron  star  example  13 

>  A  neutron  star  is  initially  observed  to  be  rotating  with  an  angular 
velocity  of  2.0  s_1,  determined  via  the  radio  pulses  it  emits.  If  its 
angular  acceleration  is  a  constant  -1.0  x  1CT8  s~2,  how  many 
rotations  will  it  complete  before  it  stops?  (In  reality,  the  angular 
acceleration  is  not  always  constant;  sudden  changes  often  occur, 
and  are  referred  to  as  “starquakes!”) 

>  The  equation  vf-  vf=2aAx  can  be  translated  into  tu^-tu2=2ctA0, 
giving 


A0  =  (cu2  —  tu2)/2a 

=  2.0  x  108  radians 
=  3.2  x  107  rotations. 


4.2.2  Relations  between  angular  quantities  and  motion  of  a 
point 

It  is  often  necessary  to  be  able  to  relate  the  angular  quantities 
to  the  motion  of  a  particular  point  on  the  rotating  object.  As  we 
develop  these,  we  will  encounter  the  first  example  where  the  advan¬ 
tages  of  radians  over  degrees  become  apparent. 

The  speed  at  which  a  point  on  the  object  moves  depends  on  both 
the  object’s  angular  velocity  u ;  and  the  point’s  distance  r  from  the 
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axis.  We  adopt  a  coordinate  system,  d,  with  an  inward  (radial)  axis 
and  a  tangential  axis.  The  length  of  the  infinitesimal  circular  arc 
ds  traveled  by  the  point  in  a  time  interval  d t  is  related  to  d6  by  the 
definition  of  radian  measure,  d 6  =  ds/r,  where  positive  and  negative 
values  of  ds  represent  the  two  possible  directions  of  motion  along 
the  tangential  axis.  We  then  have  vt  =  d s/  dt  =  rdO/  df  =  wr,  or 

vt  =  wr.  [tangential  velocity  of  a  point  at  a 

distance  r  from  the  axis  of  rotation] 

The  radial  component  is  zero,  since  the  point  is  not  moving  inward 
or  outward, 

vr  =  0.  [radial  velocity  of  a  point  at  a 

distance  r  from  the  axis  of  rotation] 


Note  that  we  had  to  use  the  definition  of  radian  measure  in  this 
derivation.  Suppose  instead  we  had  used  units  of  degrees  for  our  an¬ 
gles  and  degrees  per  second  for  angular  velocities.  The  relationship 
between  d Odegrees  and  ds  is  d ddegrees  =  (360/27r)s/r,  where  the  extra 
conversion  factor  of  (360/27r)  comes  from  that  fact  that  there  are  360 
degrees  in  a  full  circle,  which  is  equivalent  to  2i r  radians.  The  equa¬ 
tion  for  vt  would  then  have  been  vt  =  (2it  / 360)  (u degrees  per  second)  (r), 
which  would  have  been  much  messier.  Simplicity,  then,  is  the  rea¬ 
son  for  using  radians  rather  than  degrees;  by  using  radians  we  avoid 
infecting  all  our  equations  with  annoying  conversion  factors. 

Since  the  velocity  of  a  point  on  the  object  is  directly  proportional 
to  the  angular  velocity,  you  might  expect  that  its  acceleration  would 
be  directly  proportional  to  the  angular  acceleration.  This  is  not  true, 
however.  Even  if  the  angular  acceleration  is  zero,  i.e.,  if  the  object 
is  rotating  at  constant  angular  velocity,  every  point  on  it  will  have 
an  acceleration  vector  directed  toward  the  axis,  e.  As  derived  on 
page  209,  the  magnitude  of  this  acceleration  is 

ar  =  oj2r.  [radial  acceleration  of  a  point 

at  a  distance  r  from  the  axis] 


e  /  Even  if  the  rotating  object 
has  zero  angular  acceleration, 
every  point  on  it  has  an  ac¬ 
celeration  towards  the  center. 


For  the  tangential  component,  any  change  in  the  angular  velocity 
dw  will  lead  to  a  change  d u-r  in  the  tangential  velocity,  so  it  is  easily 
shown  that 

at  =  ar.  [tangential  acceleration  of  a  point 

at  a  distance  r  from  the  axis] 


self-check  C 

Positive  and  negative  signs  of  o>  represent  rotation  in  opposite  direc¬ 
tions.  Why  does  it  therefore  make  sense  physically  that  cu  is  raised  to 
the  first  power  in  the  equation  for  vt  and  to  the  second  power  in  the  one 
for  ar?  >  Answer,  p.  963 
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Radial  acceleration  at  the  surface  of  the  Earth  example  14 
t>  What  is  your  radial  acceleration  due  to  the  rotation  of  the  earth 
if  you  are  at  the  equator? 

>  At  the  equator,  your  distance  from  the  Earth’s  rotation  axis  is 
the  same  as  the  radius  of  the  spherical  Earth,  6.4  x  106  m.  Your 
angular  velocity  is 


2n  radians 


=  7.3  x  1(T5  s— 1 , 
which  gives  an  acceleration  of 

ar  =  a )2r 

=  0.034  m/s2. 

The  angular  velocity  was  a  very  small  number,  but  the  radius  was 
a  very  big  number.  Squaring  a  very  small  number,  however,  gives 
a  very  very  small  number,  so  the  w2  factor  “wins,”  and  the  final 
result  is  small. 

If  you’re  standing  on  a  bathroom  scale,  this  small  acceleration  is 
provided  by  the  imbalance  between  the  downward  force  of  gravity 
and  the  slightly  weaker  upward  normal  force  of  the  scale  on  your 
foot.  The  scale  reading  is  therefore  a  little  lower  than  it  should 
be. 

4.2.3  Dynamics 

If  we  want  to  connect  all  this  kinematics  to  anything  dynamical, 
we  need  to  see  how  it  relates  to  torque  and  angular  momentum. 
Our  strategy  will  be  to  tackle  angular  momentum  first,  since  angu¬ 
lar  momentum  relates  to  motion,  and  to  use  the  additive  property 
of  angular  momentum:  the  angular  momentum  of  a  system  of  par¬ 
ticles  equals  the  sum  of  the  angular  momenta  of  all  the  individual 
particles.  The  angular  momentum  of  one  particle  within  our  rigidly 
rotating  object,  L  =  mv±r,  can  be  rewritten  as  L  =  r  p  sin  9, 
where  r  and  p  are  the  magnitudes  of  the  particle’s  r  and  momen¬ 
tum  vectors,  and  0  is  the  angle  between  these  two  vectors.  (The  r 
vector  points  outward  perpendicularly  from  the  axis  to  the  parti¬ 
cle’s  position  in  space.)  In  rigid-body  rotation  the  angle  9  is  90°, 
so  we  have  simply  L  =  rp.  Relating  this  to  angular  velocity,  we 
have  L  =  rp  =  (: r){mv )  =  (■ r)(mur )  =  mr2 u.  The  particle’s  con¬ 
tribution  to  the  total  angular  momentum  is  proportional  to  w,  with 
a  proportionality  constant  mr2.  We  refer  to  mr2  as  the  particle’s 
contribution  to  the  object’s  total  moment  of  inertia ,  I,  where  “mo¬ 
ment”  is  used  in  the  sense  of  “important,”  as  in  “momentous”  —  a 
bigger  value  of  I  tells  us  the  particle  is  more  important  for  deter¬ 
mining  the  total  angular  momentum.  The  total  moment  of  inertia 
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I  =  [definition  of  the  moment  of  inertia; 

for  rigid-body  rotation  in  a  plane;  r  is  the  distance 
from  the  axis,  measured  perpendicular  to  the  axis] 

The  angular  momentum  of  a  rigidly  rotating  body  is  then 

L  =  Iuj.  [angular  momentum  of 

rigid-body  rotation  in  a  plane] 

Since  torque  is  defined  as  d Lj  d t,  and  a  rigid  body  has  a  constant 
moment  of  inertia,  we  have  r  =  d Lj  df  =  I  du/  dt  =  la, 

t  =  la,  [relationship  between  torque  and 

angular  acceleration  for  rigid-body  rotation  in  a  plane] 
which  is  analogous  to  F  =  ma. 

The  complete  system  of  analogies  between  linear  motion  and 
rigid-body  rotation  is  given  in  figure  f. 

A  barbell  example  15 

>  The  barbell  shown  in  figure  g  consists  of  two  small,  dense,  mas¬ 
sive  balls  at  the  ends  of  a  very  light  rod.  The  balls  have  masses 
of  2.0  kg  and  1.0  kg,  and  the  length  of  the  rod  is  3.0  m.  Find 
the  moment  of  inertia  of  the  rod  (1)  for  rotation  about  its  center  of 
mass,  and  (2)  for  rotation  about  the  center  of  the  more  massive 
ball. 

>  (1)  The  ball’s  center  of  mass  lies  1/3  of  the  way  from  the  greater 
mass  to  the  lesser  mass,  i.e.,  1 .0  m  from  one  and  2.0  m  from  the 
other.  Since  the  balls  are  small,  we  approximate  them  as  if  they 
were  two  pointlike  particles.  The  moment  of  inertia  is 

/  =  (2.0  kg)(1 .0  m)2  +  (1 .0  kg)(2.0  m)2 

=  2.0  kg  m2  +  4.0  kg  m2 

=  6.0  kg  m2 

Perhaps  counterintuitively,  the  less  massive  ball  contributes  far 
more  to  the  moment  of  inertia. 

(2)  The  big  ball  theoretically  contributes  a  little  bit  to  the  moment 
of  inertia,  since  essentially  none  of  its  atoms  are  exactly  at  r= 0. 
However,  since  the  balls  are  said  to  be  small  and  dense,  we  as¬ 
sume  all  the  big  ball’s  atoms  are  so  close  to  the  axis  that  we  can 
ignore  their  small  contributions  to  the  total  moment  of  inertia: 

/  =  (1.0  kg)(3.0  m)2 

=  9.0  kg  m2 

This  example  shows  that  the  moment  of  inertia  depends  on  the 
choice  of  axis.  For  example,  it  is  easier  to  wiggle  a  pen  about  its 
center  than  about  one  end. 


x  « — >  e 

v  < — ►  CO 


a  « — ►  a 


m  « — ►  | 


p  « — >  L 
F  < — ►  T 

f  /  Analogies  between  rota¬ 
tional  and  linear  quantities. 


€ 

axis  2  axis  1 

g  /  Example  15 


Section  4.2  Rigid-Body  Rotation 


269 


The  parallel  axis  theorem  example  16 

>  Generalizing  the  previous  example,  suppose  we  pick  any  axis 
parallel  to  axis  1,  but  offset  from  it  by  a  distance  h.  Part  (2)  of 
the  previous  example  then  corresponds  to  the  special  case  of 
h  =  - 1.0  m  (negative  being  to  the  left).  What  is  the  moment  of 
inertia  about  this  new  axis? 

>  The  big  ball’s  distance  from  the  new  axis  is  (1.0  m)+h,  and  the 
small  one’s  is  (2.0  m)-h.  The  new  moment  of  inertia  is 

I  =  (2.0  kg)[(1 .0  m )+h]2  +  (1 .0  kg)[(2.0  m)  -  h]2 
=  6.0  kg  m2  +  (4.0  kg  m)/?  -  (4.0  kg-m)/?+  (3.0  kg )h2. 

The  constant  term  is  the  same  as  the  moment  of  inertia  about  the 
center-of-mass  axis,  the  first-order  terms  cancel  out,  and  the  third 
term  is  just  the  total  mass  multiplied  by  h2.  The  interested  reader 
will  have  no  difficulty  in  generalizing  this  to  any  set  of  particles 
(problem  38,  p.  295),  resulting  in  the  parallel  axis  theorem:  If  an 
object  of  total  mass  M  rotates  about  a  line  at  a  distance  h  from 
its  center  of  mass,  then  its  moment  of  inertia  equals  lcm  +  Mh2, 
where  lcm  is  the  moment  of  inertia  for  rotation  about  a  parallel  line 
through  the  center  of  mass. 

Scaling  of  the  moment  of  inertia  example  1 7 

>  (1)  Suppose  two  objects  have  the  same  mass  and  the  same 
shape,  but  one  is  less  dense,  and  larger  by  a  factor  k.  How  do 
their  moments  of  inertia  compare? 

(2)  What  if  the  densities  are  equal  rather  than  the  masses? 

>  (1)  This  is  like  increasing  all  the  distances  between  atoms  by  a 
factor  k.  All  the  r’s  become  greater  by  this  factor,  so  the  moment 
of  inertia  is  increased  by  a  factor  of  k2. 

(2)  This  introduces  an  increase  in  mass  by  a  factor  of  k3,  so  the 
moment  of  inertia  of  the  bigger  object  is  greater  by  a  factor  of 
k5. 

4.2.4  Iterated  integrals 

In  various  places  in  this  book,  starting  with  subsection  4.2.5, 
we’ll  come  across  integrals  stuck  inside  other  integrals.  These  are 
known  as  iterated  integrals,  or  double  integrals,  triple  integrals,  etc. 
Similar  concepts  crop  up  all  the  time  even  when  you’re  not  doing 
calculus,  so  let’s  start  by  imagining  such  an  example.  Suppose  you 
want  to  count  how  many  squares  there  are  on  a  chess  board,  and  you 
don’t  know  how  to  multiply  eight  times  eight.  You  could  start  from 
the  upper  left,  count  eight  squares  across,  then  continue  with  the 
second  row,  and  so  on,  until  you  how  counted  every  square,  giving 
the  result  of  64.  In  slightly  more  formal  mathematical  language, 
we  could  write  the  following  recipe:  for  each  row,  r,  from  1  to  8, 
consider  the  columns,  c,  from  1  to  8,  and  add  one  to  the  count  for 
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each  one  of  them.  Using  the  sigma  notation,  this  becomes 


8  8 

EE1- 


r=  1  c=  1 


If  you’re  familiar  with  computer  programming,  then  you  can  think 
of  this  as  a  sum  that  could  be  calculated  using  a  loop  nested  inside 
another  loop.  To  evaluate  the  result  (again,  assuming  we  don’t  know 
how  to  multiply,  so  we  have  to  use  brute  force) ,  we  can  first  evaluate 
the  inside  sum,  which  equals  8,  giving 

8 

E*- 

r=  1 


Notice  how  the  “dummy”  variable  c  has  disappeared.  Finally  we  do 
the  outside  sum,  over  r,  and  find  the  result  of  64. 

Now  imagine  doing  the  same  thing  with  the  pixels  on  a  TV 
screen.  The  electron  beam  sweeps  across  the  screen,  painting  the 
pixels  in  each  row,  one  at  a  time.  This  is  really  no  different  than  the 
example  of  the  chess  board,  but  because  the  pixels  are  so  small,  you 
normally  think  of  the  image  on  a  TV  screen  as  continuous  rather 
than  discrete.  This  is  the  idea  of  an  integral  in  calculus.  Suppose 
we  want  to  find  the  area  of  a  rectangle  of  width  a  and  height  b,  and 
we  don’t  know  that  we  can  just  multiply  to  get  the  area  ab.  The 
brute  force  way  to  do  this  is  to  break  up  the  rectangle  into  a  grid  of 
infinitesimally  small  squares,  each  having  width  dx  and  height  d y, 
and  therefore  the  infinitesimal  area  dA  =  dxd y.  For  convenience, 
we’ll  imagine  that  the  rectangle’s  lower  left  corner  is  at  the  origin. 
Then  the  area  is  given  by  this  integral: 


Notice  how  the  leftmost  integral  sign,  over  y,  and  the  rightmost  dif¬ 
ferential,  d y,  act  like  bookends,  or  the  pieces  of  bread  on  a  sandwich. 
Inside  them,  we  have  the  integral  sign  that  runs  over  x,  and  the  dif¬ 
ferential  dx  that  matches  it  on  the  right.  Finally,  on  the  innermost 
layer,  we’d  normally  have  the  thing  we’re  integrating,  but  here’s  it’s 
1,  so  I’ve  omitted  it.  Writing  the  lower  limits  of  the  integrals  with 
x  =  and  y  =  helps  to  keep  it  straight  which  integral  goes  with  which 
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differential.  The  result  is 


rb  ra 

area  =  /  /  dA 

J  y— 0  J  X=0 


d  y 


ady 


d  y 


=  ab. 


Area  of  a  triangle  example  1 8 

t>  Find  the  area  of  a  45-45-90  right  triangle  having  legs  a. 

>  Let  the  triangle’s  hypotenuse  run  from  the  origin  to  the  point 
(a,  a),  and  let  its  legs  run  from  the  origin  to  (0,  a),  and  then  to 
(a,  a).  In  other  words,  the  triangle  sits  on  top  of  its  hypotenuse. 
Then  the  integral  can  be  set  up  the  same  way  as  the  one  before, 
but  for  a  particular  value  of  y,  values  of  x  only  run  from  0  (on  the 
y  axis)  to  y  (on  the  hypotenuse).  We  then  have 


J  y= 0 


Note  that  in  this  example,  because  the  upper  end  of  the  x  values 
depends  on  the  value  of  y,  it  makes  a  difference  which  order  we 
do  the  integrals  in.  The  x  integral  has  to  be  on  the  inside,  and  we 
have  to  do  it  first. 

Volume  of  a  cube  example  1 9 

>  Find  the  volume  of  a  cube  with  sides  of  length  a. 

>  This  is  a  three-dimensional  example,  so  we’ll  have  integrals 
nested  three  deep,  and  the  thing  we’re  integrating  is  the  volume 
6V  =  dxdydz. 
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volume  =  f  f  f  dxdydz 

J  z=0  J  y= 0  Jx= 0 

=  r  fa  adydz 
J  z= 0  -7  y=0 

=  a  fa  fa  dydz 

J z= 0  J  y=0 

=  a  [3  adz 

Jz=0 

=  a3 


Area  of  a  circle  example  20 

>  Find  the  area  of  a  circle. 

>  To  make  it  easy,  let’s  find  the  area  of  a  semicircle  and  then 
double  it.  Let  the  circle’s  radius  be  r,  and  let  it  be  centered  on  the 
origin  and  bounded  below  by  the  x  axis.  Then  the  curved  edge 
is  given  by  the  equation  r2  =  x2  +  y2,  or  y  =  V r2  -  x2.  Since 
the  y  integral’s  limit  depends  on  x,  the  x  integral  has  to  be  on  the 
outside.  The  area  is 


=  r  f  x/i  -(x/r)2dx. 

Jx=-r 


Substituting  u  =  x/r, 


area  = 


a2  da 


The  definite  integral  equals  n,  as  you  can  find  using  a  trig  sub¬ 
stitution  or  simply  by  looking  it  up  in  a  table,  and  the  result  is,  as 
expected,  nr2/2.  for  the  area  of  the  semicircle. 

4.2.5  Finding  moments  of  inertia  by  integration 

When  calculating  the  moment  of  inertia  of  an  ordinary-sized  ob¬ 
ject  with  perhaps  1026  atoms,  it  would  be  impossible  to  do  an  actual 
sum  over  atoms,  even  with  the  world’s  fastest  supercomputer.  Cal¬ 
culus,  however,  offers  a  tool,  the  integral,  for  breaking  a  sum  down 
to  infinitely  many  small  parts.  If  we  don’t  worry  about  the  exis¬ 
tence  of  atoms,  then  we  can  use  an  integral  to  compute  a  moment 
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of  inertia  as  if  the  object  was  smooth  and  continuous  throughout, 
rather  than  granular  at  the  atomic  level.  Of  course  this  granular¬ 
ity  typically  has  a  negligible  effect  on  the  result  unless  the  object 
is  itself  an  individual  molecule.  This  subsection  consists  of  three 
examples  of  how  to  do  such  a  computation,  at  three  distinct  levels 
of  mathematical  complication. 

Moment  of  inertia  of  a  thin  rod 

What  is  the  moment  of  inertia  of  a  thin  rod  of  mass  M  and 
length  L  about  a  line  perpendicular  to  the  rod  and  passing  through 
its  center?  We  generalize  the  discrete  sum 

1  =  mri 
to  a  continuous  one, 

I  =  J  r2  dm 

[r  =  |a;|,  so  r2  =  x 2] 


rL!2  2  M  , 
x  —  dx 


'  —  L/2 


L 


In  this  example  the  object  was  one-dimensional,  which  made 
the  math  simple.  The  next  example  shows  a  strategy  that  can  be 
used  to  simplify  the  math  for  objects  that  are  three-dimensional, 
but  possess  some  kind  of  symmetry. 


Moment  of  inertia  of  a  disk 


What  is  the  moment  of  inertia  of  a  disk  of  radius  b,  thickness  t, 
and  mass  M,  for  rotation  about  its  central  axis? 


We  break  the  disk  down  into  concentric  circular  rings  of  thick¬ 
ness  dr.  Since  all  the  mass  in  a  given  circular  slice  has  essentially 
the  same  value  of  r  (ranging  only  from  r  to  r  +  dr),  the  slice’s  con¬ 
tribution  to  the  total  moment  of  inertia  is  simply  r2  dm.  We  then 
have 


I  = 


s 


r2  dm 


/ 


r2pdV, 


where  V  =  7 rb2t  is  the  total  volume,  p  =  M/V  =  M/nb2t  is  the 
density,  and  the  volume  of  one  slice  can  be  calculated  as  the  volume 
enclosed  by  its  outer  surface  minus  the  volume  enclosed  by  its  inner 
surface,  dV  =  ir(r  +  dr)2t  —  irr2t  =  2-ntrdr. 


I  = 


2  M 

r  ,  „  27rf  r  dr 
irbzt 


=  -Mb2. 
2 
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In  the  most  general  case  where  there  is  no  symmetry  about  the 
rotation  axis,  we  must  use  iterated  integrals,  as  discussed  in  subsec¬ 
tion  4.2.4.  The  example  of  the  disk  possessed  two  types  of  symme¬ 
try  with  respect  to  the  rotation  axis:  (1)  the  disk  is  the  same  when 
rotated  through  any  angle  about  the  axis,  and  (2)  all  slices  perpen¬ 
dicular  to  the  axis  are  the  same.  These  two  symmetries  reduced  the 
number  of  layers  of  integrals  from  three  to  one.  The  following  ex¬ 
ample  possesses  only  one  symmetry,  of  type  (2),  and  we  simply  set 
it  up  as  a  triple  integral.  You  may  not  have  seen  multiple  integrals 
yet  in  a  math  course.  If  so,  just  skim  this  example. 


Moment  of  inertia  of  a  cube 

What  is  the  moment  of  inertia  of  a  cube  of  side  b ,  for  rotation 
about  an  axis  that  passes  through  its  center  and  is  parallel  to  four 
of  its  faces?  Let  the  origin  be  at  the  center  of  the  cube,  and  let  x 
be  the  rotation  axis. 


I  = 


s 


r 2  dm 


=  P 


1 


r 2  dY 


The  fact  that  the  last  step  is  a  trivial  integral  results  from  the  sym¬ 
metry  of  the  problem.  The  integrand  of  the  remaining  double  in¬ 
tegral  breaks  down  into  two  terms,  each  of  which  depends  on  only 
one  of  the  variables,  so  we  break  it  into  two  integrals, 

rb/ 2  rb/2  rb/2  rb/2 

I  =  pb  /  y2  dydz  +  pb  /  z 2  dy  dz 

7-6/2  7-6/2  7-6/2  7-6/2 


which  we  know  have  identical  results.  We  therefore  only  need  to 
evaluate  one  of  them  and  double  the  result: 

j-b/2  rb/2 

I  =  2pb  /  z2dydz 

J~b/2  J-b/2 

rb/2 

=  2 pb1  /  z2  dz 

J-b/2 


1 

6 

1 

6 


pb 5 
Mb2 


Figure  h  shows  the  moments  of  inertia  of  some  shapes,  which 
were  evaluated  with  techniques  like  these. 
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h  /  Moments  of  inertia  of  some 
geometric  shapes. 


OI=  mR  2 

wheel  or  hoop  of  radius  R,  with 
its  mass  concentrated  on  the  rim 


sphere  of  radius  R  with 
uniform  density  throughout 


thin  rod  of  length  b 
rotating  end  over  end 


cylinder  of  radius  R 
rotating  about  its  axis 


cube  with  sides  of  length  b 


cylinder  of  radius  R  and 
length  b  rotating  end  over  end 


The  hammer  throw  example  21 

>  In  the  men’s  Olympic  hammer  throw,  a  steel  ball  of  radius  6.1  cm 
is  swung  on  the  end  of  a  wire  of  length  1.22  m.  What  fraction  of 
the  ball’s  angular  momentum  comes  from  its  rotation,  as  opposed 
to  its  motion  through  space? 

>  It’s  always  important  to  solve  problems  symbolically  first,  and 
plug  in  numbers  only  at  the  end,  so  let  the  radius  of  the  ball  be  b , 
and  the  length  of  the  wire  £.  If  the  time  the  ball  takes  to  go  once 
around  the  circle  is  T,  then  this  is  also  the  time  it  takes  to  revolve 
once  around  its  own  axis.  Its  speed  is  v  =  2 n£/T,  so  its  angular 
momentum  due  to  its  motion  through  space  is  mv£  =  2nm£2  /T. 
Its  angular  momentum  due  to  its  rotation  around  its  own  cen¬ 
ter  is  (4n/5)mb2/T.  The  ratio  of  these  two  angular  momenta  is 
(2/5 ){b/l)2  =  1  .Ox  10”3.  The  angular  momentum  due  to  the  ball’s 
spin  is  extremely  small. 

Toppling  a  rod  example  22 

>  A  rod  of  length  b  and  mass  m  stands  upright.  We  want  to  strike 
the  rod  at  the  bottom,  causing  it  to  fall  and  land  flat.  Find  the 
momentum,  p,  that  should  be  delivered,  in  terms  of  m,  b,  and 
g.  Can  this  really  be  done  without  having  the  rod  scrape  on  the 
floor? 

>  This  is  a  nice  example  of  a  question  that  can  very  nearly  be 
answered  based  only  on  units.  Since  the  three  variables,  m,  b, 
and  g,  all  have  different  units,  they  can’t  be  added  or  subtracted. 
The  only  way  to  combine  them  mathematically  is  by  multiplication 
or  division.  Multiplying  one  of  them  by  itself  is  exponentiation,  so 
in  general  we  expect  that  the  answer  must  be  of  the  form 

i  /  Example  22.  p  = 

where  A,  j,  k,  and  /  are  unitless  constants.  The  result  has  to  have 
units  of  kg- m/s.  To  get  kilograms  to  the  first  power,  we  need 

7  =  1. 
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meters  to  the  first  power  requires 


k  +  l=  1, 

and  seconds  to  the  power  -1  implies 


/=  1/2. 

We  find  j  =  1 ,  k  =  1  /2,  and  /  =  1  /2,  so  the  solution  must  be  of  the 
form 

p  =  Amy/bg. 

Note  that  no  physics  was  required! 

Consideration  of  units,  however,  won’t  help  us  to  find  the  unit¬ 
less  constant  A.  Let  t  be  the  time  the  rod  takes  to  fall,  so  that 
(1  /2 )gt2  =  b/2.  If  the  rod  is  going  to  land  exactly  on  its  side, 
then  the  number  of  revolutions  it  completes  while  in  the  air  must 
be  1/4,  or  3/4,  or  5/4,  . . . ,  but  all  the  possibilities  greater  than  1/4 
would  cause  the  head  of  the  rod  to  collide  with  the  floor  prema¬ 
turely.  The  rod  must  therefore  rotate  at  a  rate  that  would  cause 
it  to  complete  a  full  rotation  in  a  time  T  =  At,  and  it  has  angular 
momentum  L=  (n/6)mb2/T. 

The  momentum  lost  by  the  object  striking  the  rod  is  p,  and  by 
conservation  of  momentum,  this  is  the  amount  of  momentum,  in 
the  horizontal  direction,  that  the  rod  acquires.  In  other  words, 
the  rod  will  fly  forward  a  little.  However,  this  has  no  effect  on 
the  solution  to  the  problem.  More  importantly,  the  object  striking 
the  rod  loses  angular  momentum  bp/2,  which  is  also  transferred 
to  the  rod.  Equating  this  to  the  expression  above  for  L,  we  find 
p  =  {Tx/\2)m^bg. 

Finally,  we  need  to  know  whether  this  can  really  be  done  without 
having  the  foot  of  the  rod  scrape  on  the  floor.  The  figure  shows 
that  the  answer  is  no  for  this  rod  of  finite  width,  but  it  appears 
that  the  answer  would  be  yes  for  a  sufficiently  thin  rod.  This  is 
analyzed  further  in  homework  problem  37  on  page  294. 
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4.3  Angular  Momentum  In  Three  Dimensions 

Conservation  of  angular  momentum  produces  some  surprising  phe¬ 
nomena  when  extended  to  three  dimensions.  Try  the  following  ex¬ 
periment,  for  example.  Take  off  your  shoe,  and  toss  it  in  to  the  air, 
making  it  spin  along  its  long  (toe-to-heel)  axis.  You  should  observe 
a  nice  steady  pattern  of  rotation.  The  same  happens  when  you  spin 
the  shoe  about  its  shortest  (top-to-bottom)  axis.  But  something 
unexpected  happens  when  you  spin  it  about  its  third  (left-to-right) 
axis,  which  is  intermediate  in  length  between  the  other  two.  Instead 
of  a  steady  pattern  of  rotation,  you  will  observe  something  more 
complicated,  with  the  shoe  changing  its  orientation  with  respect  to 
the  rotation  axis. 


4.3.1  Rigid-body  kinematics  in  three  dimensions 

How  do  we  generalize  rigid-body  kinematics  to  three  dimensions? 
When  we  wanted  to  generalize  the  kinematics  of  a  moving  particle 
to  three  dimensions,  we  made  the  numbers  r,  v,  and  a  into  vectors 
r,  v,  and  a.  This  worked  because  these  quantities  all  obeyed  the 
same  laws  of  vector  addition.  For  instance,  one  of  the  laws  of  vector 
addition  is  that,  just  like  addition  of  numbers,  vector  addition  gives 
the  same  result  regardless  of  the  order  of  the  two  quantities  being 
added.  Thus  you  can  step  sideways  1  m  to  the  right  and  then 
step  forward  1  m,  and  the  end  result  is  the  same  as  if  you  stepped 
forward  first  and  then  to  the  side.  In  order  words,  it  didn’t  matter 
whether  you  took  Ari  +  Ar2  or  Ar2  +  Aiq.  In  math  this  is  called 
the  commutative  property  of  addition. 


a  /  Performing  the  rotations  in  one 
order  gives  one  result,  3,  while  re¬ 
versing  the  order  gives  a  different 
result,  5. 


degrees  about  y.  (5)  Rotated  about 

y,  then  x. 


Angular  motion,  unfortunately  doesn’t  have  this  property,  as 
shown  in  figure  a.  Doing  a  rotation  about  the  x  axis  and  then 
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about  y  gives  one  result,  while  doing  them  in  the  opposite  order 
gives  a  different  result.  These  operations  don’t  “commute,”  i.e.,  it 
makes  a  difference  what  order  you  do  them  in. 

This  means  that  there  is  in  general  no  possible  way  to  construct 
a  AO  vector.  However,  if  you  try  doing  the  operations  shown  in 
figure  a  using  small  rotation,  say  about  10  degrees  instead  of  90, 
you’ll  find  that  the  result  is  nearly  the  same  regardless  of  what 
order  you  use;  small  rotations  are  very  nearly  commutative.  Not 
only  that,  but  the  result  of  the  two  10-degree  rotations  is  about  the 
same  as  a  single,  somewhat  larger,  rotation  about  an  axis  that  lies 
symmetrically  at  between  the  x  and  y  axes  at  45  degree  angles  to 
each  one.  This  is  exactly  what  we  would  expect  if  the  two  small 
rotations  did  act  like  vectors  whose  directions  were  along  the  axis 
of  rotation.  We  therefore  define  a  d 0  vector  whose  magnitude  is  the 
amount  of  rotation  in  units  of  radians,  and  whose  direction  is  along 
the  axis  of  rotation.  Actually  this  definition  is  ambiguous,  because 
there  it  could  point  in  either  direction  along  the  axis.  We  therefore 
use  a  right-hand  rule  as  shown  in  figure  b  to  define  the  direction  of 
the  d 6  vector,  and  the  u>  vector,  u  =  d 0/  dt,  based  on  it.  Aliens 
on  planet  Tammyfaye  may  decide  to  define  it  using  their  left  hands 
rather  than  their  right,  but  as  long  as  they  keep  their  scientific 
literature  separate  from  ours,  there  is  no  problem.  When  entering  a 
physics  exam,  always  be  sure  to  write  a  large  warning  note  on  your 
left  hand  in  magic  marker  so  that  you  won’t  be  tempted  to  use  it 
for  the  right-hand  rule  while  keeping  your  pen  in  your  right. 

self-check  D 

Use  the  right-hand  rule  to  determine  the  directions  of  the  cu  vectors  in 
each  rotation  shown  in  figures  a/1  through  a/5.  >  Answer,  p.  963 

Because  the  vector  relationships  among  d0,  and  a  are  strictly 
analogous  to  the  ones  involving  dr,  v,  and  a  (with  the  proviso  that 
we  avoid  describing  large  rotations  using  A 6  vectors),  any  operation 
in  r-v-a  vector  kinematics  has  an  exact  analog  in  6-ui-oi  kinematics. 

Result  of  successive  1 0-degree  rotations  example  23 

>  What  is  the  result  of  two  successive  (positive)  10-degree  rota¬ 
tions  about  the  x  and  y  axes?  That  is,  what  single  rotation  about  a 
single  axis  would  be  equivalent  to  executing  these  in  succession? 

o  The  result  is  only  going  to  be  approximate,  since  10  degrees 
is  not  an  infinitesimally  small  angle,  and  we  are  not  told  in  what 
order  the  rotations  occur.  To  some  approximation,  however,  we 
can  add  the  A0  vectors  in  exactly  the  same  way  we  would  add  Ar 
vectors,  so  we  have 


direction  of 

A6  and  co 


b/The  right-hand  rule  for 
associating  a  vector  with  a 
direction  of  rotation. 


A0  ~  A0-|  +  A02 

«  (10  degrees)*  +  (10  degrees)y. 

This  is  a  vector  with  a  magnitude  of  y/(10  deg)2  +  (10  deg)2  = 
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14  deg,  and  it  points  along  an  axis  midway  between  the  x  and  y 
axes. 

4.3.2  Angular  momentum  in  three  dimensions 

The  vector  cross  product 

In  order  to  expand  our  system  of  three-dimensional  kinematics  to 
include  dynamics,  we  will  have  to  generalize  equations  like  Vt  =  cor, 
t  =  rF  sin  6tf,  and  L  =  rp  sin  6rp,  each  of  which  involves  three 
quantities  that  we  have  either  already  defined  as  vectors  or  that  we 
want  to  redefine  as  vectors.  Although  the  first  one  appears  to  differ 
from  the  others  in  its  form,  it  could  just  as  well  be  rewritten  as 
vt  =  cur  sin  9ur ,  since  9ur  =  90°,  and  sin  9wr  =  1. 

It  thus  appears  that  we  have  discovered  something  general  about 
the  physically  useful  way  to  relate  three  vectors  in  a  multiplicative 
way:  the  magnitude  of  the  result  always  seems  to  be  proportional  to 
the  product  of  the  magnitudes  of  the  two  vectors  being  “multiplied,” 
and  also  to  the  sine  of  the  angle  between  them. 

Is  this  pattern  just  an  accident?  Actually  the  sine  factor  has 
a  very  important  physical  property:  it  goes  to  zero  when  the  two 
vectors  are  parallel.  This  is  a  Good  Thing.  The  generalization  of 
angular  momentum  into  a  three-dimensional  vector,  for  example,  is 
presumably  going  to  describe  not  just  the  clockwise  or  counterclock¬ 
wise  nature  of  the  motion  but  also  from  which  direction  we  would 
have  to  view  the  motion  so  that  it  was  clockwise  or  counterclock¬ 
wise.  (A  clock’s  hands  go  counterclockwise  as  seen  from  behind  the 
clock,  and  don’t  rotate  at  all  as  seen  from  above  or  to  the  side.)  Now 
suppose  a  particle  is  moving  directly  away  from  the  origin,  so  that 
its  r  and  p  vectors  are  parallel.  It  is  not  going  around  the  origin 
from  any  point  of  view,  so  its  angular  momentum  vector  had  better 
be  zero. 

Thinking  in  a  slightly  more  abstract  way,  we  would  expect  the 
angular  momentum  vector  to  point  perpendicular  to  the  plane  of 
motion,  just  as  the  angular  velocity  vector  points  perpendicular  to 
the  plane  of  motion.  The  plane  of  motion  is  the  plane  containing 
both  r  and  p,  if  we  place  the  two  vectors  tail-to-tail.  But  if  r  and 
p  are  parallel  and  are  placed  tail-to-tail,  then  there  are  infinitely 
many  planes  containing  them  both.  To  pick  one  of  these  planes  in 
preference  to  the  others  would  violate  the  symmetry  of  space,  since 
they  should  all  be  equally  good.  Thus  the  zero-if-parallel  property 
is  a  necessary  consequence  of  the  underlying  symmetry  of  the  laws 
of  physics. 

The  following  definition  of  a  kind  of  vector  multiplication  is  con¬ 
sistent  with  everything  we’ve  seen  so  far,  and  on  p.  952  we’ll  prove 
that  the  definition  is  unique,  i.e.,  if  we  believe  in  the  symmetry  of 
space,  it  is  essentially  the  only  way  of  defining  the  multiplication  of 
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two  vectors  to  produce  a  third  vector: 

Definition  of  the  vector  cross  product: 

The  cross  product  A  x  B  of  two  vectors  A  and  B  is  defined  as 
follows: 

(1)  Its  magnitude  is  defined  by  |AxB|  =  |A||B|  sin  9  ab,  where  Oab 
is  the  angle  between  A  and  B  when  they  are  placed  tail-to-tail. 

(2)  Its  direction  is  along  the  line  perpendicular  to  both  A  and  B. 
Of  the  two  such  directions,  it  is  the  one  that  obeys  the  right-hand 
rule  shown  in  figure  c. 

The  name  “cross  product”  refers  to  the  symbol,  and  distin¬ 
guishes  it  from  the  dot  product,  which  acts  on  two  vectors  but 
produces  a  scalar. 

Although  the  vector  cross-product  has  nearly  all  the  properties 
of  numerical  multiplication,  e.g.,  A  x  (B  +  C)  —  A  x  B  +  A  x  C,  it 
lacks  the  usual  property  of  commutativity.  Try  applying  the  right- 
hand  rule  to  find  the  direction  of  the  vector  cross  product  B  x  A 
using  the  two  vectors  shown  in  the  figure.  This  requires  starting 
with  a  flattened  hand  with  the  four  fingers  pointing  along  B,  and 
then  curling  the  hand  so  that  the  fingers  point  along  A.  The  only 
possible  way  to  do  this  is  to  point  your  thumb  toward  the  floor,  in 
the  opposite  direction.  Thus  for  the  vector  cross  product  we  have 

A  x  B  =  — B  x  A, 

a  property  known  as  anticommutativity.  The  vector  cross  product 
is  also  not  associative,  i.e.,  A  x  (B  x  C)  is  usually  not  the  same  as 
(A  x  B)  x  C. 

A  geometric  interpretation  of  the  cross  product,  d,  is  that  if  both 
A  and  B  are  vectors  with  units  of  distance,  then  the  magnitude  of 
their  cross  product  can  be  interpreted  as  the  area  of  the  parallelo¬ 
gram  they  form  when  placed  tail-to-tail. 

A  useful  expression  for  the  components  of  the  vector  cross  prod¬ 
uct  in  terms  of  the  components  of  the  two  vectors  being  multiplied 
is  as  follows: 


(A  x  B)x  =  AyBz  ByAz 
(A  x  B)y  =  AZBX  —  BZAX 
(A  x  B)z  =  AxBy  BxAy 

I’ll  prove  later  that  these  expressions  are  equivalent  to  the  previ¬ 
ous  definition  of  the  cross  product.  Although  they  may  appear 
formidable,  they  have  a  simple  structure:  the  subscripts  on  the  right 
are  the  other  two  besides  the  one  on  the  left,  and  each  equation  is 
related  to  the  preceding  one  by  a  cyclic  change  in  the  subscripts, 
e.  If  the  subscripts  were  not  treated  in  some  completely  symmetric 


AxB 


c/The  right-hand  rule  for 
the  direction  of  the  vector  cross 
product. 


d/The  magnitude  of  the  cross 
product  is  the  area  of  the  shaded 
parallelogram. 


e/A  cyclic  change  in  the  x, 
y,  and  z  subscripts. 
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manner  like  this,  then  the  definition  would  provide  some  way  to  dis¬ 
tinguish  one  axis  from  another,  which  would  violate  the  symmetry 
of  space. 

self-check  E 

Show  that  the  component  equations  are  consistent  with  the  rule  A  x  B  = 
-B  x  A.  >  Answer,  p.  963 

Angular  momentum  in  three  dimensions 

In  terms  of  the  vector  cross  product,  we  have: 

v  =  uj  x  r 
L  =  r  x  p 
r  =  r  x  F 


f/The  position  and  momen¬ 
tum  vectors  of  an  atom  in  the 
spinning  top. 


g/The  right-hand  rule  for 
the  atom’s  contribution  to  the 
angular  momentum. 


But  wait,  how  do  we  know  these  equations  are  even  correct? 
For  instance,  how  do  we  know  that  the  quantity  defined  by  r  x  p 
is  in  fact  conserved?  Well,  just  as  we  saw  on  page  212  that  the 
dot  product  is  unique  (i.e.,  can  only  be  defined  in  one  way  while 
observing  rotational  invariance),  the  cross  product  is  also  unique, 
as  proved  on  page  952.  If  r  x  p  was  not  conserved,  then  there  could 
not  be  any  generally  conserved  quantity  that  would  reduce  to  our 
old  definition  of  angular  momentum  in  the  special  case  of  plane 
rotation.  This  doesn’t  prove  conservation  of  angular  momentum 
-  only  experiments  can  prove  that  —  but  it  does  prove  that  if 
angular  momentum  is  conserved  in  three  dimensions,  there  is  only 
one  possible  way  to  generalize  from  two  dimensions  to  three. 

Angular  momentum  of  a  spinning  top  example  24 

As  an  illustration,  we  consider  the  angular  momentum  of  a  spin¬ 
ning  top.  Figures  f  and  g  show  the  use  of  the  vector  cross  prod¬ 
uct  to  determine  the  contribution  of  a  representative  atom  to  the 
total  angular  momentum.  Since  every  other  atom’s  angular  mo¬ 
mentum  vector  will  be  in  the  same  direction,  this  will  also  be  the 
direction  of  the  total  angular  momentum  of  the  top.  This  happens 
to  be  rigid-body  rotation,  and  perhaps  not  surprisingly,  the  angu¬ 
lar  momentum  vector  is  along  the  same  direction  as  the  angular 
velocity  vector. 

Three  important  points  are  illustrated  by  this  example:  (1) 
When  we  do  the  full  three-dimensional  treatment  of  angular  mo¬ 
mentum,  the  “axis”  from  which  we  measure  the  position  vectors  is 
just  an  arbitrarily  chosen  point.  If  this  had  not  been  rigid-body 
rotation,  we  would  not  even  have  been  able  to  identify  a  single  line 
about  which  every  atom  circled.  (2)  Starting  from  figure  f,  we  had 
to  rearrange  the  vectors  to  get  them  tail-to-tail  before  applying  the 
right-hand  rule.  If  we  had  attempted  to  apply  the  right-hand  rule 
to  figure  f,  the  direction  of  the  result  would  have  been  exactly  the 
opposite  of  the  correct  answer.  (3)  The  equation  L  =  r  X  p  cannot 
be  applied  all  at  once  to  an  entire  system  of  particles.  The  total 
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momentum  of  the  top  is  zero,  which  would  give  an  erroneous  result 
of  zero  angular  momentum  (never  mind  the  fact  that  r  is  not  well 
defined  for  the  top  as  a  whole) . 

Doing  the  right-hand  rule  like  this  requires  some  practice.  I 
urge  you  to  make  models  like  g  out  of  rolled  up  pieces  of  paper  and 
to  practice  with  the  model  in  various  orientations  until  it  becomes 
natural. 

Precession  example  25 

Figure  h  shows  a  counterintuitive  example  of  the  concepts  we’ve 
been  discussing.  One  expects  the  torque  due  to  gravity  to  cause 
the  top  to  flop  down.  Instead,  the  top  remains  spinning  in  the  hor¬ 
izontal  plane,  but  its  axis  of  rotation  starts  moving  in  the  direction 
shown  by  the  shaded  arrow.  This  phenomenon  is  called  preces¬ 
sion.  Figure  i  shows  that  the  torque  due  to  gravity  is  out  of  the 
page.  (Actually  we  should  add  up  all  the  torques  on  all  the  atoms 
in  the  top,  but  the  qualitative  result  is  the  same.)  Since  torque 
is  the  rate  of  change  of  angular  momentum,  t  =  dL/df,  the  AL 
vector  must  be  in  the  same  direction  as  the  torque  (division  by 
a  positive  scalar  doesn’t  change  the  direction  of  the  vector).  As 
shown  in  j,  this  causes  the  angular  momentum  vector  to  twist  in 
space  without  changing  its  magnitude. 

For  similar  reasons,  the  Earth’s  axis  precesses  once  every  26,000 
years  (although  not  through  a  great  circle,  since  the  angle  between 
the  axis  and  the  force  isn’t  90  degrees  as  in  figure  h).  This  pre¬ 
cession  is  due  to  a  torque  exerted  by  the  moon.  If  the  Earth  was 
a  perfect  sphere,  there  could  be  no  precession  effect  due  to  sym¬ 
metry.  However,  the  Earth’s  own  rotation  causes  it  to  be  slightly 
flattened  (oblate)  relative  to  a  perfect  sphere,  giving  it  “love  han¬ 
dles”  on  which  the  moon’s  gravity  can  act.  The  moon’s  gravity  on 
the  nearer  side  of  the  equatorial  bulge  is  stronger,  so  the  torques  do 
not  cancel  out  perfectly.  Presently  the  earth’s  axis  very  nearly  lines 
up  with  the  star  Polaris,  but  in  12,000  years,  the  pole  star  will  be 
Vega  instead. 

The  frisbee  example  26 

The  flow  of  the  air  over  a  flying  frisbee  generates  lift,  and  the  lift 
at  the  front  and  back  of  the  frisbee  isn’t  necessarily  balanced.  If 
you  throw  a  frisbee  without  rotating  it,  as  if  you  were  shooting  a 
basketball  with  two  hands,  you’ll  find  that  it  pitches,  i.e. ,  its  nose 
goes  either  up  or  down.  When  I  do  this  with  my  frisbee,  it  goes 
nose  down,  which  apparently  means  that  the  lift  at  the  back  of 
the  disc  is  greater  than  the  lift  at  the  front.  The  two  torques  are 
unbalanced,  resulting  in  a  total  torque  that  points  to  the  left. 

The  way  you  actually  throw  a  frisbee  is  with  one  hand,  putting  a 
lot  of  spin  on  it.  If  you  throw  backhand,  which  is  how  most  peo¬ 
ple  first  learn  to  do  it,  the  angular  momentum  vector  points  down 
(assuming  you’re  right-handed).  On  my  frisbee,  the  aerodynamic 


h  /  A  top  is  supported  at  its 
tip  by  a  pinhead.  (More  practical 
devices  to  demonstrate  this 
would  use  a  double  bearing.) 

r  « - N, 

x  x  =  rxF 

>  ' 

F 

i  /  The  torque  made  by  grav¬ 
ity  is  in  the  horizontal  plane. 


L 


j  /  The  AL  vector  is  in  the 
same  direction  as  the  torque,  out 
of  the  page. 
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torque  to  the  left  would  therefore  tend  to  make  the  angular  mo¬ 
mentum  vector  precess  in  the  clockwise  direction  as  seen  by  the 
thrower.  This  would  cause  the  disc  to  roll  to  the  right,  and  there¬ 
fore  follow  a  curved  trajectory.  Some  specialized  discs,  used  in 
the  sport  of  disc  golf,  are  actually  designed  intentionally  to  show 
this  behavior;  they’re  known  as  “understable”  discs.  However,  the 
typical  frisbee  that  most  people  play  with  is  designed  to  be  stable: 
as  the  disc  rolls  to  one  side,  the  airflow  around  it  is  altered  in  way 
that  tends  to  bring  the  disc  back  into  level  flight.  Such  a  disc  will 
therefore  tend  to  fly  in  a  straight  line,  provided  that  it  is  thrown 
with  enough  angular  momentum. 

0  Finding  a  cross  product  by  components  example  27 

3  >  What  is  the  torque  produced  by  a  force  given  by  x  +  2y  +  3z  (in 

units  of  Newtons)  acting  on  a  point  whose  radius  vector  is  4x  +  5y 
k  /  Example  27.  (in  meters)? 

>  It’s  helpful  to  make  a  table  of  the  components  as  shown  in  the 
figure.  The  results  are 

tx  =  ryFz  -  Fyrz  =  1 5  N-m 
Ty  =  rzFx  -  Fzrx  =-12  N-m 
Tz  =  rxFy  -  Fxry  =  3  N-m 


r  4  5 

F  1  2 


Torque  and  angular  momentum  example  28 

In  this  example,  we  prove  explicitly  the  consistency  of  the  equa¬ 
tions  involving  torque  and  angular  momentum  that  we  proved 
above  based  purely  on  symmetry.  Starting  from  the  definition  of 
torque,  we  have 


_  dL 
“  d J 
d  \  ^ 

i 

-  E  ^<r'  *  P')- 

/ 


The  derivative  of  a  cross  product  can  be  evaluated  in  the  same 
way  as  the  derivative  of  an  ordinary  scalar  product: 


-E 


dr, 

d7xp' 


+  r,  x 


dp / 
df 


The  first  term  is  zero  for  each  particle,  since  the  velocity  vector  is 
parallel  to  the  momentum  vector.  The  derivative  appearing  in  the 
second  term  is  the  force  acting  on  the  particle,  so 


t  =  r> x  F'> 
/ 


which  is  the  relationship  we  set  out  to  prove. 
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4.3.3  Rigid-body  dynamics  in  three  dimensions 

The  student  who  is  not  madly  in  love  with  mathematics  may 
wish  to  skip  the  rest  of  this  section  after  absorbing  the  statement 
that,  for  a  typical,  asymmetric  object,  the  angular  momentum  vector 
and  the  angular  velocity  vector  need  not  be  parallel.  That  is,  only 
for  a  body  that  possesses  symmetry  about  the  rotation  axis  is  it  true 
that  L  =  Ilj  (the  rotational  equivalent  of  p  =  mv )  for  some  scalar 
I. 

Let’s  evaluate  the  angular  momentum  of  a  rigidly  rotating  sys¬ 
tem  of  particles: 

L  =  ^  Ti  X  Pi 

i 

=  Y2  x  v* 

i 

=  ^  miri  X  (UJ  X  I 'i) 
i 

An  important  mathematical  skill  is  to  know  when  to  give  up  and 
back  off.  This  is  a  complicated  expression,  and  there  is  no  reason 
to  expect  it  to  simplify  and,  for  example,  take  the  form  of  a  scalar 
multiplied  by  u ;.  Instead  we  examine  its  general  characteristics.  If 
we  expanded  it  using  the  equation  that  gives  the  components  of  a 
vector  cross  product,  every  term  would  have  one  of  the  uj  compo¬ 
nents  raised  to  the  first  power,  multiplied  by  a  bunch  of  other  stuff. 
The  most  general  possible  form  for  the  result  is 

Lx  =  Ixx^x  T  Ixy^y  T  Lrz^'z 

Ly  —  Iyx^x  T  lyy^y  T  ^yz^z 

Lz  —  Izx Wx  T  Izy^Jy  T  Izz^zi 

which  you  may  recognize  as  a  case  of  matrix  multiplication.  The 
moment  of  inertia  is  not  a  scalar,  and  not  a  three-component  vector. 
It  is  a  matrix  specified  by  nine  numbers,  called  its  matrix  elements. 

The  elements  of  the  moment  of  inertia  matrix  will  depend  on  our 
choice  of  a  coordinate  system.  In  general,  there  will  be  some  special 
coordinate  system,  in  which  the  matrix  has  a  simple  diagonal  form: 

Lx  =  Ixx^x 

Ly  =  lyy^y 

Lz  —  Izzujz. 

The  three  special  axes  that  cause  this  simplification  are  called 
the  principal  axes  of  the  object,  and  the  corresponding  coordinate 
system  is  the  principal  axis  system.  For  symmetric  shapes  such  as 
a  rectangular  box  or  an  ellipsoid,  the  principal  axes  lie  along  the 
intersections  of  the  three  symmetry  planes,  but  even  an  asymmetric 
body  has  principal  axes. 
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We  can  also  generalize  the  plane-rotation  equation  K  =  ( 1/2)Iuj 2 
to  three  dimensions  as  follows: 

K  =  Y 

i 

=  \  Y  rn^uj  x  r*)  ■  x  r*) 

i 

We  want  an  equation  involving  the  moment  of  inertia,  and  this  has 
some  evident  similarities  to  the  sum  we  originally  wrote  down  for 
the  moment  of  inertia.  To  massage  it  into  the  right  shape,  we  need 
the  vector  identity  (A  x  B)  C  =  (B  x  C)  •  A,  which  we  state  without 
proof.  We  then  write 

K  =7)  Y  mi  x  (w  x  r*)]  '  u 

i 

=  \u>  'Y  miri  X  (w  X  rj) 
i 

=  -L  •  id 
2 

As  a  reward  for  all  this  hard  work,  let’s  analyze  the  problem  of 
the  spinning  shoe  that  I  posed  at  the  beginning  of  the  chapter.  The 
three  rotation  axes  referred  to  there  are  approximately  the  principal 
axes  of  the  shoe.  While  the  shoe  is  in  the  air,  no  external  torques  are 
acting  on  it,  so  its  angular  momentum  vector  must  be  constant  in 
magnitude  and  direction.  Its  kinetic  energy  is  also  constant.  That’s 
in  the  room’s  frame  of  reference,  however.  The  principal  axis  frame 
is  attached  to  the  shoe,  and  tumbles  madly  along  with  it.  In  the 
principal  axis  frame,  the  kinetic  energy  and  the  magnitude  of  the 
angular  momentum  stay  constant,  but  the  actual  direction  of  the 
angular  momentum  need  not  stay  fixed  (as  you  saw  in  the  case 
of  rotation  that  was  initially  about  the  intermediate-length  axis). 
Constant  jLj  gives 

L2  +  Ly  +  L2  =  constant. 

In  the  principal  axis  frame,  it’s  easy  to  solve  for  the  components 
of  id  in  terms  of  the  components  of  L,  so  we  eliminate  oj  from  the 
expression  2  K  =  L  •  u,  giving 

j—L2x  +  j—L2  +  L2  =  constant  #2. 

J-xx  lyy  -Lzz 

The  first  equation  is  the  equation  of  a  sphere  in  the  three  di¬ 
mensional  space  occupied  by  the  angular  momentum  vector,  while 
the  second  one  is  the  equation  of  an  ellipsoid.  The  top  figure  cor¬ 
responds  to  the  case  of  rotation  about  the  shortest  axis,  which  has 
the  greatest  moment  of  inertia  element.  The  intersection  of  the  two 
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surfaces  consists  only  of  the  two  points  at  the  front  and  back  of  the 
sphere.  The  angular  momentum  is  confined  to  one  of  these  points, 
and  can’t  change  its  direction,  i.e. ,  its  orientation  with  respect  to  the 
principal  axis  system,  which  is  another  way  of  saying  that  the  shoe 
can’t  change  its  orientation  with  respect  to  the  angular  momentum 
vector.  In  the  bottom  figure,  the  shoe  is  rotating  about  the  longest 
axis.  Now  the  angular  momentum  vector  is  trapped  at  one  of  the 
two  points  on  the  right  or  left.  In  the  case  of  rotation  about  the 
axis  with  the  intermediate  moment  of  inertia  element,  however,  the 
intersection  of  the  sphere  and  the  ellipsoid  is  not  just  a  pair  of  iso¬ 
lated  points  but  the  curve  shown  with  the  dashed  line.  The  relative 
orientation  of  the  shoe  and  the  angular  momentum  vector  can  and 
will  change. 

One  application  of  the  moment  of  inertia  tensor  is  to  video  games 
that  simulate  car  racing  or  flying  airplanes. 

One  more  exotic  example  has  to  do  with  nuclear  physics.  Al¬ 
though  you  have  probably  visualized  atomic  nuclei  as  nothing  more 
than  featureless  points,  or  perhaps  tiny  spheres,  they  are  often  el¬ 
lipsoids  with  one  long  axis  and  two  shorter,  equal  ones.  Although 
a  spinning  nucleus  normally  gets  rid  of  its  angular  momentum  via 
gamma  ray  emission  within  a  period  of  time  on  the  order  of  picosec¬ 
onds,  it  may  happen  that  a  deformed  nucleus  gets  into  a  state  in 
which  has  a  large  angular  momentum  is  along  its  long  axis,  which 
is  a  very  stable  mode  of  rotation.  Such  states  can  live  for  seconds 
or  even  years!  (There  is  more  to  the  story  —  this  is  the  topic  on 
which  I  wrote  my  Ph.D.  thesis  —  but  the  basic  insight  applies  even 
though  the  full  treatment  requires  fancy  quantum  mechanics.) 

Our  analysis  has  so  far  assumed  that  the  kinetic  energy  of  ro¬ 
tation  energy  can’t  be  converted  into  other  forms  of  energy  such  as 
heat,  sound,  or  vibration.  When  this  assumption  fails,  then  rota¬ 
tion  about  the  axis  of  least  moment  of  inertia  becomes  unstable, 
and  will  eventually  convert  itself  into  rotation  about  the  axis  whose 
moment  of  inertia  is  greatest.  This  happened  to  the  U.S.’s  first  ar¬ 
tificial  satellite,  Explorer  I,  launched  in  1958.  Note  the  long,  floppy 
antennas,  which  tended  to  dissipate  kinetic  energy  into  vibration.  It 
had  been  designed  to  spin  about  its  minimimum-moment-of-inertia 
axis,  but  almost  immediately,  as  soon  as  it  was  in  space,  it  began 
spinning  end  over  end.  It  was  nevertheless  able  to  carry  out  its 
science  mission,  which  didn’t  depend  on  being  able  to  maintain  a 
stable  orientation,  and  it  discovered  the  Van  Allen  radiation  belts. 

This  chapter  is  summarized  on  page  993.  Notation  and  terminology 
are  tabulated  on  pages  984-985. 


I  /  Visualizing  surfaces  of 
constant  energy  and  angular 
momentum  in  Lx-Ly-Lz  space. 


m  /  The  Explorer  I  satellite. 
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Problem  6. 


Problems 

The  symbols  C,  ^ ,  etc.  are  explained  on  page  296. 

1  The  figure  shows  scale  drawing  of  a  pair  of  pliers  being 

used  to  crack  a  nut,  with  an  appropriately  reduced  centimeter  grid. 
Warning:  do  not  attempt  this  at  home;  it  is  bad  manners.  If  the 
force  required  to  crack  the  nut  is  300  N,  estimate  the  force  required 
of  the  person’s  hand.  >  Solution,  p.  976  - 

2  You  are  trying  to  loosen  a  stuck  bolt  on  your  RV  using  a  big 

wrench  that  is  50  cm  long.  If  you  hang  from  the  wrench,  and  your 
mass  is  55  kg,  what  is  the  maximum  torque  you  can  exert  on  the 
bolt?  V 

3  A  physical  therapist  wants  her  patient  to  rehabilitate  his  in¬ 

jured  elbow  by  laying  his  arm  flat  on  a  table,  and  then  lifting  a  2.1 
kg  mass  by  bending  his  elbow.  In  this  situation,  the  weight  is  33 
cm  from  his  elbow.  He  calls  her  back,  complaining  that  it  hurts  him 
to  grasp  the  weight.  He  asks  if  he  can  strap  a  bigger  weight  onto 
his  arm,  only  17  cm  from  his  elbow.  How  much  mass  should  she 
tell  him  to  use  so  that  he  will  be  exerting  the  same  torque?  (He  is 
raising  his  forearm  itself,  as  well  as  the  weight.)  V  ^ 

4  An  object  thrown  straight  up  in  the  air  is  momentarily  at  rest 
when  it  reaches  the  top  of  its  motion.  Does  that  mean  that  it  is  in 
equilibrium  at  that  point?  Explain. 

5  An  object  is  observed  to  have  constant  angular  momentum. 
Can  you  conclude  that  no  torques  are  acting  on  it?  Explain.  [Based 
on  a  problem  by  Serway  and  Faughn.] 

6  A  person  of  mass  m  stands  on  the  ball  of  one  foot.  Find  the 

tension  in  the  calf  muscle  and  the  force  exerted  by  the  shinbones 
on  the  bones  of  the  foot,  in  terms  of  rri,  g,  a,  and  b.  For  simplicity, 
assume  that  all  the  forces  are  at  90-degree  angles  to  the  foot,  i.e. , 
neglect  the  angle  between  the  foot  and  the  floor.  v  ^ 

7  Two  pointlike  particles  have  the  same  momentum  vector.  Can 

you  conclude  that  their  angular  momenta  are  the  same?  Explain. 
[Based  on  a  problem  by  Serway  and  Faughn.]  - 

8  The  box  shown  in  the  figure  is  being  accelerated  by  pulling  on 
it  with  the  rope. 

(a)  Assume  the  floor  is  frictionless.  What  is  the  maximum  force 
that  can  be  applied  without  causing  the  box  to  tip  over? 

>  Hint,  p.  959  V 

(b)  Repeat  part  a,  but  now  let  the  coefficient  of  friction  be  /i.  V 

(c)  What  happens  to  your  answer  to  part  b  when  the  box  is  suffi¬ 
ciently  tall?  How  do  you  interpret  this? 
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9  A  uniform  ladder  of  mass  m  and  length  t  leans  against  a  smooth 
wall,  making  an  angle  6  with  respect  to  the  ground.  The  dirt  exerts 
a  normal  force  and  a  frictional  force  on  the  ladder,  producing  a  force 
vector  with  magnitude  Tj  at  an  angle  0  with  respect  to  the  ground. 
Since  the  wall  is  smooth,  it  exerts  only  a  normal  force  on  the  ladder; 
let  its  magnitude  be  F2. 

(a)  Explain  why  0  must  be  greater  than  6.  No  math  is  needed. 

(b)  Choose  any  numerical  values  you  like  for  m  and  £,  and  show  that 

the  ladder  can  be  in  equilibrium  (zero  torque  and  zero  total  force 
vector)  for  0=45. 00° and  0=63.43°.  ■ 

10  Continuing  problem  9,  find  an  equation  for  0  in  terms  of 

9,  and  show  that  m  and  L  do  not  enter  into  the  equation.  Do  not 
assume  any  numerical  values  for  any  of  the  variables.  You  will  need 
the  trig  identity  sin(a  —  b)  =  sin  a  cos  b  —  sin  b  cos  a.  (As  a  numerical 
check  on  your  result,  you  may  wish  to  check  that  the  angles  given 
in  problem  9b  satisfy  your  equation.)  C 

11  (a)  Find  the  minimum  horizontal  force  which,  applied  at 
the  axle,  will  pull  a  wheel  over  a  step.  Invent  algebra  symbols  for 
whatever  quantities  you  find  to  be  relevant,  and  give  your  answer 
in  symbolic  form. 

(b)  Under  what  circumstances  does  your  result  become  infinite? 
Give  a  physical  interpretation.  What  happens  to  your  answer  when 
the  height  of  the  curb  is  zero?  Does  this  make  sense? 

>  Hint,  p.  959  1 

12  A  ball  is  connected  by  a  string  to  a  vertical  post.  The  ball  is 
set  in  horizontal  motion  so  that  it  starts  winding  the  string  around 
the  post.  Assume  that  the  motion  is  confined  to  a  horizontal  plane, 
i.e.,  ignore  gravity.  Michelle  and  Astrid  are  trying  to  predict  the 
final  velocity  of  the  ball  when  it  reaches  the  post.  Michelle  says 
that  according  to  conservation  of  angular  momentum,  the  ball  has 
to  speed  up  as  it  approaches  the  post.  Astrid  says  that  according  to 
conservation  of  energy,  the  ball  has  to  keep  a  constant  speed.  Who 
is  right?  [Hint:  How  is  this  different  from  the  case  where  you  whirl 
a  rock  in  a  circle  on  a  string  and  gradually  reel  in  the  string?] 

13  In  the  1950’s,  serious  articles  began  appearing  in  magazines 
like  Life  predicting  that  world  domination  would  be  achieved  by  the 
nation  that  could  put  nuclear  bombs  in  orbiting  space  stations,  from 
which  they  could  be  dropped  at  will.  In  fact  it  can  be  quite  difficult 
to  get  an  orbiting  object  to  come  down.  Let  the  object  have  energy 
E  =  K  +  U  and  angular  momentum  L.  Assume  that  the  energy 
is  negative,  i.e.,  the  object  is  moving  at  less  than  escape  velocity. 
Show  that  it  can  never  reach  a  radius  less  than 

_GMm(  j  2EL2  \ 

mm  ~  2 E  (  +  V  +  GHIV )  ' 

[Note  that  both  factors  are  negative,  giving  a  positive  result.] 


Problems  9  and  10. 


Problem  1 1 . 
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Problem  14. 


14  (a)  The  bar  of  mass  m  is  attached  at  the  wall  with  a  hinge, 

and  is  supported  on  the  right  by  a  massless  cable.  Find  the  tension, 
T,  in  the  cable  in  terms  of  the  angle  9.  V 

(b)  Interpreting  your  answer  to  part  a,  what  would  be  the  best  angle 
to  use  if  we  wanted  to  minimize  the  strain  on  the  cable? 

(c)  Again  interpreting  your  answer  to  part  a,  for  what  angles  does 
the  result  misbehave  mathematically?  Interpet  this  physically. 


15  (a)  The  two  identical  rods  are  attached  to  one  another  with 

a  hinge,  and  are  supported  by  the  two  massless  cables.  Find  the 
angle  a  in  terms  of  the  angle  /3,  and  show  that  the  result  is  a  purely 
geometric  one,  independent  of  the  other  variables  involved.  V 

(b)  Using  your  answer  to  part  a,  sketch  the  configurations  for  /3  0, 

f3  =  45°,  and  /?  =  90°.  Do  your  results  make  sense  intuitively?  - 

16  Two  bars  of  length  t  are  connected  with  a  hinge  and  placed  on 
a  frictionless  cylinder  of  radius  r.  (a)  Show  that  the  angle  9  shown 
in  the  figure  is  related  to  the  unitless  ratio  v/i  by  the  equation 


r  cos2  9 
l  2  tan# 


Problem  15. 


(b)  Discuss  the  physical  behavior  of  this  equation  for  very  large  and 
very  small  values  of  rjl.  B 


17  You  wish  to  determine  the  mass  of  a  ship  in  a  bottle  without 
taking  it  out.  Show  that  this  can  be  done  with  the  setup  shown  in 
the  figure,  with  a  scale  supporting  the  bottle  at  one  end,  provided 
that  it  is  possible  to  take  readings  with  the  ship  slid  to  several 
different  locations.  Note  that  you  can’t  determine  the  position  of 
the  ship’s  center  of  mass  just  by  looking  at  it,  and  likewise  for  the 
bottle.  In  particular,  you  can’t  just  say,  “position  the  ship  right 
on  top  of  the  fulcrum”  or  “position  it  right  on  top  of  the  balance.” 


18  Suppose  that  we  lived  in  a  universe  in  which  Newton’s  law  of 
gravity  gave  an  interaction  energy  proportional  to  r~6,  rather  than 
r”1.  Which,  if  any,  of  Kepler’s  laws  would  still  be  true?  Which 
would  be  completely  false?  Which  would  be  different,  but  in  a  way 
that  could  be  calculated  with  straightforward  algebra? 
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19  Use  analogies  to  find  the  equivalents  of  the  following  equations 
for  rotation  in  a  plane: 

K  E  =  p2 /2m 
Ax  =  v0A  t  +  (1/2)  a  At2 
W  =  FAx 

Example:  v  =  Ax/ At  — >  ix  =  A9 / At 


20  For  a  one-dimensional  harmonic  oscillator,  the  solution  to 
the  energy  conservation  equation, 


U  +  K 


1,  2  1  2 

-kx  -| — mv  =  constant, 
2  2 


is  an  oscillation  with  frequency  ix  =  \fkjrn. 

Now  consider  an  analogous  system  consisting  of  a  bar  magnet  hung 
from  a  thread,  which  acts  like  a  magnetic  compass.  A  normal  com¬ 
pass  is  full  of  water,  so  its  oscillations  are  strongly  damped,  but 
the  magnet-on-a-thread  compass  has  very  little  friction,  and  will  os¬ 
cillate  repeatedly  around  its  equilibrium  direction.  The  magnetic 
energy  of  the  bar  magnet  is 


U  =  —Bm  cos  9, 


where  B  is  a  constant  that  measures  the  strength  of  the  earth’s 
magnetic  field,  m  is  a  constant  that  parametrizes  the  strength  of 
the  magnet,  and  9  is  the  angle,  measured  in  radians,  between  the 
bar  magnet  and  magnetic  north.  The  equilibrium  occurs  at  9  =  0, 
which  is  the  minimum  of  U . 

(a)  Problem  19  on  p.  291  gave  some  examples  of  how  to  construct 

analogies  between  rotational  and  linear  motion.  Using  the  same 
technique,  translate  the  equation  defining  the  linear  quantity  k  to 
one  that  defines  an  analogous  angular  one  k  (Greek  letter  kappa). 
Applying  this  to  the  present  example,  find  an  expression  for  k.  (As¬ 
sume  the  thread  is  so  thin  that  its  stiffness  does  not  have  any  sig¬ 
nificant  effect  compared  to  earth’s  magnetic  field.)  V 

(b)  Find  the  frequency  of  the  compass’s  vibrations.  V  ^ 


21  (a)  Find  the  angular  velocities  of  the  earth’s  rotation  and  of 

the  earth’s  motion  around  the  sun.  V 

(b)  Which  motion  involves  the  greater  acceleration? 
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Problem  22. 


€ 

€ 


^  ^  ^  ^  ^ 


Problem  23 


22  The  sun  turns  on  its  axis  once  every  26.0  days.  Its  mass 
is  2.0  x  1030  kg  and  its  radius  is  7.0  x  108  m.  Assume  it  is  a  rigid 
sphere  of  uniform  density. 

(a)  What  is  the  sun’s  angular  momentum?  V 

In  a  few  billion  years,  astrophysicists  predict  that  the  sun  will  use 
up  all  its  sources  of  nuclear  energy,  and  will  collapse  into  a  ball  of 
exotic,  dense  matter  known  as  a  white  dwarf.  Assume  that  its  radius 
becomes  5.8  x  106  m  (similar  to  the  size  of  the  Earth.)  Assume  it 
does  not  lose  any  mass  between  now  and  then.  (Don’t  be  fooled 
by  the  photo,  which  makes  it  look  like  nearly  all  of  the  star  was 
thrown  off  by  the  explosion.  The  visually  prominent  gas  cloud  is 
actually  thinner  than  the  best  laboratory  vacuum  ever  produced  on 
earth.  Certainly  a  little  bit  of  mass  is  actually  lost,  but  it  is  not  at 
all  unreasonable  to  make  an  approximation  of  zero  loss  of  mass  as 
we  are  doing.) 

(b)  What  will  its  angular  momentum  be? 

(c)  How  long  will  it  take  to  turn  once  on  its  axis?  V 

■ 

23  Give  a  numerical  comparison  of  the  two  molecules’  moments 

of  inertia  for  rotation  in  the  plane  of  the  page  about  their  centers 
of  mass.  V  ■ 

24  A  yo-yo  of  total  mass  m  consists  of  two  solid  cylinders 

of  radius  R,  connected  by  a  small  spindle  of  negligible  mass  and 
radius  r.  The  top  of  the  string  is  held  motionless  while  the  string 
unrolls  from  the  spindle.  Show  that  the  acceleration  of  the  yo-yo  is 
g/(l  +  R2/2r2).  m 

25  Show  that  a  sphere  of  radius  R  that  is  rolling  without  slipping 
has  angular  momentum  and  momentum  in  the  ratio  L/p  =  (2/5 )R. 

m 


26  Suppose  a  bowling  ball  is  initially  thrown  so  that  it  has  no 

angular  momentum  at  all,  i.e. ,  it  is  initially  just  sliding  down  the 
lane.  Eventually  kinetic  friction  will  bring  its  angular  velocity  up  to 
the  point  where  it  is  rolling  without  slipping.  Show  that  the  final 
velocity  of  the  ball  equals  5/7  of  its  initial  velocity.  You’ll  need  the 
result  of  problem  25.  ■ 

27  Find  the  angular  momentum  of  a  particle  whose  position  is 

r  =  3x  —  y  +  z  (in  meters)  and  whose  momentum  is  p  =  — 2x  +  y  +  z 
(in  kg-m/s).  v 
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28  Find  a  vector  that  is  perpendicular  to  both  of  the  following 
two  vectors: 


x  +  2y  +  3z 
4x  +  5y  +  6z 

V  ■ 


29  Prove  property  (3)  of  the  vector  cross  product  from  the 
theorem  on  page  952.  - 


30  Prove  the  anticommutative  property  of  the  vector  cross  prod¬ 
uct,  AxB  =  —  BxA,  using  the  expressions  for  the  components  of 
the  cross  product. 


31  Find  three  vectors  with  which  you  can  demonstrate  that  the 
vector  cross  product  need  not  be  associative,  i.e. ,  that  A  x  (B  x  C) 
need  not  be  the  same  as  (A  x  B)  x  C.  - 


32  Which  of  the  following  expressions  make  sense,  and  which  are 
nonsense?  For  those  that  make  sense,  indicate  whether  the  result  is 
a  vector  or  a  scalar. 

(a)  (A  x  B)  x  C 

(b)  (A  x  B)  •  C 

(c)  (A  •  B)  x  C 


33  (a)  As  suggested  in  the  figure,  find  the  area  of  the  infinites¬ 

imal  region  expressed  in  polar  coordinates  as  lying  between  r  and 
r  +  dr  and  between  6  and  6  +  dd.  V 

(b)  Generalize  this  to  find  the  infinitesimal  element  of  volume  in 

cylindrical  coordinates  ( r,6,z ),  where  the  Cartesian  z  axis  is  per¬ 
pendicular  to  the  directions  measured  by  r  and  9.  V 

(c)  Find  the  moment  of  inertia  for  rotation  about  its  axis  of  a  cone 

whose  mass  is  M,  whose  height  is  h,  and  whose  base  has  a  radius 
b.  V  ^ 


,  '^r  d0 

Problem  33 


34  Find  the  moment  of  inertia  of  a  solid  rectangular  box  of  mass 
M  and  uniform  density,  whose  sides  are  of  length  a,  b,  and  c,  for 
rotation  about  an  axis  through  its  center  parallel  to  the  edges  of 
length  a.  V  ^ 
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35  The  nucleus  168Er  (erbium- 168)  contains  68  protons  (which 
is  what  makes  it  a  nucleus  of  the  element  erbium)  and  100  neutrons. 
It  has  an  ellipsoidal  shape  like  an  American  football,  with  one  long 
axis  and  two  short  axes  that  are  of  equal  diameter.  Because  this 
is  a  subatomic  system,  consisting  of  only  168  particles,  its  behavior 
shows  some  clear  quantum-mechanical  properties.  It  can  only  have 
certain  energy  levels,  and  it  makes  quantum  leaps  between  these 
levels.  Also,  its  angular  momentum  can  only  have  certain  values, 
which  are  all  multiples  of  2.109  x  10~34  kg  •  m2/s.  The  table  shows 
some  of  the  observed  angular  momenta  and  energies  of  168Er,  in  SI 


units  (kg  • 

m2/s  and  joules) 

L  x  1034 

E  x  1014 

0 

0 

2.109 

1.2786 

4.218 

4.2311 

6.327 

8.7919 

8.437 

14.8731 

10.546 

22.3798 

12.655 

31.135 

14.764 

41.206 

16.873 

52.223 

(a)  These  data  can  be  described  to  a  good  approximation  as  a  rigid 
end-over-end  rotation.  Estimate  a  single  best-fit  value  for  the  mo¬ 
ment  of  inertia  from  the  data,  and  check  how  well  the  data  agree 
with  the  assumption  of  rigid-body  rotation.  >  Hint,  p.  959  V 

(b)  Check  whether  this  moment  of  inertia  is  on  the  right  order  of 

magnitude.  The  moment  of  inertia  depends  on  both  the  size  and 
the  shape  of  the  nucleus.  For  the  sake  of  this  rough  check,  ignore 
the  fact  that  the  nucleus  is  not  quite  spherical.  To  estimate  its  size, 
use  the  fact  that  a  neutron  or  proton  has  a  volume  of  about  1  fm3 
(one  cubic  femtometer,  where  1  fm  =  10“ 15  m),  and  assume  they 
are  closely  packed  in  the  nucleus.  ■ 

36  (a)  Prove  the  identity  a  x  (b  x  c)  =  b(a  •  c)  —  c(a  •  b) 
by  expanding  the  product  in  terms  of  its  components.  Note  that 
because  the  x,  y,  and  z  components  are  treated  symmetrically  in 
the  definitions  of  the  vector  cross  product,  it  is  only  necessary  to 
carry  out  the  proof  for  the  x  component  of  the  result. 

(b)  Applying  this  to  the  angular  momentum  of  a  rigidly  rotating 
body,  L  =  I  r  x  (u  x  r)  dm,  show  that  the  diagonal  elements  of  the 
moment  of  inertia  tensor  can  be  expressed  as,  e.g.,  Ixx  =  JV  + 
z2)  dm. 

(c)  Find  the  diagonal  elements  of  the  moment  of  inertia  matrix  of 

an  ellipsoid  with  axes  of  lengths  a,  b,  and  c,  in  the  principal-axis 
frame,  and  with  the  axis  at  the  center.  V  ^ 

37  In  example  22  on  page  276,  prove  that  if  the  rod  is  sufficiently 
thin,  it  can  be  toppled  without  scraping  on  the  floor. 

>  Solution,  p.  976  ■ 
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38  Suppose  an  object  has  mass  m,  and  moment  of  inertia  ID 

for  rotation  about  some  axis  A  passing  through  its  center  of  mass. 
Prove  that  for  an  axis  B,  parallel  to  A  and  lying  at  a  distance  h 
from  it,  the  object’s  moment  of  inertia  is  given  by  ID  +  mh2.  This 
is  known  as  the  parallel  axis  theorem.  ^ 

39  Let  two  sides  of  a  triangle  be  given  by  the  vectors  A  and 
B,  with  their  tails  at  the  origin,  and  let  mass  m  be  uniformly  dis¬ 
tributed  on  the  interior  of  the  triangle,  (a)  Show  that  the  distance 
of  the  triangle’s  center  of  mass  from  the  intersection  of  sides  A  and 
B  is  given  by  g  |  A  +  B  | . 

(b)  Consider  the  quadrilateral  with  mass  2 m,  and  vertices  at  the 
origin,  A,  B,  and  A  +  B.  Show  that  its  moment  of  inertia,  for 
rotation  about  an  axis  perpendicular  to  it  and  passing  through  its 
center  of  mass,  is  A 2  +  B2). 

(c)  Show  that  the  moment  of  inertia  for  rotation  about  an  axis  per¬ 
pendicular  to  the  plane  of  the  original  triangle,  and  passing  through 
its  center  of  mass,  is  yg ( A 2  +  B2  —  A  •  B) .  Hint:  Combine  the  results 
of  parts  a  and  b  with  the  result  of  problem  38. 

40  When  we  talk  about  rigid-body  rotation,  the  concept  of 

a  perfectly  rigid  body  can  only  be  an  idealization.  In  reality,  any 
object  will  compress,  expand,  or  deform  to  some  extent  when  sub¬ 
jected  to  the  strain  of  rotation.  However,  if  we  let  it  settle  down  for 
a  while,  perhaps  it  will  reach  a  new  equilibrium.  As  an  example, 
suppose  we  fill  a  centrifuge  tube  with  some  compressible  substance 
like  shaving  cream  or  Wonder  Bread.  We  can  model  the  contents  of 
the  tube  as  a  one-dimensional  line  of  mass,  extending  from  r  =  0  to 
r  =  £.  Once  the  rotation  starts,  we  expect  that  the  contents  will  be 
most  compressed  near  the  “floor”  of  the  tube  at  r  =  l\  this  is  both 
because  the  inward  force  required  for  circular  motion  increases  with 
r  for  a  fixed  u ,  and  because  the  part  at  the  floor  has  the  greatest 
amount  of  material  pressing  “down”  (actually  outward)  on  it.  The 
linear  density  dm/  dr,  in  units  of  kg/m,  should  therefore  increase  as 
a  function  of  r.  Suppose  that  we  have  dm/  dr  =  n er where  ^  is  a 
constant.  Find  the  moment  of  inertia.  'J 

41  When  we  release  an  object  such  as  a  bicycle  wheel  or  a  coin 

on  an  inclined  plane,  we  can  observe  a  variety  of  different  behaviors. 
Characterize  these  behaviors  empirically  and  try  to  list  the  physical 
parameters  that  determine  which  behavior  occurs.  Try  to  form  a 
conjecture  about  the  behavior  using  simple  closed- form  expressions. 
Test  your  conjecture  experimentally.  li 
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Problem  42. 


Problem  43. 


42  The  figure  shows  a  tabletop  experiment  that  can  be  used  to 

determine  an  unknown  moment  of  inertia.  A  rotating  platform  of 
radius  R  has  a  string  wrapped  around  it.  The  string  is  threaded 
over  a  pulley  and  down  to  a  hanging  weight  of  mass  m.  The  mass 
is  released  from  rest,  and  its  downward  acceleration  a  (a  >  0)  is 
measured.  Find  the  total  moment  of  inertia  I  of  the  platform  plus 
the  object  sitting  on  top  of  it.  (The  moment  of  inertia  of  the  object 
itself  can  then  be  found  by  subtracting  the  value  for  the  empty 
platform.)  V  ■ 

43  The  uniform  cube  has  unit  weight  and  sides  of  unit  length. 

One  corner  is  attached  to  a  universal  joint,  i.e.,  a  frictionless  bearing 
that  allows  any  type  of  rotation.  If  the  cube  is  in  equilibrium,  find 
the  magnitudes  of  the  forces  a,  b,  and  c.  V  I 

44  In  this  problem  we  investigate  the  notion  of  division  by  a 
vector. 

(a)  Given  a  nonzero  vector  a  and  a  scalar  b,  suppose  we  wish  to  find 
a  vector  u  that  is  the  solution  of  a  •  u  =  b.  Show  that  the  solution 
is  not  unique,  and  give  a  geometrical  description  of  the  solution  set. 

(b)  Do  the  same  thing  for  the  equation  a  X  u  =  c. 

(c)  Show  that  the  simultaneous  solution  of  these  two  equations  ex¬ 
ists  and  is  unique. 

Remark:  This  is  one  motivation  for  constructing  the  number  system  called  the 
quaternions.  For  a  certain  period  around  1900,  quaternions  were  more  popular 
than  the  system  of  vectors  and  scalars  more  commonly  used  today.  They  still 
have  some  important  advantages  over  the  scalar-vector  system  for  certain  appli¬ 
cations,  such  as  avoiding  a  phenomenon  known  as  gimbal  lock  in  controlling  the 
orientation  of  bodies  such  as  spacecraft.  B 

45  Show  that  when  a  thin,  uniform  ring  rotates  about  a  diameter, 

the  moment  of  inertia  is  half  as  big  as  for  rotation  about  the  axis  of 
symmetry.  >  Solution,  p.  976  - 


Key  to  symbols: 

easy  -  typical  B  challenging  ■  difficult  B  very  difficult 
V  An  answer  check  is  available  at  www.lightandmatter.com. 
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Exercises 

Exercise  4A:  Torque 

Equipment: 

•  rulers  with  holes  in  them 

•  spring  scales  (two  per  group) 


While  one  person  holds  the  pencil  which  forms  the  axle  for  the  ruler,  the  other  members  of  the 
group  pull  on  the  scale  and  take  readings.  In  each  case,  calculate  the  total  torque  on  the  ruler, 
and  find  out  whether  it  equals  zero  to  roughly  within  the  accuracy  of  the  experiment. 
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Chapter  5 

Thermodynamics 

S  =  klogW 

Inscription  on  the  tomb  of  Ludwig  Boltzmann,  1844-1906. 
Boltzmann  originated  the  microscopic  theory  of  thermodynam¬ 
ics. 

In  a  developing  country  like  China,  a  refrigerator  is  the  mark  of 
a  family  that  has  arrived  in  the  middle  class,  and  a  car  is  the  ulti¬ 
mate  symbol  of  wealth.  Both  of  these  are  heat  engines:  devices  for 
converting  between  heat  and  other  forms  of  energy.  Unfortunately 
for  the  Chinese,  neither  is  a  very  efficient  device.  Burning  fossil  fuels 
has  made  China’s  big  cities  the  most  polluted  on  the  planet,  and 
the  country’s  total  energy  supply  isn’t  sufficient  to  support  Amer¬ 
ican  levels  of  energy  consumption  by  more  than  a  small  fraction 
of  China’s  population.  Could  we  somehow  manipulate  energy  in  a 
more  efficient  way? 

Conservation  of  energy  is  a  statement  that  the  total  amount  of 
energy  is  constant  at  all  times,  which  encourages  us  to  believe  that 
any  energy  transformation  can  be  undone  —  indeed,  the  laws  of 
physics  you’ve  learned  so  far  don’t  even  distinguish  the  past  from 
the  future.  If  you  get  in  a  car  and  drive  around  the  block,  the 
net  effect  is  to  consume  some  of  the  energy  you  paid  for  at  the 
gas  station,  using  it  to  heat  the  neighborhood.  There  would  not 
seem  to  be  any  fundamental  physical  principle  to  prevent  you  from 
recapturing  all  that  heat  and  using  it  again  the  next  time  you  want 
to  go  for  a  drive.  More  modestly,  why  don’t  engineers  design  a  car 
engine  so  that  it  recaptures  the  heat  energy  that  would  otherwise 
be  wasted  via  the  radiator  and  the  exhaust? 

Hard  experience,  however,  has  shown  that  designers  of  more  and 
more  efficient  engines  run  into  a  brick  wall  at  a  certain  point.  The 
generators  that  the  electric  company  uses  to  produce  energy  at  an 
oil-fueled  plant  are  indeed  much  more  efficient  than  a  car  engine,  but 
even  if  one  is  willing  to  accept  a  device  that  is  very  large,  expensive, 
and  complex,  it  turns  out  to  be  impossible  to  make  a  perfectly  effi¬ 
cient  heat  engine  —  not  just  impossible  with  present-day  technology, 
but  impossible  due  to  a  set  of  fundamental  physical  principles  known 
as  the  science  of  thermodynamics.  And  thermodynamics  isn’t  just  a 
pesky  set  of  constraints  on  heat  engines.  Without  thermodynamics, 
there  is  no  way  to  explain  the  direction  of  time’s  arrow  —  why  we 
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can  remember  the  past  but  not  the  future,  and  why  it’s  easier  to 
break  Humpty  Dumpty  than  to  put  him  back  together  again. 


5.1  Pressure  and  Temperature 

When  we  heat  an  object,  we  speed  up  the  mind-bogglingly  complex 
random  motion  of  its  molecules.  One  method  for  taming  complexity 
is  the  conservation  laws,  since  they  tell  us  that  certain  things  must 
remain  constant  regardless  of  what  process  is  going  on.  Indeed, 
the  law  of  conservation  of  energy  is  also  known  as  the  first  law  of 
thermodynamics . 

But  as  alluded  to  in  the  introduction  to  this  chapter,  conserva¬ 
tion  of  energy  by  itself  is  not  powerful  enough  to  explain  certain 
empirical  facts  about  heat.  A  second  way  to  sidestep  the  complex¬ 
ity  of  heat  is  to  ignore  heat’s  atomic  nature  and  concentrate  on 
quantities  like  temperature  and  pressure  that  tell  us  about  a  sys¬ 
tem’s  properties  as  a  whole.  This  approach  is  called  macroscopic  in 
contrast  to  the  microscopic  method  of  attack.  Pressure  and  temper¬ 
ature  were  fairly  well  understood  in  the  age  of  Newton  and  Galileo, 
hundreds  of  years  before  there  was  any  firm  evidence  that  atoms 
and  molecules  even  existed. 

Unlike  the  conserved  quantities  such  as  mass,  energy,  momen¬ 
tum,  and  angular  momentum,  neither  pressure  nor  temperature  is 
additive.  Two  cups  of  coffee  have  twice  the  heat  energy  of  a  single 
cup,  but  they  do  not  have  twice  the  temperature.  Likewise,  the 
painful  pressure  on  your  eardrums  at  the  bottom  of  a  pool  is  not 
affected  if  you  insert  or  remove  a  partition  between  the  two  halves 
of  the  pool. 

We  restrict  ourselves  to  a  discussion  of  pressure  in  fluids  at  rest 
and  in  equilibrium.  In  physics,  the  term  “fluid”  is  used  to  mean 
either  a  gas  or  a  liquid.  The  important  feature  of  a  fluid  can  be 
demonstrated  by  comparing  with  a  cube  of  jello  on  a  plate.  The 
jello  is  a  solid.  If  you  shake  the  plate  from  side  to  side,  the  jello  will 
respond  by  shearing,  i.e.,  by  slanting  its  sides,  but  it  will  tend  to 
spring  back  into  its  original  shape.  A  solid  can  sustain  shear  forces, 
but  a  fluid  cannot.  A  fluid  does  not  resist  a  change  in  shape  unless 
it  involves  a  change  in  volume. 

5.1.1  Pressure 

If  you’re  at  the  bottom  of  a  pool,  you  can’t  relieve  the  pain  in 
your  ears  by  turning  your  head.  The  water’s  force  on  your  eardrum 
is  always  the  same,  and  is  always  perpendicular  to  the  surface  where 
the  eardrum  contacts  the  water.  If  your  ear  is  on  the  east  side  of 
your  head,  the  water’s  force  is  to  the  west.  If  you  keep  your  ear 
in  the  same  spot  while  turning  around  so  your  ear  is  on  the  north, 
the  force  will  still  be  the  same  in  magnitude,  and  it  will  change 
its  direction  so  that  it  is  still  perpendicular  to  the  eardrum:  south. 
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This  shows  that  pressure  has  no  direction  in  space,  i.e. ,  it  is  a  scalar. 
The  direction  of  the  force  is  determined  by  the  orientation  of  the 
surface  on  which  the  pressure  acts,  not  by  the  pressure  itself.  A 
fluid  flowing  over  a  surface  can  also  exert  frictional  forces,  which 
are  parallel  to  the  surface,  but  the  present  discussion  is  restricted 
to  fluids  at  rest. 

Experiments  also  show  that  a  fluid’s  force  on  a  surface  is  pro¬ 
portional  to  the  surface  area.  The  vast  force  of  the  water  behind 
a  dam,  for  example,  in  proportion  to  the  dam’s  great  surface  area. 
(The  bottom  of  the  dam  experiences  a  higher  proportion  of  its  force.) 

Based  on  these  experimental  results,  it  appears  that  the  useful 
way  to  define  pressure  is  as  follows.  The  pressure  of  a  fluid  at  a 
given  point  is  defined  as  F±/A,  where  A  is  the  area  of  a  small  surface 
inserted  in  the  fluid  at  that  point,  and  F±_  is  the  component  of  the 
fluid’s  force  on  the  surface  which  is  perpendicular  to  the  surface.  (In 
the  case  of  a  moving  fluid,  fluid  friction  forces  can  act  parallel  to 
the  surface,  but  we’re  only  dealing  with  stationary  fluids,  so  there 
is  only  an  F±.) 

This  is  essentially  how  a  pressure  gauge  works.  The  reason  that 
the  surface  must  be  small  is  so  that  there  will  not  be  any  significant 
difference  in  pressure  between  one  part  of  it  and  another  part.  The 
SI  units  of  pressure  are  evidently  N/m2,  and  this  combination  can 
be  abbreviated  as  the  pascal,  1  Pa=l  N/m2.  The  pascal  turns  out  to 
be  an  inconveniently  small  unit,  so  car  tires,  for  example,  normally 
have  pressures  imprinted  on  them  in  units  of  kilopascals. 

Pressure  in  U.S.  units  example  1 

In  U.S.  units,  the  unit  of  force  is  the  pound,  and  the  unit  of  distance 
is  the  inch.  The  unit  of  pressure  is  therefore  pounds  per  square 
inch,  or  p.s.i.  (Note  that  the  pound  is  not  a  unit  of  mass.) 

Atmospheric  pressure  in  U.S.  and  metric  units  example  2 

>  A  figure  that  many  people  in  the  U.S.  remember  is  that  atmo¬ 
spheric  pressure  is  about  15  pounds  per  square  inch.  What  is 
this  in  metric  units? 

> 


a/ A  simple  pressure  gauge 
consists  of  a  cylinder  open  at  one 
end,  with  a  piston  and  a  spring 
inside.  The  depth  to  which  the 
spring  is  depressed  is  a  measure 
of  the  pressure.  To  determine  the 
absolute  pressure,  the  air  needs 
to  be  pumped  out  of  the  interior  of 
the  gauge,  so  that  there  is  no  air 
pressure  acting  outward  on  the 
piston.  In  many  practical  gauges, 
the  back  of  the  piston  is  open  to 
the  atmosphere,  so  the  pressure 
the  gauge  registers  equals  the 
pressure  of  the  fluid  minus  the 
pressure  of  the  atmosphere. 


(15  lb) /(I  in2)  = 


68  N 

(0.0254  m)2 
1.0  x  105  N/m2 
100  kPa 


Only  pressure  differences  are  normally  significant. 

If  you  spend  enough  time  on  an  airplane,  the  pain  in  your  ears 
subsides.  This  is  because  your  body  has  gradually  been  able  to  ad¬ 
mit  more  air  into  the  cavity  behind  the  eardrum.  Once  the  pressure 
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inside  is  equalized  with  the  pressure  outside,  the  inward  and  out¬ 
ward  forces  on  your  eardrums  cancel  out,  and  there  is  no  physical 
sensation  to  tell  you  that  anything  unusual  is  going  on.  For  this 
reason,  it  is  normally  only  pressure  differences  that  have  any  phys¬ 
ical  significance.  Thus  deep-sea  fish  are  perfectly  healthy  in  their 
habitat  because  their  bodies  have  enough  internal  pressure  to  cancel 
the  pressure  from  the  water  in  which  they  live;  if  they  are  caught  in 
a  net  and  brought  to  the  surface  rapidly,  they  explode  because  their 
internal  pressure  is  so  much  greater  than  the  low  pressure  outside. 

Getting  killed  by  a  pool  pump  example  3 

>  My  house  has  a  pool,  which  I  maintain  myself.  A  pool  always 
needs  to  have  its  water  circulated  through  a  filter  for  several  hours 
a  day  in  order  to  keep  it  clean.  The  filter  is  a  large  barrel  with  a 
strong  clamp  that  holds  the  top  and  bottom  halves  together.  My 
filter  has  a  prominent  warning  label  that  warns  me  not  to  try  to 
open  the  clamps  while  the  pump  is  on,  and  it  shows  a  cartoon 
of  a  person  being  struck  by  the  top  half  of  the  pump.  The  cross- 
sectional  area  of  the  filter  barrel  is  0.25  m2.  Like  most  pressure 
gauges,  the  one  on  my  pool  pump  actually  reads  the  difference  in 
pressure  between  the  pressure  inside  the  pump  and  atmospheric 
pressure.  The  gauge  reads  90  kPa.  What  is  the  force  that  is 
trying  to  pop  open  the  filter? 

>  If  the  gauge  told  us  the  absolute  pressure  of  the  water  inside, 
we’d  have  to  find  the  force  of  the  water  pushing  outward  and  the 
force  of  the  air  pushing  inward,  and  subtract  in  order  to  find  the 
total  force.  Since  air  surrounds  us  all  the  time,  we  would  have  to 
do  such  a  subtraction  every  time  we  wanted  to  calculate  anything 
useful  based  on  the  gauge’s  reading.  The  manufacturers  of  the 
gauge  decided  to  save  us  from  all  this  work  by  making  it  read  the 
difference  in  pressure  between  inside  and  outside,  so  all  we  have 
to  do  is  multiply  the  gauge  reading  by  the  cross-sectional  area  of 
the  filter: 


F  =  PA 

=  (90  x  103  N/m2)(0.25  m2) 

=  22000  N 

That’s  a  lot  of  force! 

The  word  “suction”  and  other  related  words  contain  a  hidden 
misunderstanding  related  to  this  point  about  pressure  differences. 
When  you  suck  water  up  through  a  straw,  there  is  nothing  in  your 
mouth  that  is  attracting  the  water  upward.  The  force  that  lifts  the 
water  is  from  the  pressure  of  the  water  in  the  cup.  By  creating  a 
partial  vacuum  in  your  mouth,  you  decreased  the  air’s  downward 
force  on  the  water  so  that  it  no  longer  exactly  canceled  the  upward 
force. 
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Variation  of  pressure  with  depth 

The  pressure  within  a  fluid  in  equilibrium  can  only  depend  on 
depth,  due  to  gravity.  If  the  pressure  could  vary  from  side  to  side, 
then  a  piece  of  the  fluid  in  between,  b,  would  be  subject  to  unequal 
forces  from  the  parts  of  the  fluid  on  its  two  sides.  Since  fluids  do  not 
exhibit  shear  forces,  there  would  be  no  other  force  that  could  keep 
this  piece  of  fluid  from  accelerating.  This  contradicts  the  assumption 
that  the  fluid  was  in  equilibrium. 

self-check  A 

How  does  this  proof  fail  for  solids?  >  Answer,  p.  963 

To  find  the  variation  with  depth,  we  consider  the  vertical  forces 
acting  on  a  tiny,  imaginary  cube  of  the  fluid  having  infinitesimal 
height  d y  and  areas  dA  on  the  top  and  bottom.  Using  positive 
numbers  for  upward  forces,  we  have 

Pbottom.  dA  Ptop  dA  —  Fg  =  0. 

The  weight  of  the  fluid  is  Fg  =  mg  =  pV g  =  pdAdy  g,  where  p  is 
the  density  of  the  fluid,  so  the  difference  in  pressure  is 

dP  =  —pgdy.  [variation  in  pressure  with  depth  for 

a  fluid  of  density  p  in  equilibrium; 

positive  y  is  up.] 

A  more  elegant  way  of  writing  this  is  in  terms  of  a  dot  product, 
dP  =  pg  •  dy,  which  automatically  takes  care  of  the  plus  or  minus 
sign,  depending  on  the  relative  directions  of  the  g  and  dy  vectors, 
and  avoids  any  requirements  about  the  coordinate  system. 

The  factor  of  p  explains  why  we  notice  the  difference  in  pressure 
when  diving  3  m  down  in  a  pool,  but  not  when  going  down  3  m  of 
stairs.  The  equation  only  tells  us  the  difference  in  pressure,  not  the 
absolute  pressure.  The  pressure  at  the  surface  of  a  swimming  pool 
equals  the  atmospheric  pressure,  not  zero,  even  though  the  depth  is 
zero  at  the  surface.  The  blood  in  your  body  does  not  even  have  an 
upper  surface. 

In  cases  where  g  and  p  are  independent  of  depth,  we  can  inte¬ 
grate  both  sides  of  the  equation  to  get  everything  in  terms  of  finite 
differences  rather  than  differentials:  A P  =  —pgAy. 

self-check  B 

In  which  of  the  following  situations  is  the  equation  A P  =  -pgAy  valid? 
Why?  (1)  difference  in  pressure  between  a  tabletop  and  the  feet  (i.e., 
predicting  the  pressure  of  the  feet  on  the  floor)  (2)  difference  in  air  pres¬ 
sure  between  the  top  and  bottom  of  a  tall  building  (3)  difference  in  air 
pressure  between  the  top  and  bottom  of  Mt.  Everest  (4)  difference  in 
pressure  between  the  top  of  the  earth’s  mantle  and  the  center  of  the 
earth  (5)  difference  in  pressure  between  the  top  and  bottom  of  an  air¬ 
plane’s  wing  t>  Answer,  p. 

964 


b/This  doesn’t  happen.  If 
pressure  could  vary  horizontally 
in  equilibrium,  the  cube  of  water 
would  accelerate  horizontally. 
This  is  a  contradiction,  since 
we  assumed  the  fluid  was  in 
equilibrium. 


c  /  The  pressure  is  the  same 
at  all  the  points  marked  with  dots. 


d/This  does  happen.  The 
sum  of  the  forces  from  the 
surrounding  parts  of  the  fluid  is 
upward,  canceling  the  downward 
force  of  gravity. 
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Pressure  of  lava  underneath  a  volcano  example  4 

>  A  volcano  has  just  finished  erupting,  and  a  pool  of  molten  lava 
is  lying  at  rest  in  the  crater.  The  lava  has  come  up  through  an 
opening  inside  the  volcano  that  connects  to  the  earth’s  molten 
mantle.  The  density  of  the  lava  is  4.1  g/cm3.  What  is  the  pressure 
in  the  lava  underneath  the  base  of  the  volcano,  3000  m  below  the 
surface  of  the  pool? 

> 


A  P  =  pgAy 

=  (4.1  g/cm3)(9.8  m/s2)(3000  m) 

=  (4.1  x  106  g/m3)(9.8  m/s2)(3000  m) 

=  (4.1  x  103  kg/m3)(9.8  m/s2)(3000  m) 

=  1.2  x  108  N/m2 
=  1.2  x  108  Pa 

This  is  the  difference  between  the  pressure  we  want  to  find  and 
atmospheric  pressure  at  the  surface.  The  latter,  however,  is  tiny 
compared  to  the  A P  we  just  calculated,  so  what  we’ve  found  is 
essentially  the  pressure,  P. 

Atmospheric  pressure  example  5 

Gases,  unlike  liquids,  are  quite  compressible,  and  at  a  given  tem¬ 
perature,  the  density  of  a  gas  is  approximately  proportional  to  the 
pressure.  The  proportionality  constant  is  discussed  on  page  308, 
but  for  now  let’s  just  call  it  k,  p  =  kP.  Using  this  fact,  we  can 
find  the  variation  of  atmospheric  pressure  with  altitude,  assuming 
constant  temperature: 


e  /  We  have  to  wait  for  the 
thermometer  to  equilibrate  its 
temperature  with  the  temperature 
of  Irene’s  armpit. 


dP  =  -pgdy 

dP  =  - kPgdy 

dP  ,  ^ 

—  =  -kgdy 

In  P  =  -kgy  +  constant  [integrating  both  sides] 

P  =  (constant) e~k9y  [exponentiating  both  sides] 

Pressure  falls  off  exponentially  with  height.  There  is  no  sharp  cut¬ 
off  to  the  atmosphere,  but  the  exponential  factor  gets  extremely 
small  by  the  time  you’re  ten  or  a  hundred  miles  up. 

5.1.2  Temperature 

Thermal  equilibrium 

We  use  the  term  temperature  casually,  but  what  is  it  exactly? 
Roughly  speaking,  temperature  is  a  measure  of  how  concentrated 
the  heat  energy  is  in  an  object.  A  large,  massive  object  with  very 
little  heat  energy  in  it  has  a  low  temperature. 


304 


Chapter  5  Thermodynamics 


But  physics  deals  with  operational  definitions,  i.e.,  definitions  of 
how  to  measure  the  thing  in  question.  How  do  we  measure  temper¬ 
ature?  One  common  feature  of  all  temperature-measuring  devices 
is  that  they  must  be  left  for  a  while  in  contact  with  the  thing  whose 
temperature  is  being  measured.  When  you  take  your  temperature 
with  a  fever  thermometer,  you  are  waiting  for  the  mercury  inside  to 
come  up  to  the  same  temperature  as  your  body.  The  thermometer 
actually  tells  you  the  temperature  of  its  own  working  fluid  (in  this 
case  the  mercury).  In  general,  the  idea  of  temperature  depends  on 
the  concept  of  thermal  equilibrium.  When  you  mix  cold  eggs  from 
the  refrigerator  with  flour  that  has  been  at  room  temperature,  they 
rapidly  reach  a  compromise  temperature.  What  determines  this 
compromise  temperature  is  conservation  of  energy,  and  the  amount 
of  energy  required  to  heat  or  cool  each  substance  by  one  degree. 
But  without  even  having  constructed  a  temperature  scale,  we  can 
see  that  the  important  point  is  the  phenomenon  of  thermal  equi¬ 
librium  itself:  two  objects  left  in  contact  will  approach  the  same 
temperature.  We  also  assume  that  if  object  A  is  at  the  same  tem¬ 
perature  as  object  B,  and  B  is  at  the  same  temperature  as  C,  then 
A  is  at  the  same  temperature  as  C.  This  statement  is  sometimes 
known  as  the  zeroth  law  of  thermodynamics,  so  called  because  after 
the  first,  second,  and  third  laws  had  been  developed,  it  was  realized 
that  there  was  another  law  that  was  even  more  fundamental. 

Thermal  expansion 

The  familiar  mercury  thermometer  operates  on  the  principle  that 
the  mercury,  its  working  fluid,  expands  when  heated  and  contracts 
when  cooled.  In  general,  all  substances  expand  and  contract  with 
changes  in  temperature.  The  zeroth  law  of  thermodynamics  guar¬ 
antees  that  we  can  construct  a  comparative  scale  of  temperatures 
that  is  independent  of  what  type  of  thermometer  we  use.  If  a  ther¬ 
mometer  gives  a  certain  reading  when  it’s  in  thermal  equilibrium 
with  object  A,  and  also  gives  the  same  reading  for  object  B,  then 
A  and  B  must  be  the  same  temperature,  regardless  of  the  details  of 
how  the  thermometers  works. 

What  about  constructing  a  temperature  scale  in  which  every 
degree  represents  an  equal  step  in  temperature?  The  Celsius  scale 
has  0  as  the  freezing  point  of  water  and  100  as  its  boiling  point.  The 
hidden  assumption  behind  all  this  is  that  since  two  points  define  a 
line,  any  two  thermometers  that  agree  at  two  points  must  agree  at 
all  other  points.  In  reality  if  we  calibrate  a  mercury  thermometer 
and  an  alcohol  thermometer  in  this  way,  we  will  find  that  a  graph 
of  one  thermometer’s  reading  versus  the  other  is  not  a  perfectly 
straight  y  =  x  line.  The  subtle  inconsistency  becomes  a  drastic  one 
when  we  try  to  extend  the  temperature  scale  through  the  points 
where  mercury  and  alcohol  boil  or  freeze.  Gases,  however,  are  much 
more  consistent  among  themselves  in  their  thermal  expansion  than 


f/ Thermal  equilibrium  can 
be  prevented.  Otters  have  a  coat 
of  fur  that  traps  air  bubbles  for  in¬ 
sulation.  If  a  swimming  otter  was 
in  thermal  equilibrium  with  cold 
water,  it  would  be  dead.  Heat  is 
still  conducted  from  the  otter’s 
body  to  the  water,  but  much 
more  slowly  than  it  would  be  in  a 
warm-blooded  animal  that  didn’t 
have  this  special  adaptation. 


g/A  hot  air  balloon  is  inflated. 
Because  of  thermal  expansion, 
the  hot  air  is  less  dense  than 
the  surrounding  cold  air,  and 
therefore  floats  as  the  cold  air 
drops  underneath  it  and  pushes  it 
up  out  of  the  way. 
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open  to 

standard 

pressure 


h/A  simplified  version  of  an 
ideal  gas  thermometer.  The 
whole  instrument  is  allowed  to 
come  into  thermal  equilibrium 
with  the  substance  whose  tem¬ 
perature  is  to  be  measured,  and 
the  mouth  of  the  cylinder  is  left 
open  to  standard  pressure.  The 
volume  of  the  noble  gas  gives  an 
indication  of  temperature. 


0  K - , - r- 

-273  -200  -100 


temperature  (C) 

i/The  volume  of  1  kg  of  neon 
gas  as  a  function  of  temperature 
(at  standard  pressure).  Although 
neon  would  actually  condense 
into  a  liquid  at  some  point, 
extrapolating  the  graph  gives 
to  zero  volume  gives  the  same 
temperature  as  for  any  other  gas: 
absolute  zero. 


solids  or  liquids,  and  the  noble  gases  like  helium  and  neon  are  more 
consistent  with  each  other  than  gases  in  general.  Continuing  to 
search  for  consistency,  we  find  that  noble  gases  are  more  consistent 
with  each  other  when  their  pressure  is  very  low. 

As  an  idealization,  we  imagine  a  gas  in  which  the  atoms  interact 
only  with  the  sides  of  the  container,  not  with  each  other.  Such  a 
gas  is  perfectly  nonreactive  (as  the  noble  gases  very  nearly  are),  and 
never  condenses  to  a  liquid  (as  the  noble  gases  do  only  at  extremely 
low  temperatures).  Its  atoms  take  up  a  negligible  fraction  of  the 
available  volume.  Any  gas  can  be  made  to  behave  very  much  like 
this  if  the  pressure  is  extremely  low,  so  that  the  atoms  hardly  ever 
encounter  each  other.  Such  a  gas  is  called  an  ideal  gas,  and  we  define 
the  Celsius  scale  in  terms  of  the  volume  of  the  gas  in  a  thermometer 
whose  working  substance  is  an  ideal  gas  maintained  at  a  fixed  (very 
low)  pressure,  and  which  is  calibrated  at  0  and  100  degrees  according 
to  the  melting  and  boiling  points  of  water.  The  Celsius  scale  is  not 
just  a  comparative  scale  but  an  additive  one  as  well:  every  step  in 
temperature  is  equal,  and  it  makes  sense  to  say  that  the  difference 
in  temperature  between  18  and  28°  C  is  the  same  as  the  difference 
between  48  and  58. 

Absolute  zero  and  the  kelvin  scale 

We  find  that  if  we  extrapolate  a  graph  of  volume  versus  tem¬ 
perature,  the  volume  becomes  zero  at  nearly  the  same  temperature 
for  all  gases:  —  273°C.  Real  gases  will  all  condense  into  liquids  at 
some  temperature  above  this,  but  an  ideal  gas  would  achieve  zero 
volume  at  this  temperature,  known  as  absolute  zero.  The  most  use¬ 
ful  temperature  scale  in  scientific  work  is  one  whose  zero  is  defined 
by  absolute  zero,  rather  than  by  some  arbitrary  standard  like  the 
melting  point  of  water.  The  temperature  scale  used  universally  in 
scientific  work,  called  the  Kelvin  scale,  is  the  same  as  the  Celsius 
scale,  but  shifted  by  273  degrees  to  make  its  zero  coincide  with  ab¬ 
solute  zero.  Scientists  use  the  Celsius  scale  only  for  comparisons  or 
when  a  change  in  temperature  is  all  that  is  required  for  a  calcula¬ 
tion.  Only  on  the  Kelvin  scale  does  it  make  sense  to  discuss  ratios 
of  temperatures,  e.g.,  to  say  that  one  temperature  is  twice  as  hot  as 
another. 

Which  temperature  scale  to  use  example  6 

>  You  open  an  astronomy  book  and  encounter  the  equation 

(light  emitted)  =  (constant)  x  T4 

for  the  light  emitted  by  a  star  as  a  function  of  its  surface  tempera¬ 
ture.  What  temperature  scale  is  implied? 

>  The  equation  tells  us  that  doubling  the  temperature  results  in 
the  emission  of  16  times  as  much  light.  Such  a  ratio  only  makes 
sense  if  the  Kelvin  scale  is  used. 
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Although  we  can  achieve  as  good  an  approximation  to  an  ideal 
gas  as  we  wish  by  making  the  pressure  very  low,  it  seems  never¬ 
theless  that  there  should  be  some  more  fundamental  way  to  define 
temperature.  We  will  construct  a  more  fundamental  scale  of  tem¬ 
perature  in  section  5.4. 

Discussion  Questions 

A  Figure  j/1  shows  objects  1  and  2,  each  with  a  certain  temperature  T 
and  a  certain  amount  of  thermal  energy  E.  They  are  connected  by  a  thin 
rod,  so  that  eventually  they  will  reach  thermal  equilibrium.  We  expect  that 
the  rate  at  which  heat  is  transferred  into  object  1  will  be  given  by  some 
equation  dE-i  /  df  =  k(. . .),  where  k  is  a  positive  constant  of  proportionality 
and  “. . .”  is  some  expression  that  depends  on  the  temperatures.  Suppose 
that  the  following  six  forms  are  proposed  for  “. . . 

1.  Ti 

2.  T2 

3.  7i  -  r2 

4.  T2-TA 

5.  h/T2 

6-  ^2/Ti  j  /  Discussion  questions  A-C. 

Give  physical  reasons  why  five  of  these  are  not  possible. 

B  How  should  the  rate  of  heat  conduction  in  j/2  compare  with  the  rate 
in  j/1? 

C  The  example  in  j/3  is  different  from  the  preceding  ones  because 
when  we  add  the  third  object  in  the  middle,  we  don’t  necessarily  know  the 
intermediate  temperature.  We  could  in  fact  set  up  this  third  object  with 
any  desired  initial  temperature.  Suppose,  however,  that  the  flow  of  heat 
is  steady.  For  example,  the  36°  object  could  be  a  human  body,  the  0° 
object  could  be  the  air  on  a  cold  day,  and  the  object  in  between  could  be 
a  simplified  physical  model  of  the  insulation  provided  by  clothing  or  body 
fat.  Under  this  assumption,  what  is  the  intermediate  temperature?  How 
does  the  rate  of  heat  conduction  compare  in  the  two  cases? 

D  Based  on  the  conclusions  of  questions  A-C,  how  should  the  rate 
of  heat  conduction  through  an  object  depend  on  its  length  and  cross- 
sectional  area?  If  all  the  linear  dimensions  of  the  object  are  doubled, 
what  happens  to  the  rate  of  heat  conduction  through  it?  How  would  this 
apply  if  we  compare  an  elephant  to  a  shrew? 


1 


3 


5.2  Microscopic  Description  of  An  Ideal  Gas 

5.2.1  Evidence  for  the  kinetic  theory 

Why  does  matter  have  the  thermal  properties  it  does?  The  basic 
answer  must  come  from  the  fact  that  matter  is  made  of  atoms.  How, 
then,  do  the  atoms  give  rise  to  the  bulk  properties  we  observe? 
Gases,  whose  thermal  properties  are  so  simple,  offer  the  best  chance 
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for  us  to  construct  a  simple  connection  between  the  microscopic  and 
macroscopic  worlds. 

A  crucial  observation  is  that  although  solids  and  liquids  are 
nearly  incompressible,  gases  can  be  compressed,  as  when  we  in¬ 
crease  the  amount  of  air  in  a  car’s  tire  while  hardly  increasing  its 
volume  at  all.  This  makes  us  suspect  that  the  atoms  in  a  solid  are 
packed  shoulder  to  shoulder,  while  a  gas  is  mostly  vacuum,  with 
large  spaces  between  molecules.  Most  liquids  and  solids  have  den¬ 
sities  about  1000  times  greater  than  most  gases,  so  evidently  each 
molecule  in  a  gas  is  separated  from  its  nearest  neighbors  by  a  space 
something  like  10  times  the  size  of  the  molecules  themselves. 

If  gas  molecules  have  nothing  but  empty  space  between  them, 
why  don’t  the  molecules  in  the  room  around  you  just  fall  to  the 
floor?  The  only  possible  answer  is  that  they  are  in  rapid  motion, 
continually  rebounding  from  the  walls,  floor  and  ceiling.  In  section 
2.4  I  have  already  given  some  of  the  evidence  for  the  kinetic  theory 
of  heat,  which  states  that  heat  is  the  kinetic  energy  of  randomly 
moving  molecules.  This  theory  was  proposed  by  Daniel  Bernoulli 
in  1738,  and  met  with  considerable  opposition  because  it  seemed 
as  though  the  molecules  in  a  gas  would  eventually  calm  down  and 
settle  into  a  thin  film  on  the  floor.  There  was  no  precedent  for  this 
kind  of  perpetual  motion.  No  rubber  ball,  however  elastic,  rebounds 
from  a  wall  with  exactly  as  much  energy  as  it  originally  had,  nor 
do  we  ever  observe  a  collision  between  balls  in  which  none  of  the 
kinetic  energy  at  all  is  converted  to  heat  and  sound.  The  analogy  is 
a  false  one,  however.  A  rubber  ball  consists  of  atoms,  and  when  it  is 
heated  in  a  collision,  the  heat  is  a  form  of  motion  of  those  atoms.  An 
individual  molecule,  however,  cannot  possess  heat.  Likewise  sound 
is  a  form  of  bulk  motion  of  molecules,  so  colliding  molecules  in  a  gas 
cannot  convert  their  kinetic  energy  to  sound.  Molecules  can  indeed 
induce  vibrations  such  as  sound  waves  when  they  strike  the  walls  of  a 
container,  but  the  vibrations  of  the  walls  are  just  as  likely  to  impart 
energy  to  a  gas  molecule  as  to  take  energy  from  it.  Indeed,  this  kind 
of  exchange  of  energy  is  the  mechanism  by  which  the  temperatures 
of  the  gas  and  its  container  become  equilibrated. 

5.2.2  Pressure,  volume,  and  temperature 

A  gas  exerts  pressure  on  the  walls  of  its  container,  and  in  the 
kinetic  theory  we  interpret  this  apparently  constant  pressure  as  the 
averaged-out  result  of  vast  numbers  of  collisions  occurring  every 
second  between  the  gas  molecules  and  the  walls.  The  empirical 
facts  about  gases  can  be  summarized  by  the  relation 

PV  oc  nT,  [ideal  gas] 

which  really  only  holds  exactly  for  an  ideal  gas.  Here  n  is  the  number 
of  molecules  in  the  sample  of  gas. 

Volume  related  to  temperature  example  7 
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The  proportionality  of  volume  to  temperature  at  fixed  pressure 
was  the  basis  for  our  definition  of  temperature. 

Pressure  related  to  temperature  example  8 

Pressure  is  proportional  to  temperature  when  volume  is  held  con¬ 
stant.  An  example  is  the  increase  in  pressure  in  a  car’s  tires  when 
the  car  has  been  driven  on  the  freeway  for  a  while  and  the  tires 
and  air  have  become  hot. 


We  now  connect  these  empirical  facts  to  the  kinetic  theory  of 
a  classical  ideal  gas.  For  simplicity,  we  assume  that  the  gas  is 
monoatomic  (i.e.,  each  molecule  has  only  one  atom),  and  that  it 
is  confined  to  a  cubical  box  of  volume  V,  with  L  being  the  length 
of  each  edge  and  A  the  area  of  any  wall.  An  atom  whose  velocity 
has  an  x  component  vx  will  collide  regularly  with  the  left-hand  wall, 
traveling  a  distance  2 L  parallel  to  the  x  axis  between  collisions  with 
that  wall.  The  time  between  collisions  is  At  =  2L/vx,  and  in  each 
collision  the  x  component  of  the  atom’s  momentum  is  reversed  from 
—rrmx  to  mvx.  The  total  force  on  the  wall  is 


j?  _  ST'  ^Px,i 

A  ti 


[monoatomic  ideal  gas], 


where  the  index  i  refers  to  the  individual  atoms.  Substituting 
Apxy  =  2mvx,i  and  A  ti  =  2  L/vx,i,  we  have 


[monoatomic  ideal  gas]. 


The  quantity  mv^  i  is  twice  the  contribution  to  the  kinetic  energy 
from  the  part  of  the  atoms’  center  of  mass  motion  that  is  parallel  to 
the  x  axis.  Since  we’re  assuming  a  monoatomic  gas,  center  of  mass 
motion  is  the  only  type  of  motion  that  gives  rise  to  kinetic  energy. 
(A  more  complex  molecule  could  rotate  and  vibrate  as  well.)  If  the 
quantity  inside  the  sum  included  the  y  and  z  components,  the  sum 
would  be  twice  the  total  kinetic  energy  of  all  the  molecules.  Since  we 
expect  the  energy  to  be  equally  shared  among  x,  y,  and  z  motion,1 
the  quantity  inside  the  sum  must  therefore  equal  2/3  of  the  total 
kinetic  energy,  so 


2  K total 

3  L 


[monoatomic  ideal  gas]. 


Dividing  by  A  and  using  AL  =  V,  we  have 


P  =  —  "  a  [monoatomic  ideal  gas], 
o  V 

This  can  be  connected  to  the  empirical  relation  PV  oc  nT  if  we 
multiply  by  V  on  both  sides  and  rewrite  Ktotai  as  7iK,  where  K  is 
the  average  kinetic  energy  per  molecule: 

2  - 

PV  =  —nK  [monoatomic  ideal  gas]. 

3 

xThis  equal  sharing  will  be  justified  more  rigorously  on  page  325. 
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For  the  first  time  we  have  an  interpretation  of  temperature  based  on 
a  microscopic  description  of  matter:  in  a  monoatomic  ideal  gas,  the 
temperature  is  a  measure  of  the  average  kinetic  energy  per  molecule. 
The  proportionality  between  the  two  is  K  =  (3/2) kT,  where  the 
constant  of  proportionality  k ,  known  as  Boltzmann’s  constant,  has 
a  numerical  value  of  1.38  x  10~23  J/K. 

The  Boltzmann  constant  has  the  value  it  does  because  the  Cel¬ 
sius  and  kelvin  scales  were  defined  before  the  microscopic  picture 
of  thermodynamics  had  been  discovered.  For  some  calculations,  it 
is  more  convenient  to  work  in  more  natural  units  where  k  =  1  by 
definition,  and  then  the  units  of  temperature  and  energy  are  the 
same.  The  Boltzmann  constant  is  small  because  our  energy  scale  of 
joules  is  a  macroscopic  scale,  so  that  when  we  express  the  thermal 
energy  of  a  single  atom  in  joules,  the  number  is  very  small. 

Summarizing,  we  have  the  following  two  important  facts. 


Microscopic  model  of  an  ideal  gas 

For  an  ideal  gas, 

PV  =  nkT , 

which  is  known  as  the  ideal  gas  law.  The  temperature  of  the  gas  is 
a  measure  of  the  average  kinetic  energy  per  atom, 

K  =  -kT. 

2 


Although  I  won’t  prove  it  here,  the  ideal  gas  law  applies  to  all 
ideal  gases,  even  though  the  derivation  assumed  a  monoatomic  ideal 
gas  in  a  cubical  box.  (You  may  have  seen  it  written  elsewhere  as 
PV  =  NRT,  where  N  =  u/Na  is  the  number  of  moles  of  atoms, 
R  =  kNA,  and  N a  =  6.0  x  1023,  called  Avogadro’s  number,  is 
essentially  the  number  of  hydrogen  atoms  in  1  g  of  hydrogen.) 

Pressure  in  a  car  tire  example  9 

>  After  driving  on  the  freeway  for  a  while,  the  air  in  your  car’s 
tires  heats  up  from  10°C  to  35°C.  How  much  does  the  pressure 
increase? 

>  The  tires  may  expand  a  little,  but  we  assume  this  effect  is  small, 
so  the  volume  is  nearly  constant.  From  the  ideal  gas  law,  the 
ratio  of  the  pressures  is  the  same  as  the  ratio  of  the  absolute 
temperatures, 

P2/Pi  =  Tz/T: 

=  (308  K)/(283  K) 

=  1.09, 


or  a  9%  increase. 
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Discussion  Questions 


A  Compare  the  amount  of  energy  needed  to  heat  1  liter  of  helium  by 
1  degree  with  the  energy  needed  to  heat  1  liter  of  xenon.  In  both  cases, 
the  heating  is  carried  out  in  a  sealed  vessel  that  doesn’t  allow  the  gas  to 
expand.  (The  vessel  is  also  well  insulated.) 

B  Repeat  discussion  question  A  if  the  comparison  is  1  kg  of  helium 
versus  1  kg  of  xenon  (equal  masses,  rather  than  equal  volumes). 

C  Repeat  discussion  question  A,  but  now  compare  1  liter  of  helium  in 
a  vessel  of  constant  volume  with  the  same  amount  of  helium  in  a  vessel 
that  allows  expansion  beyond  the  initial  volume  of  1  liter.  (This  could  be  a 
piston,  or  a  balloon.) 


Section  5.2 
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5.3  Entropy  As  a  Macroscopic  Quantity 

5.3.1  Efficiency  and  grades  of  energy 

Some  forms  of  energy  are  more  convenient  than  others  in  certain 
situations.  You  can’t  run  a  spring-powered  mechanical  clock  on  a 
battery,  and  you  can’t  run  a  battery-powered  clock  with  mechanical 
energy.  However,  there  is  no  fundamental  physical  principle  that 
prevents  you  from  converting  100%  of  the  electrical  energy  in  a 
battery  into  mechanical  energy  or  vice-versa.  More  efficient  motors 
and  generators  are  being  designed  every  year.  In  general,  the  laws 
of  physics  permit  perfectly  efficient  conversion  within  a  broad  class 
of  forms  of  energy. 

Heat  is  different.  Friction  tends  to  convert  other  forms  of  energy 
into  heat  even  in  the  best  lubricated  machines.  When  we  slide  a 
book  on  a  table,  friction  brings  it  to  a  stop  and  converts  all  its  kinetic 
energy  into  heat,  but  we  never  observe  the  opposite  process,  in  which 
a  book  spontaneously  converts  heat  energy  into  mechanical  energy 
and  starts  moving!  Roughly  speaking,  heat  is  different  because  it  is 
disorganized.  Scrambling  an  egg  is  easy.  Unscrambling  it  is  harder. 

We  summarize  these  observations  by  saying  that  heat  is  a  lower 
grade  of  energy  than  other  forms  such  as  mechanical  energy. 

Of  course  it  is  possible  to  convert  heat  into  other  forms  of  energy 
such  as  mechanical  energy,  and  that  is  what  a  car  engine  does  with 
the  heat  created  by  exploding  the  air-gasoline  mixture.  But  a  car 
engine  is  a  tremendously  inefficient  device,  and  a  great  deal  of  the 
heat  is  simply  wasted  through  the  radiator  and  the  exhaust.  Engi¬ 
neers  have  never  succeeded  in  creating  a  perfectly  efficient  device  for 
converting  heat  energy  into  mechanical  energy,  and  we  now  know 
that  this  is  because  of  a  deeper  physical  principle  that  is  far  more 
basic  than  the  design  of  an  engine. 


312 


Chapter  5  Thermodynamics 
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cold 


hot 


warm 


warm 


a  / 1.  The  temperature  difference 
between  the  hot  and  cold  parts  of 
the  air  can  be  used  to  extract  me¬ 
chanical  energy,  for  example  with 
a  fan  blade  that  spins  because 
of  the  rising  hot  air  currents.  2. 
If  the  temperature  of  the  air  is 
first  allowed  to  become  uniform, 
then  no  mechanical  energy  can 
be  extracted.  The  same  amount 
of  heat  energy  is  present,  but  it 
is  no  longer  accessible  for  doing 
mechanical  work. 


5.3.2  Heat  engines 

Heat  may  be  more  useful  in  some  forms  than  in  others,  i.e. ,  there 
are  different  grades  of  heat  energy.  In  figure  a/1,  the  difference  in 
temperature  can  be  used  to  extract  mechanical  work  with  a  fan 
blade.  This  principle  is  used  in  power  plants,  where  steam  is  heated 
by  burning  oil  or  by  nuclear  reactions,  and  then  allowed  to  expand 
through  a  turbine  which  has  cooler  steam  on  the  other  side.  On  a 
smaller  scale,  there  is  a  Christmas  toy,  b,  that  consists  of  a  small 
propeller  spun  by  the  hot  air  rising  from  a  set  of  candles,  very  much 
like  the  setup  shown  in  figure  a. 

In  figure  a/2,  however,  no  mechanical  work  can  be  extracted 
because  there  is  no  difference  in  temperature.  Although  the  air  in 
a/2  has  the  same  total  amount  of  energy  as  the  air  in  a/1,  the  heat 
in  a/2  is  a  lower  grade  of  energy,  since  none  of  it  is  accessible  for 
doing  mechanical  work. 

In  general,  we  define  a  heat  engine  as  any  device  that  takes  heat 
from  a  reservoir  of  hot  matter,  extracts  some  of  the  heat  energy  to  do 
mechanical  work,  and  expels  a  lesser  amount  of  heat  into  a  reservoir 
of  cold  matter.  The  efficiency  of  a  heat  engine  equals  the  amount  of 
useful  work  extracted,  W,  divided  by  the  amount  of  energy  we  had 
to  pay  for  in  order  to  heat  the  hot  reservoir.  This  latter  amount 
of  heat  is  the  same  as  the  amount  of  heat  the  engine  extracts  from 
the  high-temperature  reservoir,  Qh-  (The  letter  Q  is  the  standard 
notation  for  a  transfer  of  heat.)  By  conservation  of  energy,  we  have 
Qh  =  W  +  Qli  where  Ql  is  the  amount  of  heat  expelled  into  the 
low-temperature  reservoir,  so  the  efficiency  of  a  heat  engine,  W/Qh, 
can  be  rewritten  as 

efficiency  =  1  —  [efficiency  of  any  heat  engine] 

Qh 

It  turns  out  that  there  is  a  particular  type  of  heat  engine,  the 
Carnot  engine,  which,  although  not  100%  efficient,  is  more  efficient 


b/A  heat  engine.  Hot  air 
from  the  candles  rises  through 
the  fan  blades,  and  makes  the 
angels  spin. 


c/Sadi  Carnot  (1796-1832) 
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d/The  beginning  of  the  first 
expansion  stroke,  in  which  the 
working  gas  is  kept  in  thermal 
equilibrium  with  the  hot  reservoir. 


e/The  beginning  of  the  sec¬ 
ond  expansion  stroke,  in  which 
the  working  gas  is  thermally 
insulated.  The  working  gas  cools 
because  it  is  doing  work  on  the 
piston  and  thus  losing  energy. 


f/The  beginning  of  the  first 
compression  stroke.  The  working 
gas  begins  the  stroke  at  the  same 
temperature  as  the  cold  reservoir, 
and  remains  in  thermal  contact 
with  it  the  whole  time.  The  engine 
does  negative  work. 


g/The  beginning  of  the  sec¬ 
ond  compression  stroke,  in  which 
mechanical  work  is  absorbed, 
heating  the  working  gas  back  up 
to  Th. 


than  any  other.  The  grade  of  heat  energy  in  a  system  can  thus  be 
unambiguously  defined  in  terms  of  the  amount  of  heat  energy  in  it 
that  cannot  be  extracted  even  by  a  Carnot  engine. 

How  can  we  build  the  most  efficient  possible  engine?  Let’s  start 
with  an  unnecessarily  inefficient  engine  like  a  car  engine  and  see 
how  it  could  be  improved.  The  radiator  and  exhaust  expel  hot 
gases,  which  is  a  waste  of  heat  energy.  These  gases  are  cooler  than 
the  exploded  air-gas  mixture  inside  the  cylinder,  but  hotter  than 
the  air  that  surrounds  the  car.  We  could  thus  improve  the  engine’s 
efficiency  by  adding  an  auxiliary  heat  engine  to  it,  which  would 
operate  with  the  first  engine’s  exhaust  as  its  hot  reservoir  and  the 
air  as  its  cold  reservoir.  In  general,  any  heat  engine  that  expels 
heat  at  an  intermediate  temperature  can  be  made  more  efficient  by 
changing  it  so  that  it  expels  heat  only  at  the  temperature  of  the 
cold  reservoir. 

Similarly,  any  heat  engine  that  absorbs  some  energy  at  an  in¬ 
termediate  temperature  can  be  made  more  efficient  by  adding  an 
auxiliary  heat  engine  to  it  which  will  operate  between  the  hot  reser¬ 
voir  and  this  intermediate  temperature. 

Based  on  these  arguments,  we  define  a  Carnot  engine  as  a  heat 
engine  that  absorbs  heat  only  from  the  hot  reservoir  and  expels  it 
only  into  the  cold  reservoir.  Figures  d-g  show  a  realization  of  a 
Carnot  engine  using  a  piston  in  a  cylinder  filled  with  a  monoatomic 
ideal  gas.  This  gas,  known  as  the  working  fluid,  is  separate  from, 
but  exchanges  energy  with,  the  hot  and  cold  reservoirs.  As  proved 
on  page  328,  this  particular  Carnot  engine  has  an  efficiency  given 
by 

efficiency  =  1  —  — — ,  [efficiency  of  a  Carnot  engine] 

Th 

where  Tl  is  the  temperature  of  the  cold  reservoir  and  Th  is  the 
temperature  of  the  hot  reservoir. 

Even  if  you  do  not  wish  to  dig  into  the  details  of  the  proof, 
the  basic  reason  for  the  temperature  dependence  is  not  so  hard  to 
understand.  Useful  mechanical  work  is  done  on  strokes  d  and  e,  in 
which  the  gas  expands.  The  motion  of  the  piston  is  in  the  same 
direction  as  the  gas’s  force  on  the  piston,  so  positive  work  is  done 
on  the  piston.  In  strokes  f  and  g,  however,  the  gas  does  negative 
work  on  the  piston.  We  would  like  to  avoid  this  negative  work, 
but  we  must  design  the  engine  to  perform  a  complete  cycle.  Luckily 
the  pressures  during  the  compression  strokes  are  lower  than  the  ones 
during  the  expansion  strokes,  so  the  engine  doesn’t  undo  all  its  work 
with  every  cycle.  The  ratios  of  the  pressures  are  in  proportion  to 
the  ratios  of  the  temperatures,  so  if  Tl  is  20%  of  Th,  the  engine  is 
80%  efficient. 

We  have  already  proved  that  any  engine  that  is  not  a  Carnot 
engine  is  less  than  optimally  efficient,  and  it  is  also  true  that  all 
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Carnot  engines  operating  between  a  given  pair  of  temperatures  Th 
and  Tl  have  the  same  efficiency.  (This  can  be  proved  by  the  methods 
of  section  5.4.)  Thus  a  Carnot  engine  is  the  most  efficient  possible 
heat  engine. 

5.3.3  Entropy 

We  would  like  to  have  some  numerical  way  of  measuring  the 
grade  of  energy  in  a  system.  We  want  this  quantity,  called  entropy, 
to  have  the  following  two  properties: 

(1)  Entropy  is  additive.  When  we  combine  two  systems  and 
consider  them  as  one,  the  entropy  of  the  combined  system  equals 
the  sum  of  the  entropies  of  the  two  original  systems.  (Quantities 
like  mass  and  energy  also  have  this  property.) 

(2)  The  entropy  of  a  system  is  not  changed  by  operating  a  Carnot 
engine  within  it. 

It  turns  out  to  be  simpler  and  more  useful  to  define  changes 
in  entropy  than  absolute  entropies.  Suppose  as  an  example  that  a 
system  contains  some  hot  matter  and  some  cold  matter.  It  has  a 
relatively  high  grade  of  energy  because  a  heat  engine  could  be  used 
to  extract  mechanical  work  from  it.  But  if  we  allow  the  hot  and 
cold  parts  to  equilibrate  at  some  lukewarm  temperature,  the  grade 
of  energy  has  gotten  worse.  Thus  putting  heat  into  a  hotter  area 
is  more  useful  than  putting  it  into  a  cold  area.  Motivated  by  these 
considerations,  we  define  a  change  in  entropy  as  follows: 

AS  =  ^  [change  in  entropy  when  adding 

heat  Q  to  matter  at  temperature  T ; 

AS  is  negative  if  heat  is  taken  out] 

A  system  with  a  higher  grade  of  energy  has  a  lower  entropy. 

Entropy  is  additive.  example  10 

Since  changes  in  entropy  are  defined  by  an  additive  quantity  (heat) 
divided  by  a  non-additive  one  (temperature),  entropy  is  additive. 

Entropy  isn’t  changed  by  a  Carnot  engine.  example  1 1 

The  efficiency  of  a  heat  engine  is  defined  by 

efficiency  =  1  -  QL/QH, 

and  the  efficiency  of  a  Carnot  engine  is 

efficiency  =  1  -  TL/TH, 

so  for  a  Carnot  engine  we  have  Ql/Qh  =  Tl/Th,  which  can  be 
rewritten  as  Ql/Tl  =  QH/TH.  The  entropy  lost  by  the  hot  reservoir 
is  therefore  the  same  as  the  entropy  gained  by  the  cold  one. 


h  /  Entropy  can  be  understood 
using  the  metaphor  of  a  water 
wheel.  Letting  the  water  levels 
equalize  is  like  letting  the  entropy 
maximize.  Taking  water  from  the 
high  side  and  putting  it  into  the 
low  side  increases  the  entropy. 
Water  levels  in  this  metaphor 
correspond  to  temperatures  in 
the  actual  definition  of  entropy. 
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Entropy  increases  in  heat  conduction.  example  12 

When  a  hot  object  gives  up  energy  to  a  cold  one,  conservation 
of  energy  tells  us  that  the  amount  of  heat  lost  by  the  hot  object 
is  the  same  as  the  amount  of  heat  gained  by  the  cold  one.  The 
change  in  entropy  is  -Q/TH  +  Q/TL,  which  is  positive  because 
Tl  <  Th. 

Entropy  is  increased  by  a  non-Carnot  engine.  example  13 
The  efficiency  of  a  non-Carnot  engine  is  less  than  1  -  TL/TH, 
so  Ql/Qh  >  Tl/Th  and  Ql/Tl  >  QH/TH.  This  means  that  the 
entropy  increase  in  the  cold  reservoir  is  greater  than  the  entropy 
decrease  in  the  hot  reservoir. 

A  book  sliding  to  a  stop  example  14 

A  book  slides  across  a  table  and  comes  to  a  stop.  Once  it  stops, 
all  its  kinetic  energy  has  been  transformed  into  heat.  As  the  book 
and  table  heat  up,  their  entropies  both  increase,  so  the  total  en¬ 
tropy  increases  as  well. 

All  of  these  examples  involved  closed  systems,  and  in  all  of  them 
the  total  entropy  either  increased  or  stayed  the  same.  It  never  de¬ 
creased.  Here  are  two  examples  of  schemes  for  decreasing  the  en¬ 
tropy  of  a  closed  system,  with  explanations  of  why  they  don’t  work. 

Using  a  refrigerator  to  decrease  entropy?  example  15 

t>  A  refrigerator  takes  heat  from  a  cold  area  and  dumps  it  into  a 
hot  area.  (1)  Does  this  lead  to  a  net  decrease  in  the  entropy  of 
a  closed  system?  (2)  Could  you  make  a  Carnot  engine  more  ef¬ 
ficient  by  running  a  refrigerator  to  cool  its  low-temperature  reser¬ 
voir  and  eject  heat  into  its  high-temperature  reservoir? 

>  (1)  No.  The  heat  that  comes  off  of  the  radiator  coils  is  a  great 
deal  more  than  the  heat  the  fridge  removes  from  inside;  the  dif¬ 
ference  is  what  it  costs  to  run  your  fridge.  The  heat  radiated  from 
the  coils  is  so  much  more  than  the  heat  removed  from  the  inside 
that  the  increase  in  the  entropy  of  the  air  in  the  room  is  greater 
than  the  decrease  of  the  entropy  inside  the  fridge.  The  most  ef¬ 
ficient  refrigerator  is  actually  a  Carnot  engine  running  in  reverse, 
which  leads  to  neither  an  increase  nor  a  decrease  in  entropy. 

(2)  No.  The  most  efficient  refrigerator  is  a  reversed  Carnot  en¬ 
gine.  You  will  not  achieve  anything  by  running  one  Carnot  engine 
in  reverse  and  another  forward.  They  will  just  cancel  each  other 
out. 

Maxwell’s  demon  example  16 

>  Maxwell  imagined  a  pair  of  rooms,  their  air  being  initially  in  ther¬ 
mal  equilibrium,  having  a  partition  across  the  middle  with  a  tiny 
door.  A  miniscule  demon  is  posted  at  the  door  with  a  little  ping- 
pong  paddle,  and  his  duty  is  to  try  to  build  up  faster-moving  air 
molecules  in  room  B  and  slower  moving  ones  in  room  A.  For  in¬ 
stance,  when  a  fast  molecule  is  headed  through  the  door,  going 
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from  A  to  B,  he  lets  it  by,  but  when  a  slower  than  average  molecule 
tries  the  same  thing,  he  hits  it  back  into  room  A.  Would  this  de¬ 
crease  the  total  entropy  of  the  pair  of  rooms? 

>  No.  The  demon  needs  to  eat,  and  we  can  think  of  his  body 
as  a  little  heat  engine,  and  his  metabolism  is  less  efficient  than  a 
Carnot  engine,  so  he  ends  up  increasing  the  entropy  rather  than 
decreasing  it. 

Observations  such  as  these  lead  to  the  following  hypothesis, 
known  as  the  second  law  of  thermodynamics: 


The  entropy  of  a  closed  system  always  increases,  or  at  best  stays 
the  same:  AS  >  0. 


At  present  our  arguments  to  support  this  statement  may  seem 
less  than  convincing,  since  they  have  so  much  to  do  with  obscure 
facts  about  heat  engines.  In  the  following  section  we  will  find  a  more 
satisfying  and  fundamental  explanation  for  the  continual  increase  in 
entropy.  To  emphasize  the  fundamental  and  universal  nature  of  the 
second  law,  here  are  a  few  exotic  examples. 

Entropy  and  evolution  example  1 7 

A  favorite  argument  of  many  creationists  who  don’t  believe  in  evo¬ 
lution  is  that  evolution  would  violate  the  second  law  of  thermody¬ 
namics:  the  death  and  decay  of  a  living  thing  releases  heat  (as 
when  a  compost  heap  gets  hot)  and  lessens  the  amount  of  en¬ 
ergy  available  for  doing  useful  work,  while  the  reverse  process, 
the  emergence  of  life  from  nonliving  matter,  would  require  a  de¬ 
crease  in  entropy.  Their  argument  is  faulty,  since  the  second  law 
only  applies  to  closed  systems,  and  the  earth  is  not  a  closed  sys¬ 
tem.  The  earth  is  continuously  receiving  energy  from  the  sun. 

The  heat  death  of  the  universe  example  18 

Living  things  have  low  entropy:  to  demonstrate  this  fact,  observe 
how  a  compost  pile  releases  heat,  which  then  equilibrates  with 
the  cooler  environment.  We  never  observe  dead  things  to  leap 
back  to  life  after  sucking  some  heat  energy  out  of  their  environ¬ 
ments!  The  only  reason  life  was  able  to  evolve  on  earth  was  that 
the  earth  was  not  a  closed  system:  it  got  energy  from  the  sun, 
which  presumably  gained  more  entropy  than  the  earth  lost. 

Victorian  philosophers  spent  a  lot  of  time  worrying  about  the  heat 
death  of  the  universe:  eventually  the  universe  would  have  to  be¬ 
come  a  high-entropy,  lukewarm  soup,  with  no  life  or  organized 
motion  of  any  kind.  Fortunately  (?),  we  now  know  a  great  many 
other  things  that  will  make  the  universe  inhospitable  to  life  long 
before  its  entropy  is  maximized.  Life  on  earth,  for  instance,  will 
end  when  the  sun  evolves  into  a  hotter  state  and  boils  away  our 
oceans. 
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Hawking  radiation  example  19 

Any  process  that  could  destroy  heat  (or  convert  it  into  noth¬ 
ing  but  mechanical  work)  would  lead  to  a  reduction  in  entropy. 
Black  holes  are  supermassive  stars  whose  gravity  is  so  strong 
that  nothing,  not  even  light,  can  escape  from  them  once  it  gets 
within  a  boundary  known  as  the  event  horizon.  Black  holes  are 
commonly  observed  to  suck  hot  gas  into  them.  Does  this  lead  to 
a  reduction  in  the  entropy  of  the  universe?  Of  course  one  could 
argue  that  the  entropy  is  still  there  inside  the  black  hole,  but  being 
able  to  “hide”  entropy  there  amounts  to  the  same  thing  as  being 
able  to  destroy  entropy. 

The  physicist  Steven  Hawking  was  bothered  by  this  question,  and 
finally  realized  that  although  the  actual  stuff  that  enters  a  black 
hole  is  lost  forever,  the  black  hole  will  gradually  lose  energy  in  the 
form  of  light  emitted  from  just  outside  the  event  horizon.  This  light 
ends  up  reintroducing  the  original  entropy  back  into  the  universe 
at  large. 

Discussion  Questions 

A  In  this  discussion  question,  you’ll  think  about  a  car  engine  in  terms  of 
thermodynamics.  Note  that  an  internal  combustion  engine  doesn’t  fit  very 
well  into  the  theoretical  straightjacket  of  a  heat  engine.  For  instance,  a 
heat  engine  has  a  high-temperature  heat  reservoir  at  a  single  well-defined 
temperature,  TH.  In  a  typical  car  engine,  however,  there  are  several  very 
different  temperatures  you  could  imagine  using  for  TH\  the  temperature 
of  the  engine  block  (~  1 00°C),  the  walls  of  the  cylinder  (—  250°C),  or  the 
temperature  of  the  exploding  air-gas  mixture  (~  1000°C,  with  significant 
changes  over  a  four-stroke  cycle).  Let’s  use  TH  —  1000°C. 

Burning  gas  supplies  heat  energy  QH  to  your  car’s  engine.  The  engine 
does  mechanical  work  W ,  but  also  expels  heat  Ql  into  the  environment 
through  the  radiator  and  the  exhaust.  Conservation  of  energy  gives 

Qh  =  Ql+  W, 

and  the  relative  proportions  of  QL  and  W  are  usually  about  90%  to  10%. 
(Actually  it  depends  quite  a  bit  on  the  type  of  car,  the  driving  conditions, 
etc.) 

(1)  Ql  is  obviously  undesirable:  you  pay  for  it,  but  all  it  does  is  heat  the 
neighborhood.  Suppose  that  engineers  do  a  really  good  job  of  getting  rid 
of  the  effects  that  create  QL,  such  as  friction.  Could  QL  ever  be  reduced 
to  zero,  at  least  theoretically? 

(2)  A  gallon  of  gas  releases  about  140  MJ  of  heat  QH  when  burned.  Es¬ 
timate  the  change  in  entropy  of  the  universe  due  to  running  a  typical  car 
engine  and  burning  one  gallon  of  gas.  (You’ll  have  to  estimate  how  hot 
the  environment  is.  For  the  sake  of  argument,  assume  that  the  work  done 
by  the  engine,  W,  remains  in  the  form  of  mechanical  energy,  although  in 
reality  it  probably  ends  up  being  changed  into  heat  when  you  step  on  the 
brakes.)  Is  your  result  consistent  with  the  second  law  of  thermodynamics? 

(3)  What  would  happen  if  you  redid  the  calculation  in  #2,  but  assumed 
Ql  =  0?  Is  this  consistent  with  your  answer  to  #1  ? 
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B  When  we  run  the  Carnot  engine  in  figures  d-g,  there  are  four  parts 
of  the  universe  that  undergo  changes  in  their  physical  states:  the  hot 
reservoir,  the  cold  reservoir,  the  working  gas,  and  the  outside  world  to 
which  the  shaft  is  connected  in  order  to  do  physical  work.  Over  one  full 
cycle,  discuss  which  of  these  parts  gain  entropy,  which  ones  lose  entropy, 
and  which  ones  keep  the  same  entropy.  During  which  of  the  four  strokes 
do  these  changes  occur? 

5.4  Entropy  As  a  Microscopic  Quantity 

5.4.1  A  microscopic  view  of  entropy 

To  understand  why  the  second  law  of  thermodynamics  is  always 
true,  we  need  to  see  what  entropy  really  means  at  the  microscopic 
level.  An  example  that  is  easy  to  visualize  is  the  free  expansion  of 
a  monoatomic  gas.  Figure  a/1  shows  a  box  in  which  all  the  atoms 
of  the  gas  are  confined  on  one  side.  We  very  quickly  remove  the 
barrier  between  the  two  sides,  a/2,  and  some  time  later,  the  system 
has  reached  an  equilibrium,  a/3.  Each  snapshot  shows  both  the  po¬ 
sitions  and  the  momenta  of  the  atoms,  which  is  enough  information 
to  allow  us  in  theory  to  extrapolate  the  behavior  of  the  system  into 
the  future,  or  the  past.  However,  with  a  realistic  number  of  atoms, 
rather  than  just  six,  this  would  be  beyond  the  computational  power 
of  any  computer.2 

But  suppose  we  show  figure  a/2  to  a  friend  without  any  further 
information,  and  ask  her  what  she  can  say  about  the  system’s  behav¬ 
ior  in  the  future.  She  doesn’t  know  how  the  system  was  prepared. 
Perhaps,  she  thinks,  it  was  just  a  strange  coincidence  that  all  the 
atoms  happened  to  be  in  the  right  half  of  the  box  at  this  particular 
moment.  In  any  case,  she  knows  that  this  unusual  situation  won’t 
last  for  long.  She  can  predict  that  after  the  passage  of  any  signifi¬ 
cant  amount  of  time,  a  surprise  inspection  is  likely  to  show  roughly 
half  the  atoms  on  each  side.  The  same  is  true  if  you  ask  her  to  say 
what  happened  in  the  past.  She  doesn’t  know  about  the  barrier, 
so  as  far  as  she’s  concerned,  extrapolation  into  the  past  is  exactly 
the  same  kind  of  problem  as  extrapolation  into  the  future.  We  just 
have  to  imagine  reversing  all  the  momentum  vectors,  and  then  all 
our  reasoning  works  equally  well  for  backwards  extrapolation.  She 
would  conclude,  then,  that  the  gas  in  the  box  underwent  an  unusual 
fluctuation,  b,  and  she  knows  that  the  fluctuation  is  very  unlikely 
to  exist  very  far  into  the  future,  or  to  have  existed  very  far  into  the 
past. 

What  does  this  have  to  do  with  entropy?  Well,  state  a/3  has 
a  greater  entropy  than  state  a/2.  It  would  be  easy  to  extract  me¬ 
chanical  work  from  a/2,  for  instance  by  letting  the  gas  expand  while 


2Even  with  smaller  numbers  of  atoms,  there  is  a  problem  with  this  kind  of 
brute-force  computation,  because  the  tiniest  measurement  errors  in  the  initial 
state  would  end  up  having  large  effects  later  on. 


a  /  A  gas  expands  freely,  doubling 
its  volume. 


b/An  unusual  fluctuation  in 
the  distribution  of  the  atoms 
between  the  two  sides  of  the 
box.  There  has  been  no  external 
manipulation  as  in  figure  a/1. 
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c  /  Earth  orbit  is  becoming  clut¬ 
tered  with  space  junk,  and  the 
pieces  can  be  thought  of  as  the 
“molecules”  comprising  an  exotic 
kind  of  gas.  These  image  shows 
the  evolution  of  a  cloud  of  debris 
arising  from  a  2007  Chinese  test 
of  an  anti-satellite  rocket.  Pan¬ 
els  1-4  show  the  cloud  five  min¬ 
utes,  one  hour,  one  day,  and  one 
month  after  the  impact.  The  en¬ 
tropy  seems  to  have  maximized 
by  panel  4. 


R=1 

R=2 

R=0 

R=1 

*1 

d  /  The  phase  space  for  two 
atoms  in  a  box. 


e  /  The  phase  space  for  three 
atoms  in  a  box. 


pressing  on  a  piston  rather  than  simply  releasing  it  suddenly  into  the 
void.  There  is  no  way  to  extract  mechanical  work  from  state  a/3. 
Roughly  speaking,  our  microscopic  description  of  entropy  relates  to 
the  number  of  possible  states.  There  are  a  lot  more  states  like  a/3 
than  there  are  states  like  a/2.  Over  long  enough  periods  of  time 
—  long  enough  for  equilibration  to  occur  —  the  system  gets  mixed 
up,  and  is  about  equally  likely  to  be  in  any  of  its  possible  states, 
regardless  of  what  state  it  was  initially  in.  We  define  some  number 
that  describes  an  interesting  property  of  the  whole  system,  say  the 
number  of  atoms  in  the  right  half  of  the  box,  R.  A  high-entropy 
value  of  R  is  one  like  R  =  3,  which  allows  many  possible  states.  We 
are  far  more  likely  to  encounter  R  =  3  than  a  low-entropy  value  like 
R  =  0  or  R  =  6. 

5.4.2  Phase  space 

There  is  a  problem  with  making  this  description  of  entropy  into  a 
mathematical  definition.  The  problem  is  that  it  refers  to  the  number 
of  possible  states,  but  that  number  is  theoretically  infinite.  To  get 
around  the  problem,  we  coarsen  our  description  of  the  system.  For 
the  atoms  in  figure  a,  we  don’t  really  care  exactly  where  each  atom 
is.  We  only  care  whether  it  is  in  the  right  side  or  the  left  side.  If  a 
particular  atom’s  left-right  position  is  described  by  a  coordinate  x, 
then  the  set  of  all  possible  values  of  x  is  a  line  segment  along  the 
x  axis,  containing  an  infinite  number  of  points.  We  break  this  line 
segment  down  into  two  halves,  each  of  width  Ax,  and  we  consider 
two  different  values  of  x  to  be  variations  on  the  same  state  if  they 
both  lie  in  the  same  half.  For  our  present  purposes,  we  can  also 
ignore  completely  the  y  and  z  coordinates,  and  all  three  momentum 
components,  px,  py,  and  pz. 

Now  let’s  do  a  real  calculation.  Suppose  there  are  only  two  atoms 
in  the  box,  with  coordinates  x\  and  X2-  We  can  give  all  the  relevant 
information  about  the  state  of  the  system  by  specifying  one  of  the 
cells  in  the  grid  shown  in  figure  d.  This  grid  is  known  as  the  phase 
space  of  the  system.3  The  lower  right  cell,  for  instance,  describes 


liThe  term  is  a  little  obscure.  Basically  the  idea  is  the  same  as  in  “my  toddler 
is  going  through  a  phase  where  he  always  says  no.”  The  “phase”  is  a  stage  in 
the  evolution  of  the  system,  a  snapshot  of  its  state  at  a  moment  in  time.  The 
usage  is  also  related  to  the  concept  of  Lissajous  figures,  in  which  a  particular 
point  on  the  trajectory  is  defined  by  the  phases  of  the  oscillations  along  the  x 
and  y  axes. 
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a  state  in  which  atom  number  1  is  in  the  right  side  of  the  box  and 
atom  number  2  in  the  left.  Since  there  are  two  possible  states  with 
R  =  1  and  only  one  state  with  R  =  2,  we  are  twice  as  likely  to 
observe  R  =  1,  and  R  =  1  has  higher  entropy  than  R  =  2. 

Figure  e  shows  a  corresponding  calculation  for  three  atoms,  which 
makes  the  phase  space  three-dimensional.  Here,  the  R  =  1  and  2 
states  are  three  times  more  likely  than  R  =  0  and  3.  Four  atoms 
would  require  a  four-dimensional  phase  space,  which  exceeds  our 
ability  to  visualize.  Although  our  present  example  doesn’t  require 
it,  a  phase  space  can  describe  momentum  as  well  as  position,  as 
shown  in  figure  f.  In  general,  a  phase  space  for  a  monoatomic  gas 
has  six  dimensions  per  atom  (one  for  each  coordinate  and  one  for 
each  momentum  component). 

5.4.3  Microscopic  definitions  of  entropy  and  temperature 

Two  more  issues  need  to  be  resolved  in  order  to  make  a  micro¬ 
scopic  definition  of  entropy. 

First,  if  we  defined  entropy  as  the  number  of  possible  states, 
it  would  be  a  multiplicative  quantity,  not  an  additive  one:  if  an 
ice  cube  in  a  glass  of  water  has  M\  states  available  to  it,  and  the 
number  of  states  available  to  the  water  is  M2,  then  the  number  of 
possible  states  of  the  whole  system  is  the  product  Mi  M2.  To  get 
around  this  problem,  we  take  the  natural  logarithm  of  the  number 
of  states,  which  makes  the  entropy  additive  because  of  the  property 
of  the  logarithm  ln(MiM2)  =  In  Mi  +  In  M2. 

The  second  issue  is  a  more  trivial  one.  The  concept  of  entropy 
was  originally  invented  as  a  purely  macroscopic  quantity,  and  the 
macroscopic  definition  AS  =  Q/T,  which  has  units  of  J/K,  has  a 
different  calibration  than  would  result  from  defining  S  =  InM.  The 
calibration  constant  we  need  turns  out  to  be  simply  the  Boltzmann 
constant,  k. 

Microscopic  definition  of  entropy:  The  entropy  of  a  system  is 
S  =  k  In  M,  where  M  is  the  number  of  available  states.4 

This  also  leads  to  a  more  fundamental  definition  of  temperature. 
Two  systems  are  in  thermal  equilibrium  when  they  have  maximized 
their  combined  entropy  through  the  exchange  of  energy.  Here  the 
energy  possessed  by  one  part  of  the  system,  E\  or  £2,  plays  the 
same  role  as  the  variable  R  in  the  examples  of  free  expansion  above. 
A  maximum  of  a  function  occurs  when  the  derivative  is  zero,  so  the 
maximum  entropy  occurs  when 

dQSi  +  fr) 

d£i 

We  assume  the  systems  are  only  able  to  exchange  heat  energy  with 

4This  is  the  same  relation  as  the  one  on  Boltzmann’s  tomb,  just  in  a  slightly 
different  notation. 


P 


X 

f  /  A  phase  space  for  a  sin¬ 
gle  atom  in  one  dimension,  taking 
momentum  into  account. 


g  /  Ludwig  Boltzmann’s  tomb, 
inscribed  with  his  equation  for 
entropy. 
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each  other,  dE\  =  —  dE 2,  so 


llo  (L8  0l6  0A  6.2  0.0 
fraction  of  energy  in  atom  2 


h  /  A  two-atom  system  has 
the  highest  number  of  available 
states  when  the  energy  is  equally 
divided.  Equal  energy  division  is 
therefore  the  most  likely  possibil¬ 
ity  at  any  given  moment  in  time. 


fraction  of  energy  in  atoms  1-10 

L0  (L8  0l6  (L4  0.2  0.0 
fraction  of  energy  in  atoms  1 1  -20 

i/When  two  systems  of  10 
atoms  each  interact,  the  graph  of 
the  number  of  possible  states  is 
narrower  than  with  only  one  atom 
in  each  system. 


dSi  _  dS2 
dE[  ~  dE2 ’ 

and  since  the  energy  is  being  exchanged  in  the  form  of  heat  we  can 
make  the  equations  look  more  familiar  if  we  write  d Q  for  an  amount 
of  heat  to  be  transferred  into  either  system: 

dSi  _  dS2 
dQ\  d  Q2 

In  terms  of  our  previous  definition  of  entropy,  this  is  equivalent  to 
1/Ti  =  1/T2,  which  makes  perfect  sense  since  the  systems  are  in 
thermal  equilibrium.  According  to  our  new  approach,  entropy  has 
already  been  defined  in  a  fundamental  manner,  so  we  can  take  this 
as  a  definition  of  temperature: 

1  _  cLS 
T~dQ1 

where  dS'  represents  the  increase  in  the  system’s  entropy  from  adding 
heat  dQ  to  it. 

Examples  with  small  numbers  of  atoms 

Let’s  see  how  this  applies  to  an  ideal,  monoatomic  gas  with  a 
small  number  of  atoms.  To  start  with,  consider  the  phase  space 
available  to  one  atom.  Since  we  assume  the  atoms  in  an  ideal  gas 
are  noninteracting,  their  positions  relative  to  each  other  are  really  ir¬ 
relevant.  We  can  therefore  enumerate  the  number  of  states  available 
to  each  atom  just  by  considering  the  number  of  momentum  vectors 
it  can  have,  without  considering  its  possible  locations.  The  relation¬ 
ship  between  momentum  and  kinetic  energy  is  E  =  (p2+py+p2)/2m, 
so  if  for  a  fixed  value  of  its  energy,  we  arrange  all  of  an  atom’s  possi¬ 
ble  momentum  vectors  with  their  tails  at  the  origin,  their  tips  all  lie 
on  the  surface  of  a  sphere  in  phase  space  with  radius  |p|  =  \j2mE. 
The  number  of  possible  states  for  that  atom  is  proportional  to  the 
sphere’s  surface  area,  which  in  turn  is  proportional  to  the  square  of 
the  sphere’s  radius,  |p|  2  =  2  mE. 

Now  consider  two  atoms.  For  any  given  way  of  sharing  the  en¬ 
ergy  between  the  atoms,  E  =  E\  +  E2,  the  number  of  possible 
combinations  of  states  is  proportional  to  E\E2.  The  result  is  shown 
in  figure  h.  The  greatest  number  of  combinations  occurs  when  we 
divide  the  energy  equally,  so  an  equal  division  gives  maximum  en¬ 
tropy. 

By  increasing  the  number  of  atoms,  we  get  a  graph  whose  peak 
is  narrower,  i.  With  more  than  one  atom  in  each  system,  the  total 
energy  is  E  =  (p2x  l  +  p2yl  +p2z  l  +p2Xj2  +p2y  2  +p2z  2  +  ...)/2m.  With  n 
atoms,  a  total  of  3 n  momentum  coordinates  are  needed  in  order  to 
specify  their  state,  and  such  a  set  of  numbers  is  like  a  single  point 
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in  a  3n-dimensional  space  (which  is  impossible  to  visualize).  For  a 
given  total  energy  E,  the  possible  states  are  like  the  surface  of  a 
3?r-dimensional  sphere,  with  a  surface  area  proportional  to  p3n~l , 
or  i?(3n-b/2.  The  graph  in  figure  i,  for  example,  was  calculated 
according  to  the  formula  E^^E2^2  =  E29^2(E  —  E\ )29/2. 

Since  graph  i  is  narrower  than  graph  h,  the  fluctuations  in  energy 
sharing  are  smaller.  If  we  inspect  the  system  at  a  random  moment  in 
time,  the  energy  sharing  is  very  unlikely  to  be  more  lopsided  than 
a  40-60  split.  Now  suppose  that,  instead  of  10  atoms  interacting 
with  10  atoms,  we  had  a  1023  atoms  interacting  with  1023  atoms. 
The  graph  would  be  extremely  narrow,  and  it  would  be  a  statistical 
certainty  that  the  energy  sharing  would  be  nearly  perfectly  equal. 
This  is  why  we  never  observe  a  cold  glass  of  water  to  change  itself 
into  an  ice  cube  sitting  in  some  warm  water! 

By  the  way,  note  that  although  we’ve  redefined  temperature, 
these  examples  show  that  things  are  coming  out  consistent  with  the 
old  definition,  since  we  saw  that  the  old  definition  of  temperature 
could  be  described  in  terms  of  the  average  energy  per  atom,  and 
here  we’re  finding  that  equilibration  results  in  each  subset  of  the 
atoms  having  an  equal  share  of  the  energy. 


Entropy  of  a  monoatomic  ideal  gas 

Let’s  calculate  the  entropy  of  a  monoatomic  ideal  gas  of  n  atoms. 
This  is  an  important  example  because  it  allows  us  to  show  that 
our  present  microscopic  treatment  of  thermodynamics  is  consistent 
with  our  previous  macroscopic  approach,  in  which  temperature  was 
defined  in  terms  of  an  ideal  gas  thermometer. 

The  number  of  possible  locations  for  each  atom  is  V/ Ax3,  where 
Ax  is  the  size  of  the  space  cells  in  phase  space.  The  number  of  pos¬ 
sible  combinations  of  locations  for  the  atoms  is  therefore  (V / Ax3)n. 

The  possible  momenta  cover  the  surface  of  a  3n-dimensional 
sphere,  whose  radius  is  \j2mE,  and  whose  surface  area  is  therefore 
proportional  to  £;(3n~1)/2.  In  terms  of  phase-space  cells,  this  area 
corresponds  to  £,(3n~1)/2/Ap3r!  possible  combinations  of  momenta, 
multiplied  by  some  constant  of  proportionality  which  depends  on 
m,  the  atomic  mass,  and  n,  the  number  of  atoms.  To  avoid  having 
to  calculate  this  constant  of  proportionality,  we  limit  ourselves  to 
calculating  the  part  of  the  entropy  that  does  not  depend  on  n,  so 
the  resulting  formula  will  not  be  useful  for  comparing  entropies  of 
ideal  gas  samples  with  different  numbers  of  atoms. 


The  final  result  for  the  number  of  available  states  is 
/  V  \n  £(3n-1)/2 

M  =  (.A^3  )  A p3n-t  ’  [function  of  n] 

so  the  entropy  is 


S  =  nk  In  V  +  -nk  In  E  +  (function  of  Ax,  A p,  and  n), 
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where  the  distinction  between  n  and  n  —  1  has  been  ignored.  Using 
PV  =  nkT  and  E  =  (3/2 )nkT,  we  can  also  rewrite  this  as 


S 


5 

2 


nk  In  T—nk  In  P+. . . , 


[entropy  of  a  monoatomic  ideal  gas] 


where  indicates  terms  that  may  depend  on  Ax,  A p,  m,  and 

n,  but  that  have  no  effect  on  comparisons  of  gas  samples  with  the 
same  number  of  atoms. 


self-check  C 

Why  does  it  make  sense  that  the  temperature  term  has  a  positive  sign 
in  the  above  example,  while  the  pressure  term  is  negative?  Why  does 
it  make  sense  that  the  whole  thing  is  proportional  to  n?  >  Answer,  p. 
964 

To  show  consistency  with  the  macroscopic  approach  to  thermo¬ 
dynamics,  we  need  to  show  that  these  results  are  consistent  with 
the  behavior  of  an  ideal-gas  thermometer.  Using  the  new  defini¬ 
tion  1/T  =  dS/  dQ,  we  have  1/T  =  dS/  dE,  since  transferring  an 
amount  of  heat  dQ  into  the  gas  increases  its  energy  by  a  correspond¬ 
ing  amount.  Evaluating  the  derivative,  we  find  1/T  =  (3/2 )nk/E, 
or  E  =  (3/2 )nkT,  which  is  the  correct  relation  for  a  monoatomic 
ideal  gas. 

A  mixture  of  molecules  example  20 

>  Suppose  we  have  a  mixture  of  two  different  monoatomic  gases, 
say  helium  and  argon.  How  would  we  find  the  entropy  of  such  a 
mixture  (say,  in  terms  of  V  and  E)?  How  would  the  energy  be 
shared  between  the  two  types  of  molecules,  i.e.,  would  a  more 
massive  argon  atom  have  more  energy  on  the  average  than  a 
less  massive  helium  atom,  the  same,  or  less? 

>  Since  entropy  is  additive,  we  simply  need  to  add  the  entropies  of 
the  two  types  of  atom.  However,  the  expression  derived  above  for 
the  entropy  omitted  the  dependence  on  the  mass  m  of  the  atom, 
which  is  different  for  the  two  constituents  of  the  gas,  so  we  need 
to  go  back  and  figure  out  how  to  put  that  m-dependence  back  in. 
The  only  place  where  we  threw  away  m's  was  when  we  identified 
the  radius  of  the  sphere  in  momentum  space  with  \/2. mE,  but 
then  threw  away  the  constant  factor  of  m.  In  other  words,  the  final 
result  can  be  generalized  merely  by  replacing  E  everywhere  with 
the  product  mE.  Since  the  log  of  a  product  is  the  sum  of  the  logs, 
the  dependence  of  the  final  result  on  m  and  E  can  be  broken 
apart  into  two  different  terms,  and  we  find 

3  3 

S  =  nk  In  V  +  -  nk  In  m  +  -nk  In  E  +  . . . 

2  2 

The  total  entropy  of  the  mixture  can  then  be  written  as 

3  3 

S  =  n-\ k  In  V  +  n2k  In  V  +  -n-\ k\om^  +  -n2k  In  m2 

3  3 

+  -/7i  k  In  Ei  +  -n2k  In  E2  +  . . . 
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Now  what  about  the  energy  sharing?  If  the  total  energy  is  E  = 
E1  +  E2,  then  the  most  ovewhelmingly  probable  sharing  of  energy 
will  the  the  one  that  maximizes  the  entropy.  Notice  that  the  depen¬ 
dence  of  the  entropy  on  the  masses  m-\  and  m2  occurs  in  terms 
that  are  entirely  separate  from  the  energy  terms.  If  we  want  to 
maximize  S  with  respect  to  E \  (with  E2  =  E  -  E-\  by  conservation 
of  energy),  then  we  differentiate  S  with  respect  to  E^  and  set  it 
equal  to  zero.  The  terms  that  contain  the  masses  don’t  have  any 
dependence  on  E-i ,  so  their  derivatives  are  zero,  and  we  find  that 
the  molecular  masses  can  have  no  effect  on  the  energy  sharing. 
Setting  the  derivative  equal  to  zero,  we  have 


0 


0 

rh 

Ei 


d 

dEi 


(  3  3 

(  n-i  k  In  V  +  n2k  In  V  +  n i  k  In  m-\  +  -n2k  In  m2 

+  |n1/clnE1  +  |n2/cln(E-E1)  +  ...') 
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In  other  words,  each  gas  gets  a  share  of  the  energy  in  proportion 
to  the  number  of  its  atoms,  and  therefore  every  atom  gets,  on 
average,  the  same  amount  of  energy,  regardless  of  its  mass.  The 
result  for  the  average  energy  per  atom  is  exactly  the  same  as  for 
an  unmixed  gas,  K  =  (3/2)kT. 


Equipartition 

Example  20  is  a  special  case  of  a  more  general  statement  called 
the  equipartition  theorem.  Suppose  we  have  only  one  argon  atom, 
named  Alice,  and  one  helium  atom,  named  Harry.  Their  total  ki¬ 
netic  energy  is  E  =  p^/2m  +  Py/2m.  +  p2z/2m.  +  p'2x/2m'  +p'2y/2m'  + 
p,2z/2m! ,  where  the  primes  indicate  Harry.  We  have  six  terms  that 
all  look  alike.  The  only  difference  among  them  is  that  the  constant 
factors  attached  to  the  squares  of  the  momenta  have  different  val¬ 
ues,  but  we’ve  just  proved  that  those  differences  don’t  matter.  In 
other  words,  if  we  have  any  system  at  all  whose  energy  is  of  the 
form  E  =  (. .  ,)p\  +  (. .  .)p2  +  . . .,  with  any  number  of  terms,  then 
each  term  holds,  on  average,  the  same  amount  of  energy,  \kT.  We 
say  that  the  system  consisting  of  Alice  and  Harry  has  six  degrees  of 
freedom.  It  doesn’t  even  matter  whether  the  things  being  squared 
are  momenta:  if  you  look  back  over  the  logical  steps  that  went  into 
the  argument,  you’ll  see  that  none  of  them  depended  on  that.  In  a 
solid,  for  example,  the  atoms  aren’t  free  to  wander  around,  but  they 
can  vibrate  from  side  to  side.  If  an  atom  moves  away  from  its  equi¬ 
librium  position  at  x  =  0  to  some  other  value  of  x,  then  its  electrical 
energy  is  (l/2)/tx2,  where  k  is  the  spring  constant  (written  as  the 
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j  /  An  experiment  for  deter¬ 
mining  the  shapes  of  molecules. 


Greek  letter  kappa  to  distinguish  it  from  the  Boltzmann  constant  k ) . 
We  can  conclude  that  each  atom  in  the  solid,  on  average,  has  \ kT  of 
energy  in  the  electrical  energy  due  to  its  x  displacement  along  the  x 
axis,  and  equal  amounts  for  y  and  This  is  known  as  equipartition, 
meaning  equal  partitioning,  or  equal  sharing.  The  equipartition  the¬ 
orem  says  that  if  the  expression  for  the  energy  looks  like  a  sum  of 
squared  variables,  then  each  degree  of  freedom  has  an  average  en¬ 
ergy  of  \kT.  Thus,  very  generally,  we  can  interpret  temperature  as 
the  average  energy  per  degree  of  freedom  (times  k/2). 

An  unexpected  glimpse  of  the  microcosm 

You  may  have  the  feeling  at  this  point  that  of  course  Boltzmann 
was  right  about  the  literal  existence  of  atoms,  but  only  very  so¬ 
phisticated  experiments  could  vindicate  him  definitively.  After  all, 
the  microscopic  and  macroscopic  definitions  of  entropy  are  equiva¬ 
lent,  so  it  might  seem  as  though  there  was  no  real  advantage  to  the 
microscopic  approach.  Surprisingly,  very  simple  experiments  are  ca¬ 
pable  of  revealing  a  picture  of  the  microscopic  world,  and  there  is 
no  possible  macroscopic  explanation  for  their  results. 

In  1819,  before  Boltzmann  was  born,  Clement  and  Desormes  did 
an  experiment  like  the  one  shown  in  figure  j.  The  gas  in  the  flask 
is  pressurized  using  the  syringe.  This  heats  it  slightly,  so  it  is  then 
allowed  to  cool  back  down  to  room  temperature.  Its  pressure  is 
measured  using  the  manometer.  The  stopper  on  the  flask  is  popped 
and  then  immediately  reinserted.  Its  pressure  is  now  equalized  with 
that  in  the  room,  and  the  gas’s  expansion  has  cooled  it  a  little, 
because  it  did  mechanical  work  on  its  way  out  of  the  flask,  causing 
it  to  lose  some  of  its  internal  energy  E.  The  expansion  is  carried  out 
quickly  enough  so  that  there  is  not  enough  time  for  any  significant 
amount  of  heat  to  flow  in  through  the  walls  of  the  flask  before  the 
stopper  is  reinserted.  The  gas  is  now  allowed  to  come  back  up 
to  room  temperature  (which  takes  a  much  longer  time),  and  as  a 
result  regains  a  fraction  b  of  its  original  overpressure.  During  this 
const  ant- volume  reheating,  we  have  PV  =  nkT,  so  the  amount  of 
pressure  regained  is  a  direct  indication  of  how  much  the  gas  cooled 
down  when  it  lost  an  amount  of  energy  A  E. 

If  the  gas  is  monoatomic,  then  we  know  what  to  expect  for  this 
relationship  between  energy  and  temperature:  A E  =  (3/2)nfcAT, 
where  the  factor  of  3  came  ultimately  from  the  fact  that  the  gas 
was  in  a  three-dimensional  space,  k/1.  Moving  in  this  space,  each 
molecule  can  have  momentum  in  the  x,  y,  and  z  directions.  It  has 
three  degrees  of  freedom.  What  if  the  gas  is  not  monoatomic? 
Air,  for  example,  is  made  of  diatomic  molecules,  k/2.  There  is  a 
subtle  difference  between  the  two  cases.  An  individual  atom  of  a 
monoatomic  gas  is  a  perfect  sphere,  so  it  is  exactly  the  same  no 
matter  how  it  is  oriented.  Because  of  this  perfect  symmetry,  there 
is  thus  no  way  to  tell  whether  it  is  spinning  or  not,  and  in  fact  we 
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k/The  differing  shapes  of  a 
helium  atom  (1),  a  nitrogen 
molecule  (2),  and  a  difluo- 
roethane  molecule  (3)  have 
surprising  macroscopic  effects. 


find  that  it  can’t  rotate.  The  diatomic  gas,  on  the  other  hand,  can 
rotate  end  over  end  about  the  x  or  y  axis,  but  cannot  rotate  about 
the  z  axis,  which  is  its  axis  of  symmetry.  It  has  a  total  of  five  de¬ 
grees  of  freedom.  A  polyatomic  molecule  with  a  more  complicated, 
asymmetric  shape,  k/3,  can  rotate  about  all  three  axis,  so  it  has  a 
total  of  six  degrees  of  freedom. 

Because  a  polyatomic  molecule  has  more  degrees  of  freedom  than 
a  monoatomic  one,  it  has  more  possible  states  for  a  given  amount  of 
energy.  That  is,  its  entropy  is  higher  for  the  same  energy.  From  the 
definition  of  temperature,  1/T  =  d S/  d E,  we  conclude  that  it  has  a 
lower  temperature  for  the  same  energy.  In  other  words,  it  is  more 
difficult  to  heat  n  molecules  of  difluoroethane  than  it  is  to  heat  n 
atoms  of  helium.  When  the  Clement-Desormes  experiment  is  carried 
out,  the  result  b  therefore  depends  on  the  shape  of  the  molecule! 
Who  would  have  dreamed  that  such  simple  observations,  correctly 
interpreted,  could  give  us  this  kind  of  glimpse  of  the  microcosm? 

Lets  go  ahead  and  calculate  how  this  works.  Suppose  a  gas  is 
allowed  to  expand  without  being  able  to  exchange  heat  with  the 
rest  of  the  universe.  The  loss  of  thermal  energy  from  the  gas  equals 
the  work  it  does  as  it  expands,  and  using  the  result  of  homework 
problem  2  ou  page  338,  the  work  done  in  an  infinitesimal  expansion 
equals  PdV,  so 

dE  +  PdV  =  0. 

(If  the  gas  had  uot  been  insulated,  then  there  would  have  been  a 
third  term  for  the  heat  gained  or  lost  by  heat  conduction.) 

From  section  5.2  we  have  E  =  (3/2 )PV  for  a  monoatomic  ideal 
gas.  More  generally,  the  equipartition  theorem  tells  us  that  the  3 
simply  needs  to  be  replaced  with  the  number  of  degrees  of  freedom  a, 
so  d E  =  (a/2)PdV  +  (a/ 2)V  dP,  and  the  equation  above  becomes 

0  =  °—^PdV  +  —V  dP. 

2  2 
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Rearranging,  we  have 


.  .  dF  dP 

(“  +  2)“^r  -  ~a-j5~- 

Integrating  both  sides  gives 

(a  +  2)  In  V  =  —a  In  P  +  constant, 

and  taking  exponentials  on  both  sides  yields 

Va+2  oc  P~a. 


We  now  wish  to  reexpress  this  in  terms  of  pressure  and  temper¬ 
ature.  Eliminating  V  oc  ( T/P )  gives 

Toe  Pb, 

where  b  =  2 /(a  +  2)  is  equal  to  2/5,  2/7,  or  1/4,  respectively,  for  a 
monoatomic,  diatomic,  or  polyatomic  gas. 

Efficiency  of  the  Carnot  engine  example  21 

As  an  application,  we  now  prove  the  result  claimed  earlier  for  the 
efficiency  of  a  Carnot  engine.  First  consider  the  work  done  dur¬ 
ing  the  constant-temperature  strokes.  Integrating  the  equation 
dW  =  PdV,  we  have  W  =  f  PdV.  Since  the  thermal  energy  of 
an  ideal  gas  depends  only  on  its  temperature,  there  is  no  change 
in  the  thermal  energy  of  the  gas  during  this  constant-temperature 
process.  Conservation  of  energy  therefore  tells  us  that  work  done 
by  the  gas  must  be  exactly  balanced  by  the  amount  of  heat  trans¬ 
ferred  in  from  the  reservoir. 

Q=W 

-SP6V 

For  our  proof  of  the  efficiency  of  the  Carnot  engine,  we  need  only 
the  ratio  of  Qh  to  Q/_,  so  we  neglect  constants  of  proportionality, 
and  simply  subsitutde  P  oc  T/V,  giving 

Q  oc  f  ^dl/oc  T In  oc  T In 
J  V  vq  P2 


The  efficiency  of  a  heat  engine  is 

Q, 

efficiency  =  1  -  — . 

Uh 

Making  use  of  the  result  from  the  previous  proof  for  a  Carnot  en¬ 
gine  with  a  monoatomic  ideal  gas  as  its  working  gas,  we  have 


efficiency  =  1 


Tl  In (P4/P3) 
Th  In (P1/P2)’ 
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where  the  subscripts  1,  2,  3,  and  4  refer  to  figures  d-g  on  page 
314.  We  have  shown  above  that  the  temperature  is  proportional 
to  Pb  on  the  insulated  strokes  2-3  and  4-1 ,  the  pressures  must  be 
related  by  P2/P3  =  P1/P4,  which  can  be  rearranged  as  P4/P3  = 
P1/P2,  and  we  therefore  have 

efficiency  =  1  - 

1 H 


5.4.4  The  arrow  of  time,  or  “this  way  to  the  Big  Bang” 

Now  that  we  have  a  microscopic  understanding  of  entropy,  what 
does  that  tell  us  about  the  second  law  of  thermodynamics?  The 
second  law  defines  a  forward  direction  to  time,  “time’s  arrow.”  The 
microscopic  treatment  of  entropy,  however,  seems  to  have  mysteri¬ 
ously  sidestepped  that  whole  issue.  A  graph  like  figure  b  on  page 
319,  showing  a  fluctuation  away  from  equilibrium,  would  look  just 
as  natural  if  we  flipped  it  over  to  reverse  the  direction  of  time.  Af¬ 
ter  all,  the  basic  laws  of  physics  are  conservation  laws,  which  don’t 
distinguish  between  past  and  future.  Our  present  picture  of  entropy 
suggests  that  we  restate  the  second  law  of  thermodynamics  as  fol¬ 
lows:  low-entropy  states  are  short-lived.  An  ice  cube  can’t  exist 
forever  in  warm  water.  We  no  longer  have  to  distinguish  past  from 
future. 

But  how  do  we  reconcile  this  with  our  strong  psychological  sense 
of  the  direction  of  time,  including  our  ability  to  remember  the  past 
but  not  the  future?  Why  do  we  observe  ice  cubes  melting  in  water, 
but  not  the  time-reversed  version  of  the  same  process? 

The  answer  is  that  there  is  no  past-future  asymmetry  in  the  laws 
of  physics,  but  there  is  a  past-future  asymmetry  in  the  universe.  The 
universe  started  out  with  the  Big  Bang.  (Some  of  the  evidence  for 
the  Big  Bang  theory  is  given  on  page  360.)  The  early  universe  had 
a  very  low  entropy,  and  low-entropy  states  are  short-lived.  What 
does  “short-lived”  mean  here,  however?  Hot  coffee  left  in  a  paper 
cup  will  equilibrate  with  the  air  within  ten  minutes  or  so.  Hot  coffee 
in  a  thermos  bottle  maintains  its  low-entropy  state  for  much  longer, 
because  the  coffee  is  insulated  by  a  vacuum  between  the  inner  and 
outer  walls  of  the  thermos.  The  universe  has  been  mostly  vacuum 
for  a  long  time,  so  it’s  well  insulated.  Also,  it  takes  billions  of  years 
for  a  low-entropy  normal  star  like  our  sun  to  evolve  into  the  high- 
entropy  cinder  known  as  a  white  dwarf. 

The  universe,  then,  is  still  in  the  process  of  equilibrating,  and 
all  the  ways  we  have  of  telling  the  past  from  the  future  are  really 
just  ways  of  determining  which  direction  in  time  points  toward  the 
Big  Bang,  i.e.,  which  direction  points  to  lower  entropy.  The  psy¬ 
chological  arrow  of  time,  for  instance,  is  ultimately  based  on  the 
thermodynamic  arrow.  In  some  general  sense,  your  brain  is  like 
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a  computer,  and  computation  has  thermodynamic  effects.  In  even 
the  most  efficient  possible  computer,  for  example,  erasing  one  bit 
of  memory  decreases  its  entropy  from  /c  In  2  (two  possible  states)  to 
k  hi  1  (one  state),  for  a  drop  of  about  10-23  J/K.  One  way  of  de¬ 
termining  the  direction  of  the  psychological  arrow  of  time  is  that 
forward  in  psychological  time  is  the  direction  in  which,  billions  of 
years  from  now,  all  consciousness  will  have  ceased;  if  consciousness 
was  to  exist  forever  in  the  universe,  then  there  would  have  to  be  a 
never-ending  decrease  in  the  universe’s  entropy.  This  can’t  happen, 
because  low-entropy  states  are  short-lived. 

Relating  the  direction  of  the  thermodynamic  arrow  of  time  to  the 
existence  of  the  Big  Bang  is  a  satisfying  way  to  avoid  the  paradox  of 
how  the  second  law  can  come  from  basic  laws  of  physics  that  don’t 
distinguish  past  from  future.  There  is  a  remaining  mystery,  however: 
why  did  our  universe  have  a  Big  Bang  that  was  low  in  entropy?  It 
could  just  as  easily  have  been  a  maximum-entropy  state,  and  in  fact 
the  number  of  possible  high-entropy  Big  Bangs  is  vastly  greater  than 
the  number  of  possible  low-entropy  ones.  The  question,  however,  is 
probably  not  one  that  can  be  answered  using  the  methods  of  science. 
All  we  can  say  is  that  if  the  universe  had  started  with  a  maximum- 
entropy  Big  Bang,  then  we  wouldn’t  be  here  to  wonder  about  it.  A 
longer,  less  mathematical  discussion  of  these  concepts,  along  with 
some  speculative  ideas,  is  given  in  “The  Cosmic  Origins  of  Time’s 
Arrow,”  Sean  M.  Carroll,  Scientific  American,  June  2008,  p.  48. 

5.4.5  Quantum  mechanics  and  zero  entropy 

The  previous  discussion  would  seem  to  imply  that  absolute  en¬ 
tropies  are  never  well  defined,  since  any  calculation  of  entropy  will 
always  end  up  having  terms  that  depend  on  A p  and  Ax.  For  in¬ 
stance,  we  might  think  that  cooling  an  ideal  gas  to  absolute  zero 
would  give  zero  entropy,  since  there  is  then  only  one  available  mo¬ 
mentum  state,  but  there  would  still  be  many  possible  position  states. 
We’ll  see  later  in  this  book,  however,  that  the  quantum  mechanical 
uncertainty  principle  makes  it  impossible  to  know  the  location  and 
position  of  a  particle  simultaneously  with  perfect  accuracy.  The 
best  we  can  do  is  to  determine  them  with  an  accuracy  such  that 
the  product  ApAx  is  equal  to  a  constant  called  Planck’s  constant. 
According  to  quantum  physics,  then,  there  is  a  natural  minimum 
size  for  rectangles  in  phase  space,  and  entropy  can  be  defined  in 
absolute  terms.  Another  way  of  looking  at  it  is  that  according  to 
quantum  physics,  the  gas  as  a  whole  has  some  well-defined  ground 
state,  which  is  its  state  of  minimum  energy.  When  the  gas  is  cooled 
to  absolute  zero,  the  scene  is  not  at  all  like  what  we  would  picture 
in  classical  physics,  with  a  lot  of  atoms  lying  around  motionless.  It 
might,  for  instance,  be  a  strange  quantum- mechanical  state  called 
the  Bose-Einstein  condensate,  which  was  achieved  for  the  first  time 
recently  with  macroscopic  amounts  of  atoms.  Classically,  the  gas 


330 


Chapter  5  Thermodynamics 


has  many  possible  states  available  to  it  at  zero  temperature,  since 
the  positions  of  the  atoms  can  be  chosen  in  a  variety  of  ways.  The 
classical  picture  is  a  bad  approximation  under  these  circumstances, 
however.  Quantum  mechanically  there  is  only  one  ground  state,  in 
which  each  atom  is  spread  out  over  the  available  volume  in  a  cloud 
of  probability.  The  entropy  is  therefore  zero  at  zero  temperature. 
This  fact,  which  cannot  be  understood  in  terms  of  classical  physics, 
is  known  as  the  third  law  of  thermodynamics. 

5.4.6  Summary  of  the  laws  of  thermodynamics 

Here  is  a  summary  of  the  laws  of  thermodynamics: 


The  zeroth  law  of  thermodynamics  (page  305)  If  object  A  is 
at  the  same  temperature  as  object  B,  and  B  is  at  the  same 
temperature  as  C,  then  A  is  at  the  same  temperature  as  C. 

The  first  law  of  thermodynamics  (page  300)  Energy  is 

conserved. 

The  second  law  of  thermodynamics  (page  317)  The  entropy 
of  a  closed  system  always  increases,  or  at  best  stays  the  same: 
AS  >  0. 

The  third  law  of  thermodynamics  (page  331)  The  entropy  of 
a  system  approaches  zero  as  its  temperature  approaches  abso¬ 
lute  zero. 


From  a  modern  point  of  view,  only  the  first  law  deserves  to  be  called 
a  fundamental  law  of  physics.  Once  Boltmann  discovered  the  mi¬ 
croscopic  nature  of  entropy,  the  zeroth  and  second  laws  could  be 
understood  as  statements  about  probability:  a  system  containing  a 
large  number  of  particles  is  overwhelmingly  likely  to  do  a  certain 
thing,  simply  because  the  number  of  possible  ways  to  do  it  is  ex¬ 
tremely  large  compared  to  the  other  possibilities.  The  third  law 
is  also  now  understood  to  be  a  consequence  of  more  basic  physical 
principles,  but  to  explain  the  third  law,  it’s  not  sufficient  simply  to 
know  that  matter  is  made  of  atoms:  we  also  need  to  understand  the 
quantum-mechanical  nature  of  those  atoms,  discussed  in  chapter  13. 
Historically,  however,  the  laws  of  thermodynamics  were  discovered 
in  the  eighteenth  century,  when  the  atomic  theory  of  matter  was 
generally  considered  to  be  a  hypothesis  that  couldn’t  be  tested  ex¬ 
perimentally.  Ideally,  with  the  publication  of  Boltzmann’s  work  on 
entropy  in  1877,  the  zeroth  and  second  laws  would  have  been  imme¬ 
diately  demoted  from  the  status  of  physical  laws,  and  likewise  the 
development  of  quantum  mechanics  in  the  1920’s  would  have  done 
the  same  for  the  third  law. 
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I  /  The  Otto  cycle.  1 .  In  the  ex¬ 
haust  stroke,  the  piston  expels  the 
burned  air-gas  mixture  left  over 
from  the  preceding  cycle.  2.  In 
the  intake  stroke,  the  piston  sucks 
in  fresh  air-gas  mixture.  3.  In 
the  compression  stroke,  the  pis¬ 
ton  compresses  the  mixture,  and 
heats  it.  4.  At  the  beginning  of 
the  power  stroke,  the  spark  plug 
fires,  causing  the  air-gas  mixture 
to  burn  explosively  and  heat  up 
much  more.  The  heated  mix¬ 
ture  expands,  and  does  a  large 
amount  of  positive  mechanical 
work  on  the  piston.  An  ani¬ 
mated  version  can  be  viewed  in 
the  Wikipedia  article  “Four-stroke 
engine." 
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3 
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5.5  More  About  Heat  Engines 

So  far,  the  only  heat  engine  we’ve  discussed  in  any  detail  has  been 
a  fictitious  Carnot  engine,  with  a  monoatomic  ideal  gas  as  its  work¬ 
ing  gas.  As  a  more  realistic  example,  figure  1  shows  one  full  cycle 
of  a  cylinder  in  a  standard  gas-burning  automobile  engine.  This 
four-stroke  cycle  is  called  the  Otto  cycle,  after  its  inventor,  German 
engineer  Nikolaus  Otto.  The  Otto  cycle  is  more  complicated  than  a 
Carnot  cycle,  in  a  number  of  ways: 


•  The  working  gas  is  physically  pumped  in  and  out  of  the  cylin¬ 
der  through  valves,  rather  than  being  sealed  and  reused  indef¬ 
initely  as  in  the  Carnot  engine. 

•  The  cylinders  are  not  perfectly  insulated  from  the  engine  block, 
so  heat  energy  is  lost  from  each  cylinder  by  conduction.  This 
makes  the  engine  less  efficient  that  a  Carnot  engine,  because 
heat  is  being  discharged  at  a  temperature  that  is  not  as  cool 
as  the  environment. 

•  Rather  than  being  heated  by  contact  with  an  external  heat 
reservoir,  the  air-gas  mixture  inside  each  cylinder  is  heated 
by  internal  combusion:  a  spark  from  a  spark  plug  burns  the 
gasoline,  releasing  heat. 
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•  The  working  gas  is  not  monoatomic.  Air  consists  of  diatomic 
molecules  (N2  and  O2),  and  gasoline  of  polyatomic  molecules 
such  as  octane  (CgHig). 

•  The  working  gas  is  not  ideal.  An  ideal  gas  is  one  in  which 
the  molecules  never  interact  with  one  another,  but  only  with 
the  walls  of  the  vessel,  when  they  collide  with  it.  In  a  car  en¬ 
gine,  the  molecules  are  interacting  very  dramatically  with  one 
another  when  the  air-gas  mixture  explodes  (and  less  dramat¬ 
ically  at  other  times  as  well,  since,  for  example,  the  gasoline 
may  be  in  the  form  of  microscopic  droplets  rather  than  indi¬ 
vidual  molecules). 


This  is  all  extremely  complicated,  and  it  would  be  nice  to  have 
some  way  of  understanding  and  visualizing  the  important  proper¬ 
ties  of  such  a  heat  engine  without  trying  to  handle  every  detail  at 
once.  A  good  method  of  doing  this  is  a  type  of  graph  known  as 
a  P-V  diagram.  As  proved  in  homework  problem  2,  the  equation 
d W  =  F  dx  for  mechanical  work  can  be  rewritten  as  d W  =  P  dV  in 
the  case  of  work  done  by  a  piston.  Here  P  represents  the  pressure 
of  the  working  gas,  and  V  its  volume.  Thus,  on  a  graph  of  P  versus 
V,  the  area  under  the  curve  represents  the  work  done.  When  the 
gas  expands,  dx  is  positive,  and  the  gas  does  positive  work.  When 
the  gas  is  being  compressed,  dx  is  negative,  and  the  gas  does  neg¬ 
ative  work,  i.e.,  it  absorbs  energy.  Notice  how,  in  the  diagram  of 
the  Carnot  engine  in  the  top  panel  of  figure  a,  the  cycle  goes  clock¬ 
wise  around  the  curve,  and  therefore  the  part  of  the  curve  in  which 
negative  work  is  being  done  (arrowheads  pointing  to  the  left)  are 
below  the  ones  in  which  positive  work  is  being  done.  This  means 
that  over  all,  the  engine  does  a  positive  amount  of  work.  This  net 
work  equals  the  area  under  the  top  part  of  the  curve,  minus  the 
area  under  the  bottom  part  of  the  curve,  which  is  simply  the  area 
enclosed  by  the  curve.  Although  the  diagram  for  the  Otto  engine  is 
more  complicated,  we  can  at  least  compare  it  on  the  same  footing 
with  the  Carnot  engine.  The  curve  forms  a  figure-eight,  because  it 
cuts  across  itself.  The  top  loop  goes  clockwise,  so  as  in  the  case 
of  the  Carnot  engine,  it  represents  positive  work.  The  bottom  loop 
goes  counterclockwise,  so  it  represents  a  net  negative  contribution 
to  the  work.  This  is  because  more  work  is  expended  in  forcing  out 
the  exhaust  than  is  generated  in  the  intake  stroke. 

To  make  an  engine  as  efficient  as  possible,  we  would  like  to  make 
the  loop  have  as  much  area  as  possible.  What  is  it  that  determines 
the  actual  shape  of  the  curve?  First  let’s  consider  the  constant- 
temperature  expansion  stroke  that  forms  the  top  of  the  Carnot  en¬ 
gine’s  P-V  plot.  This  is  analogous  to  the  power  stroke  of  an  Otto 
engine.  Heat  is  being  sucked  in  from  the  hot  reservoir,  and  since 
the  working  gas  is  always  in  thermal  equilibrium  with  the  hot  reser¬ 
voir,  its  temperature  is  constant.  Regardless  of  the  type  of  gas,  we 


Carnot  engine 


Otto  cycle 


a  /  P-V  diagrams  for  a  Carnot 
engine  and  an  Otto  engine. 
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therefore  have  PV  =  nkT  with  T  held  constant,  and  thus  P  oc  V -1 
is  the  mathematical  shape  of  this  curve  —  a  y  =  1/x  graph,  which 
is  a  hyperbola.  This  is  all  true  regardless  of  whether  the  working 
gas  is  monoatomic,  diatomic,  or  polyatomic.  (The  bottom  of  the 
loop  is  likewise  of  the  form  P  oc  V -1,  but  with  a  smaller  constant 
of  proportionality  due  to  the  lower  temperature.) 

Now  consider  the  insulated  expansion  stroke  that  forms  the  right 
side  of  the  curve  for  the  Carnot  engine.  As  shown  on  page  327, 
the  relationship  between  pressure  and  temperature  in  an  insulated 
compression  or  expansion  is  T  oc  Pb ,  with  b  =  2/5,  2/7,  or  1/4, 
respectively,  for  a  monoatomic,  diatomic,  or  polyatomic  gas.  For  P 
as  a  function  of  V  at  constant  T,  the  ideal  gas  law  gives  P  oc  T /V, 
so  P  oc  17-7,  where  7  =  1/(1  —  b)  takes  on  the  values  5/3,  7/5,  and 
4/3.  The  number  7  can  be  interpreted  as  the  ratio  Cp/Cy,  where 
Cp ,  the  heat  capacity  at  constant  pressure,  is  the  amount  of  heat 
required  to  raise  the  temperature  of  the  gas  by  one  degree  while 
keeping  its  pressure  constant,  and  Cy  is  the  corresponding  quantity 
under  conditions  of  constant  volume. 

The  compression  ratio  example  22 

Operating  along  a  constant-temperature  stroke,  the  amount  of 
mechanical  work  done  by  a  heat  engine  can  be  calculated  as 
follows: 


PV  =  nkT 

Setting  c  =  nkT  to  simplify  the  writing, 

P  =  cl/"1 

rvf 

W=  /  PdV 

fVf 

=  c  l/-  dV 

Jv, 

=  c  In  Vf  -  c  In  V; 

=  c\x\{Vf/Vj) 

The  ratio  Vf/\ is  called  the  compression  ratio  of  the  engine,  and 
higher  values  result  in  more  power  along  this  stroke.  Along  an 
insulated  stroke,  we  have  P  oc  V~y,  with  y  4  1,  so  the  result  for 
the  work  no  longer  has  this  perfect  mathematical  property  of  de¬ 
pending  only  on  the  ratio  Vf/Vj.  Nevertheless,  the  compression 
ratio  is  still  a  good  figure  of  merit  for  predicting  the  performance  of 
any  heat  engine,  including  an  internal  combustion  engine.  High 
compression  ratios  tend  to  make  the  working  gas  of  an  internal 
combustion  engine  heat  up  so  much  that  it  spontaneously  ex¬ 
plodes.  When  this  happens  in  an  Otto-cycle  engine,  it  can  cause 
ignition  before  the  sparkplug  fires,  an  undesirable  effect  known  as 
pinging.  For  this  reason,  the  compression  ratio  of  an  Otto-cycle 
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automobile  engine  cannot  normally  exceed  about  10.  In  a  diesel 
engine,  however,  this  effect  is  used  intentionally,  as  an  alternative 
to  sparkplugs,  and  compression  ratios  can  be  20  or  more. 


Sound  example  23 

Figure  b  shows  a  P-V  plot  for  a  sound  wave.  As  the  pressure 
oscillates  up  and  down,  the  air  is  heated  and  cooled  by  its  com¬ 
pression  and  expansion.  Pleat  conduction  is  a  relatively  slow  pro¬ 
cess,  so  typically  there  is  not  enough  time  over  each  cycle  for  any 
significant  amount  of  heat  to  flow  from  the  hot  areas  to  the  cold 
areas.  (This  is  analogous  to  insulated  compression  or  expansion 
of  a  heat  engine;  in  general,  a  compression  or  expansion  of  this 
type,  with  no  transfer  of  heat,  is  called  adiabatic.)  The  pressure 
and  volume  of  a  particular  little  piece  of  the  air  are  therefore  re¬ 
lated  according  to  P  oc  V~y .  The  cycle  of  oscillation  consists 
of  motion  back  and  forth  along  a  single  curve  in  the  P-V  plane, 
and  since  this  curve  encloses  zero  volume,  no  mechanical  work 
is  being  done:  the  wave  (under  the  assumed  ideal  conditions) 
propagates  without  any  loss  of  energy  due  to  friction. 

The  speed  of  sound  is  also  related  to  y.  See  example  13  on 
p.  379. 


Measuring  7  using  the  “spring  of  air”  example  24 

Figure  c  shows  an  experiment  that  can  be  used  to  measure  the 
y  of  a  gas.  When  the  mass  m  is  inserted  into  bottle’s  neck, 
which  has  cross-sectional  area  A,  the  mass  drops  until  it  com¬ 
presses  the  air  enough  so  that  the  pressure  is  enough  to  support 
its  weight.  The  observed  frequency  w  of  oscillations  about  this 
equilibrium  position  y0  can  be  used  to  extract  the  y  of  the  gas. 


m 


dP 

”  dy  ro 

A  dP 

Vo 

A2  dP 

“  ~  m  6V  Vo 

We  make  the  bottle  big  enough  so  that  its  large  surface-to-volume 
ratio  prevents  the  conduction  of  any  significant  amount  of  heat 
through  its  walls  during  one  cycle,  so  P  oc  V~y ,  and  dP/d\/  = 
—yP/V.  Thus, 


(JO 
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A2  P0 
y  m  V0 


V 


b  /  Example  23, 


c  /  Example  24. 
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The  Helmholtz  resonator  example  25 

When  you  blow  over  the  top  of  a  beer  bottle,  you  produce  a  pure 
tone.  As  you  drink  more  of  the  beer,  the  pitch  goes  down.  This 
is  similar  to  example  24,  except  that  instead  of  a  solid  mass  m 
sitting  inside  the  neck  of  the  bottle,  the  moving  mass  is  the  air 
itself.  As  air  rushes  in  and  out  of  the  bottle,  its  velocity  is  highest 
at  the  bottleneck,  and  since  kinetic  energy  is  proportional  to  the 
square  of  the  velocity,  essentially  all  of  the  kinetic  energy  is  that 
of  the  air  that’s  in  the  neck.  In  other  words,  we  can  replace  m  with 
ALp,  where  L  is  the  length  of  the  neck,  and  p  is  the  density  of  the 
air.  Substituting  into  the  earlier  result,  we  find  that  the  resonant 
frequency  is 


tu 


2 


Po  a 

yp  LV, o' 


This  is  known  as  a  Helmholtz  resonator.  As  shown  in  figure  d,  a 
violin  or  an  acoustic  guitar  has  a  Helmholtz  resonance,  since  air 
can  move  in  and  out  through  the  f-holes.  Problem  10  is  a  more 
quantitative  exploration  of  this. 


d  /  The  resonance  curve  of  a 
1713  Stradivarius  violin,  mea¬ 
sured  by  Carleen  Hutchins.  There 
are  a  number  of  different  reso¬ 
nance  peaks,  some  strong  and 
some  weak;  the  ones  near  200 
and  400  Hz  are  vibrations  of  the 
wood,  but  the  one  near  300  Hz 
is  a  resonance  of  the  air  moving 
in  and  out  through  those  holes 
shaped  like  the  letter  F.  The  white 
lines  show  the  frequencies  of  the 
four  strings. 


frequency  (Hz) 


We  have  already  seen,  based  on  the  microscopic  nature  of  en¬ 
tropy,  that  any  Carnot  engine  has  the  same  efficiency,  and  the  ar¬ 
gument  only  employed  the  assumption  that  the  engine  met  the  def¬ 
inition  of  a  Carnot  cycle:  two  insulated  strokes,  and  two  constant- 
temperature  strokes.  Since  we  didn’t  have  to  make  any  assumptions 
about  the  nature  of  the  working  gas  being  used,  the  result  is  evi¬ 
dently  true  for  diatomic  or  polyatomic  molecules,  or  for  a  gas  that  is 
not  ideal.  This  result  is  surprisingly  simple  and  general,  and  a  little 
mysterious  —  it  even  applies  to  possibilities  that  we  have  not  even 
considered,  such  as  a  Carnot  engine  designed  so  that  the  working 
“gas”  actually  consists  of  a  mixture  of  liquid  droplets  and  vapor,  as 
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in  a  steam  engine.  How  can  it  always  turn  out  so  simple,  given  the 
kind  of  mathematical  complications  that  were  swept  under  the  rug 
in  example  22?  A  better  way  to  understand  this  result  is  by  switch¬ 
ing  from  P-V  diagrams  to  a  diagram  of  temperature  versus  entropy, 
as  shown  in  figure  e.  An  infinitesimal  transfer  of  heat  d Q  gives  rise 
to  a  change  in  entropy  d S  =  d Q/T,  so  the  area  under  the  curve  on 
a  T-S  plot  gives  the  amount  of  heat  transferred.  The  area  under  the 
top  edge  of  the  box  in  figure  e,  extending  all  the  way  down  to  the 
axis,  represents  the  amount  of  heat  absorbed  from  the  hot  reservoir, 
while  the  smaller  area  under  the  bottom  edge  represents  the  heat 
wasted  into  the  cold  reservoir.  By  conservation  of  energy,  the  area 
enclosed  by  the  box  therefore  represents  the  amount  of  mechanical 
work  being  done,  as  for  a  P-V  diagram.  We  can  now  see  why  the 
efficiency  of  a  Carnot  engine  is  independent  of  any  of  the  physical 
details:  the  definition  of  a  Carnot  engine  guarantees  that  the  T-S 
diagram  will  be  a  rectangular  box,  and  the  efficiency  depends  only 
on  the  relative  heights  of  the  top  and  bottom  of  the  box. 

This  chapter  is  summarized  on  page  995.  Notation  and  terminology 
are  tabulated  on  pages  984-985. 


e  /  A  T-S  diagram  for  a  Carnot 
engine. 
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Problems 


The  symbols  \/,  ^ ,  etc.  are  explained  on  page  341. 

1  (a)  Show  that  under  conditions  of  standard  pressure  and  tem¬ 
perature,  the  volume  of  a  sample  of  an  ideal  gas  depends  only  on 
the  number  of  molecules  in  it. 

(b)  One  mole  is  defined  as  6.0  x  1023  atoms.  Find  the  volume  of  one 
mole  of  an  ideal  gas,  in  units  of  liters,  at  standard  temperature  and 
pressure  (0°C  and  101  kPa).  V 

2  A  gas  in  a  cylinder  expands  its  volume  by  an  amount  dV, 
pushing  out  a  piston.  Show  that  the  work  done  by  the  gas  on  the 
piston  is  given  by  (1W  =  PdV. 

3  (a)  A  helium  atom  contains  2  protons,  2  electrons,  and  2 

neutrons.  Find  the  mass  of  a  helium  atom.  V 

(b)  Find  the  number  of  atoms  in  1.0  kg  of  helium.  V 

(c)  Helium  gas  is  monoatomic.  Find  the  amount  of  heat  needed  to 

raise  the  temperature  of  1.0  kg  of  helium  by  1.0  degree  C.  (This  is 
known  as  helium’s  heat  capacity  at  constant  volume.)  V 

4  A  sample  of  gas  is  enclosed  in  a  sealed  chamber.  The  gas 

consists  of  molecules,  which  are  then  split  in  half  through  some 
process  such  as  exposure  to  ultraviolet  light,  or  passing  an  electric 
spark  through  the  gas.  The  gas  returns  to  thermal  equilibrium  with 
the  surrounding  room.  How  does  its  pressure  now  compare  with  its 
pressure  before  the  molecules  were  split?  - 

5  Most  of  the  atoms  in  the  universe  are  in  the  form  of  gas  that 

is  not  part  of  any  star  or  galaxy:  the  intergalactic  medium  (IGM). 
The  IGM  consists  of  about  10~5  atoms  per  cubic  centimeter,  with 
a  typical  temperature  of  about  103  K.  These  are,  in  some  sense,  the 
density  and  temperature  of  the  universe  (not  counting  light,  or  the 
exotic  particles  known  as  “dark  matter”).  Calculate  the  pressure  of 
the  universe  (or,  speaking  more  carefully,  the  typical  pressure  due 
to  the  IGM).  V  ^ 

6  Estimate  the  pressure  at  the  center  of  the  Earth,  assuming  it 
is  of  constant  density  throughout.  Note  that  g  is  not  constant  with 
respect  to  depth  —  as  shown  in  example  19  on  page  105,  g  equals 
Gmr/b 3  for  r,  the  distance  from  the  center,  less  than  b,  the  earth’s 
radius. 

(a)  State  your  result  in  terms  of  G,  m,  and  b.  V 

(b)  Show  that  your  answer  from  part  a  has  the  right  units  for  pres¬ 
sure. 

(c)  Evaluate  the  result  numerically.  V 

(d)  Given  that  the  earth’s  atmosphere  is  on  the  order  of  one  thou¬ 
sandth  the  earth’s  radius,  and  that  the  density  of  the  earth  is  several 
thousand  times  greater  than  the  density  of  the  lower  atmosphere, 
check  that  your  result  is  of  a  reasonable  order  of  magnitude. 
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7  (a)  Determine  the  ratio  between  the  escape  velocities  from  the 

surfaces  of  the  earth  and  the  moon.  V 

(b)  The  temperature  during  the  lunar  daytime  gets  up  to  about 
130°C.  In  the  extremely  thin  (almost  nonexistent)  lunar  atmosphere, 
estimate  how  the  typical  velocity  of  a  molecule  would  compare  with 
that  of  the  same  type  of  molecule  in  the  earth’s  atmosphere.  As¬ 
sume  that  the  earth’s  atmosphere  has  a  temperature  of  0°C.  v 

(c)  Suppose  you  were  to  go  to  the  moon  and  release  some  fluo¬ 

rocarbon  gas,  with  molecular  formula  CnF2n+2-  Estimate  what  is 
the  smallest  fluorocarbon  molecule  (lowest  n )  whose  typical  velocity 
would  be  lower  than  that  of  an  N2  molecule  on  earth  in  proportion 
to  the  moon’s  lower  escape  velocity.  The  moon  would  be  able  to 
retain  an  atmosphere  made  of  these  molecules.  E  ^ 

8  Refrigerators,  air  conditioners,  and  heat  pumps  are  heat  en¬ 
gines  that  work  in  reverse.  You  put  in  mechanical  work,  and  the 
effect  is  to  take  heat  out  of  a  cooler  reservoir  and  deposit  heat  in  a 
warmer  one:  Ql  +  IE  =  Qh •  As  with  the  heat  engines  discussed  pre¬ 
viously,  the  efficiency  is  defined  as  the  energy  transfer  you  want  (■ Ql 
for  a  refrigerator  or  air  conditioner,  Qh  for  a  heat  pump)  divided 
by  the  energy  transfer  you  pay  for  (IE). 

Efficiencies  are  supposed  to  be  unitless,  but  the  efficiency  of  an 
air  conditioner  is  normally  given  in  terms  of  an  EER  rating  (or 
a  more  complex  version  called  an  SEER).  The  EER  is  defined  as 
Ql/ IE,  but  expressed  in  the  barbaric  units  of  of  Btu/ watt-hour. 
A  typical  EER  rating  for  a  residential  air  conditioner  is  about  10 
Btu/watt-hour,  corresponding  to  an  efficiency  of  about  3.  The  stan¬ 
dard  temperatures  used  for  testing  an  air  conditioner’s  efficiency  are 
80°E  (27°C)  inside  and  95°E  (35°C)  outside. 

(a)  What  would  be  the  EER  rating  of  a  reversed  Carnot  engine  used 

as  an  air  conditioner?  V 

(b)  If  you  ran  a  3-kW  residential  air  conditioner,  with  an  efficiency 

of  3,  for  one  hour,  what  would  be  the  effect  on  the  total  entropy 
of  the  universe?  Is  your  answer  consistent  with  the  second  law  of 
thermodynamics?  V  - 

9  Even  when  resting,  the  human  body  needs  to  do  a  certain 

amount  of  mechanical  work  to  keep  the  heart  beating.  This  quan¬ 
tity  is  difficult  to  define  and  measure  with  high  precision,  and  also 
depends  on  the  individual  and  her  level  of  activity,  but  it’s  estimated 
to  be  about  1  to  5  watts.  Suppose  we  consider  the  human  body  as 
nothing  more  than  a  pump.  A  person  who  is  just  lying  in  bed  all 
day  needs  about  1000  kcal/day  worth  of  food  to  stay  alive,  (a)  Es¬ 
timate  the  person’s  thermodynamic  efficiency  as  a  pump,  and  (b) 
compare  with  the  maximum  possible  efficiency  imposed  by  the  laws 
of  thermodynamics  for  a  heat  engine  operating  across  the  difference 
between  a  body  temperature  of  37°C  and  an  ambient  temperature 
of  22°C.  (c)  Interpret  your  answer.  >  Answer,  p.  970  - 
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10  Example  25  on  page  336  suggests  analyzing  the  resonance 

of  a  violin  at  300  Hz  as  a  Helmholtz  resonance.  However,  we  might 
expect  the  equation  for  the  frequency  of  a  Helmholtz  resonator  to 
be  a  rather  crude  approximation  here,  since  the  f-holes  are  not  long 
tubes,  but  slits  cut  through  the  face  of  the  instrument,  which  is  only 
about  2.5  mm  thick,  (a)  Estimate  the  frequency  that  way  anyway, 
for  a  violin  with  a  volume  of  about  1.6  liters,  and  f-holes  with  a  total 
area  of  10  cm2,  (b)  A  common  rule  of  thumb  is  that  at  an  open  end 
of  an  air  column,  such  as  the  neck  of  a  real  Helmholtz  resonator, 
some  air  beyond  the  mouth  also  vibrates  as  if  it  was  inside  the 
tube,  and  that  this  effect  can  be  taken  into  account  by  adding  0.4 
times  the  diameter  of  the  tube  for  each  open  end  (i.e. ,  0.8  times  the 
diameter  when  both  ends  are  open).  Applying  this  to  the  violin’s 
f-holes  results  in  a  huge  change  in  L,  since  the  ~  7  mm  width  of  the 
f-hole  is  considerably  greater  than  the  thickness  of  the  wood.  Try 
it,  and  see  if  the  result  is  a  better  approximation  to  the  observed 
frequency  of  the  resonance.  t>  Answer,  p.  970  •- 

11  (a)  Atmospheric  pressure  at  sea  level  is  101  kPa.  The  deepest 
spot  in  the  world’s  oceans  is  a  valley  called  the  Challenger  Deep,  in 
the  Marianas  Trench,  with  a  depth  of  11.0  km.  Find  the  pressure 
at  this  depth,  in  units  of  atmospheres.  Although  water  under  this 
amount  of  pressure  does  compress  by  a  few  percent,  assume  for  the 
purposes  of  this  problem  that  it  is  incompressible. 

(b)  Suppose  that  an  air  bubble  is  formed  at  this  depth  and  then 
rises  to  the  surface.  Estimate  the  change  in  its  volume  and  radius. 

>  Solution,  p.  977  - 

12  Our  sun  is  powered  by  nuclear  fusion  reactions,  and  as  a  first 
step  in  these  reactions,  one  proton  must  approach  another  proton  to 
within  a  short  enough  range  r.  This  is  difficult  to  achieve,  because 
the  protons  have  electric  charge  +e  and  therefore  repel  one  another 
electrically.  (It’s  a  good  thing  that  it’s  so  difficult,  because  other¬ 
wise  the  sun  would  use  up  all  of  its  fuel  very  rapidly  and  explode.) 
To  make  fusion  possible,  the  protons  must  be  moving  fast  enough 
to  come  within  the  required  range.  Even  at  the  high  temperatures 
present  in  the  core  of  our  sun,  almost  none  of  the  protons  are  mov¬ 
ing  fast  enough. 

(a)  For  comparison,  the  early  universe,  soon  after  the  Big  Bang, 
had  extremely  high  temperatures.  Estimate  the  temperature  T 
that  would  have  been  required  so  that  protons  with  average  en¬ 
ergies  could  fuse.  State  your  result  in  terms  of  r,  the  mass  m  of  the 
proton,  and  universal  constants. 

(b)  Show  that  the  units  of  your  answer  to  part  a  make  sense. 

(c)  Evaluate  your  result  from  part  a  numerically,  using  r  =  10“ 15  m 
and  m  =  1.7  x  ICE27  kg.  As  a  check,  you  should  find  that  this  is 
much  hotter  than  the  sun’s  core  temperature  of  107  K. 

[>  Solution,  p.  977  - 

13  Object  A  is  a  brick.  Object  B  is  half  of  a  similar  brick.  If  A 
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is  heated,  we  have  AS  =  Q/T.  Show  that  if  this  equation  is  valid 
for  A,  then  it  is  also  valid  for  B.  >  Solution,  p.  977  - 

14  Typically  the  atmosphere  gets  colder  with  increasing  altitude. 
However,  sometimes  there  is  an  inversion  layer,  in  which  this  trend 
is  reversed,  e.g.,  because  a  less  dense  mass  of  warm  air  moves  into  a 
certain  area,  and  rises  above  the  denser  colder  air  that  was  already 
present.  Suppose  that  this  causes  the  pressure  P  as  a  function  of 
height  y  to  be  given  by  a  function  of  the  form  P  =  P0e~ky(  1  +  by), 
where  constant  temperature  would  give  6  =  0  and  an  inversion  layer 
would  give  b  >  0.  (a)  Infer  the  units  of  the  constants  P0,  k,  and  b. 
(b)  Find  the  density  of  the  air  as  a  function  of  y ,  of  the  constants, 
and  of  the  acceleration  of  gravity  g.  (c)  Check  that  the  units  of  your 
answer  to  part  b  make  sense.  >  Solution,  p.  978  - 


Key  to  symbols: 

easy  -  typical  S  challenging  )i  difficult  ■  very  difficult 
V  An  answer  check  is  available  at  www.lightandmatter.com. 
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Chapter  5  Thermodynamics 


The  vibrations  of  this  electric  bass  string  are  converted  to  electrical  vibra¬ 
tions,  then  to  sound  vibrations,  and  finally  to  vibrations  of  our  eardrums. 


Chapter  6 

Waves 


Dandelion.  Cello.  Read  those  two  words,  and  your  brain  instantly 
conjures  a  stream  of  associations,  the  most  prominent  of  which  have 
to  do  with  vibrations.  Our  mental  category  of  “dandelion-ness”  is 
strongly  linked  to  the  color  of  light  waves  that  vibrate  about  half  a 
million  billion  times  a  second:  yellow.  The  velvety  throb  of  a  cello 
has  as  its  most  obvious  characteristic  a  relatively  low  musical  pitch 
—  the  note  you’re  spontaneously  imagining  right  now  might  be  one 
whose  sound  vibrations  repeat  at  a  rate  of  a  hundred  times  a  second. 

Evolution  seems  to  have  designed  our  two  most  important  senses 
around  the  assumption  that  our  environment  is  made  of  waves, 
whereas  up  until  now,  we’ve  mostly  taken  the  view  that  Nature  can 
be  understood  by  breaking  her  down  into  smaller  and  smaller  parts, 
ending  up  with  particles  as  her  most  fundamental  building  blocks. 
Does  that  work  for  light  and  sound?  Sound  waves  are  disturbances 
in  air,  which  is  made  of  atoms,  but  light,  on  the  other  hand,  isn’t  a 
vibration  of  atoms.  Light,  unlike  sound,  can  travel  through  a  vac¬ 
uum:  if  you’re  reading  this  by  sunlight,  you’re  taking  advantage  of 
light  that  had  to  make  it  through  millions  of  miles  of  vacuum  to  get 
to  you.  Waves,  then,  are  not  just  a  trick  that  vibrating  atoms  can 
do.  Waves  are  one  of  the  basic  phenomena  of  the  universe.  At  the 
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end  of  this  book,  we’ll  even  see  that  the  things  we’ve  been  calling 
particles,  such  as  electrons,  are  really  waves!1 


6.1  Free  Waves 


a /Your  finger  makes  a  de¬ 
pression  in  the  surface  of  the 
water,  1.  The  wave  patterns 
starts  evolving,  2,  after  you 
remove  your  finger. 


6.1.1  Wave  motion 

Let’s  start  with  an  intuition-building  exercise  that  deals  with 
waves  in  matter,  since  they’re  easier  than  light  waves  to  get  your 
hands  on.  Put  your  fingertip  in  the  middle  of  a  cup  of  water  and 
then  remove  it  suddenly.  You’ll  have  noticed  two  results  that  are 
surprising  to  most  people.  First,  the  flat  surface  of  the  water  does 
not  simply  sink  uniformly  to  fill  in  the  volume  vacated  by  your 
finger.  Instead,  ripples  spread  out,  and  the  process  of  flattening  out 
occurs  over  a  long  period  of  time,  during  which  the  water  at  the 
center  vibrates  above  and  below  the  normal  water  level.  This  type 
of  wave  motion  is  the  topic  of  the  present  section.  Second,  you’ve 
found  that  the  ripples  bounce  off  of  the  walls  of  the  cup,  in  much 
the  same  way  that  a  ball  would  bounce  off  of  a  wall.  In  the  next 
section  we  discuss  what  happens  to  waves  that  have  a  boundary 
around  them.  Until  then,  we  confine  ourselves  to  wave  phenomena 
that  can  be  analyzed  as  if  the  medium  (e.g.,  the  water)  was  infinite 
and  the  same  everywhere. 

It  isn’t  hard  to  understand  why  removing  your  fingertip  creates 
ripples  rather  than  simply  allowing  the  water  to  sink  back  down 
uniformly.  The  initial  crater,  a/1,  left  behind  by  your  finger  has 
sloping  sides,  and  the  water  next  to  the  crater  flows  downhill  to  fill 
in  the  hole.  The  water  far  away,  on  the  other  hand,  initially  has 
no  way  of  knowing  what  has  happened,  because  there  is  no  slope 
for  it  to  flow  down.  As  the  hole  fills  up,  the  rising  water  at  the 
center  gains  upward  momentum,  and  overshoots,  creating  a  little 
hill  where  there  had  been  a  hole  originally.  The  area  just  outside  of 
this  region  has  been  robbed  of  some  of  its  water  in  order  to  build 
the  hill,  so  a  depressed  “moat”  is  formed,  a/2.  This  effect  cascades 
outward,  producing  ripples. 

There  are  three  main  ways  in  which  wave  motion  differs  from 
the  motion  of  objects  made  of  matter. 


1.  Superposition 

If  you  watched  the  water  in  the  cup  carefully,  you  noticed  the 
ghostlike  behavior  of  the  reflected  ripples  coming  back  toward  the 
center  of  the  cup  and  the  outgoing  ripples  that  hadn’t  yet  been  re¬ 
flected:  they  passed  right  through  each  other.  This  is  the  first,  and 
the  most  profound,  difference  between  wave  motion  and  the  rno- 

1  Speaking  more  carefully,  I  should  say  that  every  basic  building  block  of  light 
and  matter  has  both  wave  and  particle  properties. 
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Waves 


b  /  The  two  circular  patterns  of 
ripples  pass  through  each  other. 
Unlike  material  objects,  wave  pat¬ 
terns  can  overlap  in  space,  and 
when  this  happens  they  combine 
by  addition. 


tion  of  objects:  waves  do  not  display  any  repulsion  of  each  other 
analogous  to  the  normal  forces  between  objects  that  come  in  con¬ 
tact.  Two  wave  patterns  can  therefore  overlap  in  the  same  region  of 
space,  as  shown  in  figure  b.  Where  the  two  waves  coincide,  they  add 
together.  For  instance,  suppose  that  at  a  certain  location  in  at  a  cer¬ 
tain  moment  in  time,  each  wave  would  have  had  a  crest  3  cm  above 
the  normal  water  level.  The  waves  combine  at  this  point  to  make  a 
6-cm  crest.  We  use  negative  numbers  to  represent  depressions  in  the 
water.  If  both  waves  would  have  had  a  troughs  measuring  —3  cm, 
then  they  combine  to  make  an  extra-deep  —6  cm  trough.  A  +3  cm 
crest  and  a  —3  cm  trough  result  in  a  height  of  zero,  i.e.,  the  waves 
momentarily  cancel  each  other  out  at  that  point.  This  additive  rule 
is  referred  to  as  the  principle  of  superposition,  “superposition”  being 
merely  a  fancy  word  for  “adding.” 

Superposition  can  occur  not  just  with  sinusoidal  waves  like  the 
ones  in  the  figure  above  but  with  waves  of  any  shape.  The  figures 
on  the  following  page  show  superposition  of  wave  pulses.  A  pulse  is 
simply  a  wave  of  very  short  duration.  These  pulses  consist  only  of 
a  single  hump  or  trough.  If  you  hit  a  clothesline  sharply,  you  will 
observe  pulses  heading  off  in  both  directions.  This  is  analogous  to 
the  way  ripples  spread  out  in  all  directions  when  you  make  a  distur¬ 
bance  at  one  point  on  water.  The  same  occurs  when  the  hammer 
on  a  piano  comes  up  and  hits  a  string. 

Experiments  to  date  have  not  shown  any  deviation  from  the 
principle  of  superposition  in  the  case  of  light  waves.  For  other  types 
of  waves,  it  is  typically  a  very  good  approximation  for  low-energy 
waves. 
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d  /  As  the  wave  pulse  goes 
by,  the  ribbon  tied  to  the  spring 
is  not  carried  along.  The  motion 
of  the  wave  pattern  is  to  the 
right,  but  the  medium  (spring)  is 
moving  from  side  to  side,  not  to 
the  right.  ( PSSC  Physics) 


c  /  As  the  wave  pattern  passes  the  rubber  duck,  the  duck  stays 
put.  The  water  isn’t  moving  with  the  wave. 


2.  The  medium  is  not  transported  with  the  wave. 

The  sequence  of  three  photos  in  figure  c  shows  a  series  of  water 
waves  before  it  has  reached  a  rubber  duck  (left),  having  just  passed 
the  duck  (middle)  and  having  progressed  about  a  meter  beyond  the 
duck  (right).  The  duck  bobs  around  its  initial  position,  but  is  not 
carried  along  with  the  wave.  This  shows  that  the  water  itself  does 
not  flow  outward  with  the  wave.  If  it  did,  we  could  empty  one  end  of 
a  swimming  pool  simply  by  kicking  up  waves!  We  must  distinguish 
between  the  motion  of  the  medium  (water  in  this  case)  and  the 
motion  of  the  wave  pattern  through  the  medium.  The  medium 
vibrates;  the  wave  progresses  through  space. 

self-check  A 

In  figure  d,  you  can  detect  the  side-to-side  motion  of  the  spring  because 
the  spring  appears  blurry.  At  a  certain  instant,  represented  by  a  single 
photo,  how  would  you  describe  the  motion  of  the  different  parts  of  the 
spring?  Other  than  the  flat  parts,  do  any  parts  of  the  spring  have  zero 
velocity?  >  Answer,  p.  964 

A  worm  example  1 

The  worm  in  the  figure  is  moving  to  the  right.  The  wave  pattern, 
a  pulse  consisting  of  a  compressed  area  of  its  body,  moves  to 
the  left.  In  other  words,  the  motion  of  the  wave  pattern  is  in  the 
opposite  direction  compared  to  the  motion  of  the  medium. 


Surfing 


example  2 
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The  incorrect  belief  that  the  medium  moves  with  the  wave  is  often 
reinforced  by  garbled  secondhand  knowledge  of  surfing.  Anyone 
who  has  actually  surfed  knows  that  the  front  of  the  board  pushes 
the  water  to  the  sides,  creating  a  wake  —  the  surfer  can  even 
drag  his  hand  through  the  water,  as  in  in  figure  e.  If  the  water  was 
moving  along  with  the  wave  and  the  surfer,  this  wouldn’t  happen. 
The  surfer  is  carried  forward  because  forward  is  downhill,  not  be¬ 
cause  of  any  forward  flow  of  the  water.  If  the  water  was  flowing 
forward,  then  a  person  floating  in  the  water  up  to  her  neck  would 
be  carried  along  just  as  quickly  as  someone  on  a  surfboard.  In 
fact,  it  is  even  possible  to  surf  down  the  back  side  of  a  wave,  al¬ 
though  the  ride  wouldn’t  last  very  long  because  the  surfer  and  the 
wave  would  quickly  part  company. 


3.  A  wave’s  velocity  depends  on  the  medium. 

A  material  object  can  move  with  any  velocity,  and  can  be  sped 
up  or  slowed  down  by  a  force  that  increases  or  decreases  its  kinetic 
energy.  Not  so  with  waves.  The  speed  of  a  wave,  depends  on  the 
properties  of  the  medium  (and  perhaps  also  on  the  shape  of  the 
wave,  for  certain  types  of  waves).  Sound  waves  travel  at  about  340 
m/s  in  air,  1000  m/s  in  helium.  If  you  kick  up  water  waves  in  a 
pool,  you  will  find  that  kicking  harder  makes  waves  that  are  taller 
(and  therefore  carry  more  energy),  not  faster.  The  sound  waves 
from  an  exploding  stick  of  dynamite  carry  a  lot  of  energy,  but  are 
no  faster  than  any  other  waves.  In  the  following  section  we  will  give 
an  example  of  the  physical  relationship  between  the  wave  speed  and 
the  properties  of  the  medium. 

Breaking  waves  example  3 

The  velocity  of  water  waves  increases  with  depth.  The  crest  of  a 
wave  travels  faster  than  the  trough,  and  this  can  cause  the  wave 
to  break. 

Once  a  wave  is  created,  the  only  reason  its  speed  will  change  is 
if  it  enters  a  different  medium  or  if  the  properties  of  the  medium 
change.  It  is  not  so  surprising  that  a  change  in  medium  can  slow 
down  a  wave,  but  the  reverse  can  also  happen.  A  sound  wave  trav¬ 
eling  through  a  helium  balloon  will  slow  down  when  it  emerges  into 
the  air,  but  if  it  enters  another  balloon  it  will  speed  back  up  again! 
Similarly,  water  waves  travel  more  quickly  over  deeper  water,  so  a 
wave  will  slow  down  as  it  passes  over  an  underwater  ridge,  but  speed 
up  again  as  it  emerges  into  deeper  water. 

Hull  speed  example  4 

The  speeds  of  most  boats,  and  of  some  surface-swimming  ani¬ 
mals,  are  limited  by  the  fact  that  they  make  a  wave  due  to  their 
motion  through  the  water.  The  boat  in  figure  g  is  going  at  the 
same  speed  as  its  own  waves,  and  can’t  go  any  faster.  No  mat¬ 
ter  how  hard  the  boat  pushes  against  the  water,  it  can’t  make 


e  /  Example  2.  The  surfer  is 
dragging  his  hand  in  the  water. 


f/ Example  3:  a  breaking 

wave. 


g  /  Example  4.  The  boat  has 
run  up  against  a  limit  on  its  speed 
because  it  can’t  climb  over  its 
own  wave.  Dolphins  get  around 
the  problem  by  leaping  out  of  the 
water. 
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h  /  Circular  and  linear  wave 
patterns. 


i  /  Plane  and  spherical  wave 
patterns. 


the  wave  move  ahead  faster  and  get  out  of  the  way.  The  wave’s 
speed  depends  only  on  the  medium.  Adding  energy  to  the  wave 
doesn’t  speed  it  up,  it  just  increases  its  amplitude. 

A  water  wave,  unlike  many  other  types  of  wave,  has  a  speed  that 
depends  on  its  shape:  a  broader  wave  moves  faster.  The  shape 
of  the  wave  made  by  a  boat  tends  to  mold  itself  to  the  shape  of 
the  boat’s  hull,  so  a  boat  with  a  longer  hull  makes  a  broader  wave 
that  moves  faster.  The  maximum  speed  of  a  boat  whose  speed  is 
limited  by  this  effect  is  therefore  closely  related  to  the  length  of  its 
hull,  and  the  maximum  speed  is  called  the  hull  speed.  Sailboats 
designed  for  racing  are  not  just  long  and  skinny  to  make  them 
more  streamlined  —  they  are  also  long  so  that  their  hull  speeds 
will  be  high. 

Wave  patterns 

If  the  magnitude  of  a  wave’s  velocity  vector  is  preordained,  what 
about  its  direction?  Waves  spread  out  in  all  directions  from  every 
point  on  the  disturbance  that  created  them.  If  the  disturbance  is 
small,  we  may  consider  it  as  a  single  point,  and  in  the  case  of  water 
waves  the  resulting  wave  pattern  is  the  familiar  circular  ripple,  h/1. 
If,  on  the  other  hand,  we  lay  a  pole  on  the  surface  of  the  water 
and  wiggle  it  up  and  down,  we  create  a  linear  wave  pattern,  h/2. 
For  a  three-dimensional  wave  such  as  a  sound  wave,  the  analogous 
patterns  would  be  spherical  waves  and  plane  waves,  i. 

Infinitely  many  patterns  are  possible,  but  linear  or  plane  waves 
are  often  the  simplest  to  analyze,  because  the  velocity  vector  is  in 
the  same  direction  no  matter  what  part  of  the  wave  we  look  at.  Since 
all  the  velocity  vectors  are  parallel  to  one  another,  the  problem  is 
effectively  one-dimensional.  Throughout  this  chapter  and  the  next, 
we  will  restrict  ourselves  mainly  to  wave  motion  in  one  dimension, 
while  not  hesitating  to  broaden  our  horizons  when  it  can  be  done 
without  too  much  complication. 
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Discussion  Questions 


A  The  left  panel  of  the  figure  shows  a  sequence  of  snapshots  of  two 
positive  pulses  on  a  coil  spring  as  they  move  through  each  other.  In  the 
right  panel,  which  shows  a  positive  pulse  and  a  negative  one,  the  fifth 
frame  has  the  spring  just  about  perfectly  flat.  If  the  two  pulses  have  es¬ 
sentially  canceled  each  other  out  perfectly,  then  why  does  the  motion  pick 
up  again?  Why  doesn’t  the  spring  just  stay  flat? 


j  /  Discussion  question  A. 


B  Sketch  two  positive  wave  pulses  on  a  string  that  are  overlapping  but 
not  right  on  top  of  each  other,  and  draw  their  superposition.  Do  the  same 
for  a  positive  pulse  running  into  a  negative  pulse. 

C  A  traveling  wave  pulse  is  moving  to  the  right  on  a  string.  Sketch  the 
velocity  vectors  of  the  various  parts  of  the  string.  Now  do  the  same  for  a 
pulse  moving  to  the  left. 

D  In  a  spherical  sound  wave  spreading  out  from  a  point,  how  would 
the  energy  of  the  wave  fall  off  with  distance? 
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k  /  Hitting  a  key  on  a  piano 
causes  a  hammer  to  come  up 
from  underneath  and  hit  a  string 
(actually  a  set  of  three).  The 
result  is  a  pair  of  pulses  moving 
away  from  the  point  of  impact. 


I/A  pulse  on  a  string  splits 
in  two  and  heads  off  in  both 
directions. 


m  /  Modeling  a  string  as  a 
series  of  masses  connected  by 
springs. 


6.1.2  Waves  on  a  string 

So  far  you’ve  learned  some  counterintuitive  things  about  the 
behavior  of  waves,  but  intuition  can  be  trained.  The  first  half  of 
this  subsection  aims  to  build  your  intuition  by  investigating  a  simple, 
one-dimensional  type  of  wave:  a  wave  on  a  string.  If  you  have  ever 
stretched  a  string  between  the  bottoms  of  two  open-mouthed  cans  to 
talk  to  a  friend,  you  were  putting  this  type  of  wave  to  work.  Stringed 
instruments  are  another  good  example.  Although  we  usually  think 
of  a  piano  wire  simply  as  vibrating,  the  hammer  actually  strikes 
it  quickly  and  makes  a  dent  in  it,  which  then  ripples  out  in  both 
directions.  Since  this  chapter  is  about  free  waves,  not  bounded  ones, 
we  pretend  that  our  string  is  infinitely  long. 

After  the  qualitative  discussion,  we  will  use  simple  approxima¬ 
tions  to  investigate  the  speed  of  a  wave  pulse  on  a  string.  This  quick 
and  dirty  treatment  is  then  followed  by  a  rigorous  attack  using  the 
methods  of  calculus,  which  turns  out  to  be  both  simpler  and  more 
general. 

Intuitive  ideas 

Consider  a  string  that  has  been  struck,  1/1,  resulting  in  the  cre¬ 
ation  of  two  wave  pulses,  1/2,  one  traveling  to  the  left  and  one  to 
the  right.  This  is  analogous  to  the  way  ripples  spread  out  in  all 
directions  from  a  splash  in  water,  but  on  a  one-dimensional  string, 
“all  directions”  becomes  “both  directions.” 

We  can  gain  insight  by  modeling  the  string  as  a  series  of  masses 
connected  by  springs,  m.  (In  the  actual  string  the  mass  and  the 
springiness  are  both  contributed  by  the  molecules  themselves.)  If 
we  look  at  various  microscopic  portions  of  the  string,  there  will  be 
some  areas  that  are  flat,  1,  some  that  are  sloping  but  not  curved,  2, 
and  some  that  are  curved,  3  and  4.  In  example  1  it  is  clear  that  both 
the  forces  on  the  central  mass  cancel  out,  so  it  will  not  accelerate. 
The  same  is  true  of  2,  however.  Only  in  curved  regions  such  as  3 
and  4  is  an  acceleration  produced.  In  these  examples,  the  vector 
sum  of  the  two  forces  acting  on  the  central  mass  is  not  zero.  The 
important  concept  is  that  curvature  makes  force:  the  curved  areas  of 
a  wave  tend  to  experience  forces  resulting  in  an  acceleration  toward 
the  mouth  of  the  curve.  Note,  however,  that  an  uncurved  portion 
of  the  string  need  not  remain  motionless.  It  may  move  at  constant 
velocity  to  either  side. 

Approximate  treatment 

We  now  carry  out  an  approximate  treatment  of  the  speed  at 
which  two  pulses  will  spread  out  from  an  initial  indentation  on  a 
string.  For  simplicity,  we  imagine  a  hammer  blow  that  creates  a  tri¬ 
angular  dent,  n/1.  We  will  estimate  the  amount  of  time,  t,  required 
until  each  of  the  pulses  has  traveled  a  distance  equal  to  the  width 
of  the  pulse  itself.  The  velocity  of  the  pulses  is  then  ±w/t. 
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As  always,  the  velocity  of  a  wave  depends  on  the  properties  of 
the  medium,  in  this  case  the  string.  The  properties  of  the  string  can 
be  summarized  by  two  variables:  the  tension,  T,  and  the  mass  per 
unit  length,  /j,  (Greek  letter  mu). 

If  we  consider  the  part  of  the  string  encompassed  by  the  initial 
dent  as  a  single  object,  then  this  object  has  a  mass  of  approxi¬ 
mately  [iw  (mass/length  x  length=mass) .  (Here,  and  throughout 
the  derivation,  we  assume  that  h  is  much  less  than  w,  so  that  we  can 
ignore  the  fact  that  this  segment  of  the  string  has  a  length  slightly 
greater  than  w.)  Although  the  downward  acceleration  of  this  seg¬ 
ment  of  the  string  will  be  neither  constant  over  time  nor  uniform 
across  the  pulse,  we  will  pretend  that  it  is  constant  for  the  sake  of 
our  simple  estimate.  Roughly  speaking,  the  time  interval  between 
n/1  and  n/2  is  the  amount  of  time  required  for  the  initial  dent  to  ac¬ 
celerate  from  rest  and  reach  its  normal,  flattened  position.  Of  course 
the  tip  of  the  triangle  has  a  longer  distance  to  travel  than  the  edges, 
but  again  we  ignore  the  complications  and  simply  assume  that  the 
segment  as  a  whole  must  travel  a  distance  h.  Indeed,  it  might  seem 
surprising  that  the  triangle  would  so  neatly  spring  back  to  a  per¬ 
fectly  flat  shape.  It  is  an  experimental  fact  that  it  does,  but  our 
analysis  is  too  crude  to  address  such  details. 

The  string  is  kinked,  i.e.,  tightly  curved,  at  the  edges  of  the 
triangle,  so  it  is  here  that  there  will  be  large  forces  that  do  not 
cancel  out  to  zero.  There  are  two  forces  acting  on  the  triangular 
hump,  one  of  magnitude  T  acting  down  and  to  the  right,  and  one 
of  the  same  magnitude  acting  down  and  to  the  left.  If  the  angle 
of  the  sloping  sides  is  9,  then  the  total  force  on  the  segment  equals 
2 T  sin0.  Dividing  the  triangle  into  two  right  triangles,  we  see  that 
sin  0  equals  h  divided  by  the  length  of  one  of  the  sloping  sides.  Since 
h  is  much  less  than  w,  the  length  of  the  sloping  side  is  essentially 
the  same  as  w/2,  so  we  have  sin0  =  2 h/w,  and  F  =  4Th/w.  The 
acceleration  of  the  segment  (actually  the  acceleration  of  its  center 
of  mass)  is 


F 

a  =  — 
m 

ATh 


Liw2 


The  time  required  to  move  a  distance  h  under  constant  acceleration 
a  is  found  by  solving  h  =  (1/2 )at2  to  yield 


t  =  \J  2  hj  a 


n  /  A  triangular  pulse  spreads  out. 
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Our  final  result  for  the  speed  of  the  pulses  is 


v  =  w/t 


The  remarkable  feature  of  this  result  is  that  the  velocity  of  the 
pulses  does  not  depend  at  all  on  w  or  h,  i.e.,  any  triangular  pulse 
has  the  same  speed.  It  is  an  experimental  fact  (and  we  will  also 
prove  rigorously  below)  that  any  pulse  of  any  kind,  triangular  or 
otherwise,  travels  along  the  string  at  the  same  speed.  Of  course, 
after  so  many  approximations  we  cannot  expect  to  have  gotten  all 
the  numerical  factors  right.  The  correct  result  for  the  speed  of  the 
pulses  is 


The  importance  of  the  above  derivation  lies  in  the  insight  it 
brings  — that  all  pulses  move  with  the  same  speed  —  rather  than  in 
the  details  of  the  numerical  result.  The  reason  for  our  too- high  value 
for  the  velocity  is  not  hard  to  guess.  It  comes  from  the  assumption 
that  the  acceleration  was  constant,  when  actually  the  total  force  on 
the  segment  would  diminish  as  it  flattened  out. 


Treatment  using  calculus 


After  expending  considerable  effort  for  an  approximate  solution, 
we  now  display  the  power  of  calculus  with  a  rigorous  and  completely 
general  treatment  that  is  nevertheless  much  shorter  and  easier.  Let 
the  flat  position  of  the  string  define  the  x  axis,  so  that  y  measures 
how  far  a  point  on  the  string  is  from  equilibrium.  The  motion  of 
the  string  is  characterized  by  y(x,t),  a  function  of  two  variables. 
Knowing  that  the  force  on  any  small  segment  of  string  depends 
on  the  curvature  of  the  string  in  that  area,  and  that  the  second 
derivative  is  a  measure  of  curvature,  it  is  not  surprising  to  find  that 
the  infinitesimal  force  d F  acting  on  an  infinitesimal  segment  dx  is 
given  by 


d2y 

dF  =  T  — |  dx. 
oxz 

(This  can  be  proved  by  vector  addition  of  the  two  infinitesimal  forces 
acting  on  either  side.)  The  symbol  d  stands  for  a  partial  derivative, 
e.g.,  d/dx  means  a  derivative  with  respect  to  x  that  is  evaluated 
while  treating  t  as  a  constant.  The  acceleration  is  then  a  =  d F/  dm, 
or,  substituting  dm  =  ydx, 


d2y  _  T  d2y 
dt2  n  dx 2 


The  second  derivative  with  respect  to  time  is  related  to  the  second 
derivative  with  respect  to  position.  This  is  no  more  than  a  fancy 
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mathematical  statement  of  the  intuitive  fact  developed  above,  that 
the  string  accelerates  so  as  to  flatten  out  its  curves. 

Before  even  bothering  to  look  for  solutions  to  this  equation,  we 
note  that  it  already  proves  the  principle  of  superposition,  because 
the  derivative  of  a  sum  is  the  sum  of  the  derivatives.  Therefore  the 
sum  of  any  two  solutions  will  also  be  a  solution. 

Based  on  experiment,  we  expect  that  this  equation  will  be  sat¬ 
isfied  by  any  function  y(x,  t )  that  describes  a  pulse  or  wave  pattern 
moving  to  the  left  or  right  at  the  correct  speed  v.  In  general,  such 
a  function  will  be  of  the  form  y  =  f(x  —  vt )  or  y  =  f(x  +  vt),  where 
/  is  any  function  of  one  variable.  Because  of  the  chain  rule,  each 
derivative  with  respect  to  time  brings  out  a  factor  of  v.  Evaluating 
the  second  derivatives  on  both  sides  of  the  equation  gives 

(±«)2  /"  =  -  /". 

A4 

Squaring  gets  rid  of  the  sign,  and  we  find  that  we  have  a  valid 
solution  for  any  function  /,  provided  that  v  is  given  by 


6.1.3  Sound  and  light  waves 

Sound  waves 

The  phenomenon  of  sound  is  easily  found  to  have  all  the  char¬ 
acteristics  we  expect  from  a  wave  phenomenon: 

-  Sound  waves  obey  superposition.  Sounds  do  not  knock  other 
sounds  out  of  the  way  when  they  collide,  and  we  can  hear  more  than 
one  sound  at  once  if  they  both  reach  our  ear  simultaneously. 

-  The  medium  does  not  move  with  the  sound.  Even  standing  in 
front  of  a  titanic  speaker  playing  earsplitting  music,  we  do  not  feel 
the  slightest  breeze. 

-  The  velocity  of  sound  depends  on  the  medium.  Sound  travels 
faster  in  helium  than  in  air,  and  faster  in  water  than  in  helium. 
Putting  more  energy  into  the  wave  makes  it  more  intense,  not  faster. 
For  example,  you  can  easily  detect  an  echo  when  you  clap  your  hands 
a  short  distance  from  a  large,  flat  wall,  and  the  delay  of  the  echo  is 
no  shorter  for  a  louder  clap. 

Although  not  all  waves  have  a  speed  that  is  independent  of  the 
shape  of  the  wave,  and  this  property  therefore  is  irrelevant  to  our 
collection  of  evidence  that  sound  is  a  wave  phenomenon,  sound  does 
nevertheless  have  this  property.  For  instance,  the  music  in  a  large 
concert  hall  or  stadium  may  take  on  the  order  of  a  second  to  reach 
someone  seated  in  the  nosebleed  section,  but  we  do  not  notice  or 
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care,  because  the  delay  is  the  same  for  every  sound.  Bass,  drums, 
and  vocals  all  head  outward  from  the  stage  at  340  m/s,  regardless  of 
their  differing  wave  shapes.  (The  speed  of  sound  in  a  gas  is  related 
to  the  gas’s  physical  properties  in  example  13  on  p.  379.) 

If  sound  has  all  the  properties  we  expect  from  a  wave,  then  what 
type  of  wave  is  it?  It  is  a  series  of  compressions  and  expansions  of  the 
air.  Even  for  a  very  loud  sound,  the  increase  or  decrease  compared 
to  normal  atmospheric  pressure  is  no  more  than  a  part  per  million, 
so  our  ears  are  apparently  very  sensitive  instruments.  In  a  vacuum, 
there  is  no  medium  for  the  sound  waves,  and  so  they  cannot  exist. 
The  roars  and  whooshes  of  space  ships  in  Hollywood  movies  are  fun, 
but  scientifically  wrong. 

Light  waves 

Entirely  similar  observations  lead  us  to  believe  that  light  is  a 
wave,  although  the  concept  of  light  as  a  wave  had  a  long  and  tortu¬ 
ous  history.  It  is  interesting  to  note  that  Isaac  Newton  very  influen¬ 
tially  advocated  a  contrary  idea  about  light.  The  belief  that  matter 
was  made  of  atoms  was  stylish  at  the  time  among  radical  thinkers 
(although  there  was  no  experimental  evidence  for  their  existence), 
and  it  seemed  logical  to  Newton  that  light  as  well  should  be  made  of 
tiny  particles,  which  he  called  corpuscles  (Latin  for  “small  objects”). 
Newton’s  triumphs  in  the  science  of  mechanics,  i.e.,  the  study  of 
matter,  brought  him  such  great  prestige  that  nobody  bothered  to 
question  his  incorrect  theory  of  light  for  150  years.  One  persua¬ 
sive  proof  that  light  is  a  wave  is  that  according  to  Newton’s  theory, 
two  intersecting  beams  of  light  should  experience  at  least  some  dis¬ 
ruption  because  of  collisions  between  their  corpuscles.  Even  if  the 
corpuscles  were  extremely  small,  and  collisions  therefore  very  infre¬ 
quent,  at  least  some  dimming  should  have  been  measurable.  In  fact, 
very  delicate  experiments  have  shown  that  there  is  no  dimming. 

The  wave  theory  of  light  was  entirely  successful  up  until  the  20th 
century,  when  it  was  discovered  that  not  all  the  phenomena  of  light 
could  be  explained  with  a  pure  wave  theory.  It  is  now  believed  that 
both  light  and  matter  are  made  out  of  tiny  chunks  which  have  both 
wave  and  particle  properties.  For  now,  we  will  content  ourselves 
with  the  wave  theory  of  light,  which  is  capable  of  explaining  a  great 
many  things,  from  cameras  to  rainbows. 

If  light  is  a  wave,  what  is  waving?  What  is  the  medium  that 
wiggles  when  a  light  wave  goes  by?  It  isn’t  air.  A  vacuum  is  impen¬ 
etrable  to  sound,  but  light  from  the  stars  travels  happily  through 
zillions  of  miles  of  empty  space.  Light  bulbs  have  no  air  inside  them, 
but  that  doesn’t  prevent  the  light  waves  from  leaving  the  filament. 
For  a  long  time,  physicists  assumed  that  there  must  be  a  mysteri¬ 
ous  medium  for  light  waves,  and  they  called  it  the  ether  (not  to  be 
confused  with  the  chemical).  Supposedly  the  ether  existed  every¬ 
where  in  space,  and  was  immune  to  vacuum  pumps.  The  details  of 
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the  story  are  more  fittingly  reserved  for  later  in  this  course,  but  the 
end  result  was  that  a  long  series  of  experiments  failed  to  detect  any 
evidence  for  the  ether,  and  it  is  no  longer  believed  to  exist.  Instead, 
light  can  be  explained  as  a  wave  pattern  made  up  of  electrical  and 
magnetic  fields. 

6.1.4  Periodic  waves 

Period  and  frequency  of  a  periodic  wave 

You  choose  a  radio  station  by  selecting  a  certain  frequency.  We 
have  already  defined  period  and  frequency  for  vibrations, 

T  =  period  =  seconds  per  cycle 

/  =  frequency  =  1/T  =  cycles  per  second 

u  =  angular  frequency  =  2-irf  =  radians  per  second 

but  what  do  they  signify  in  the  case  of  a  wave?  We  can  recycle  our 
previous  definition  simply  by  stating  it  in  terms  of  the  vibrations 
that  the  wave  causes  as  it  passes  a  receiving  instrument  at  a  certain 
point  in  space.  For  a  sound  wave,  this  receiver  could  be  an  eardrum 
or  a  microphone.  If  the  vibrations  of  the  eardrum  repeat  themselves 
over  and  over,  i.e. ,  are  periodic,  then  we  describe  the  sound  wave 
that  caused  them  as  periodic.  Likewise  we  can  define  the  period 
and  frequency  of  a  wave  in  terms  of  the  period  and  frequency  of 
the  vibrations  it  causes.  As  another  example,  a  periodic  water  wave 
would  be  one  that  caused  a  rubber  duck  to  bob  in  a  periodic  manner 
as  they  passed  by  it. 

The  period  of  a  sound  wave  correlates  with  our  sensory  impres¬ 
sion  of  musical  pitch.  A  high  frequency  (short  period)  is  a  high  note. 
The  sounds  that  really  define  the  musical  notes  of  a  song  are  only 
the  ones  that  are  periodic.  It  is  not  possible  to  sing  a  nonperiodic 
sound  like  “sh”  with  a  definite  pitch. 

The  frequency  of  a  light  wave  corresponds  to  color.  Violet  is  the 
high-frequency  end  of  the  rainbow,  red  the  low-frequency  end.  A 
color  like  brown  that  does  not  occur  in  a  rainbow  is  not  a  periodic 
light  wave.  Many  phenomena  that  we  do  not  normally  think  of  as 
light  are  actually  just  forms  of  light  that  are  invisible  because  they 
fall  outside  the  range  of  frequencies  our  eyes  can  detect.  Beyond  the 
red  end  of  the  visible  rainbow,  there  are  infrared  and  radio  waves. 
Past  the  violet  end,  we  have  ultraviolet,  x-rays,  and  gamma  rays. 

Graphs  of  waves  as  a  function  of  position 

Some  waves,  like  sound  waves,  are  easy  to  study  by  placing  a 
detector  at  a  certain  location  in  space  and  studying  the  motion  as 
a  function  of  time.  The  result  is  a  graph  whose  horizontal  axis  is 
time.  With  a  water  wave,  on  the  other  hand,  it  is  simpler  just  to 
look  at  the  wave  directly.  This  visual  snapshot  amounts  to  a  graph 
of  the  height  of  the  water  wave  as  a  function  of  position.  Any  wave 
can  be  represented  in  either  way. 


o  /  A  graph  of  pressure  ver¬ 
sus  time  for  a  periodic  sound 
wave,  the  vowel  “ah.” 


p/A  similar  graph  for  a  non¬ 
periodic  wave,  “sh.” 


height 


x 

r  /  A  water  wave  profile  cre¬ 
ated  by  a  series  of  repeating 
pulses. 
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s  /  Wavelengths  of  linear  and 
circular  waves. 


An  easy  way  to  visualize  this  is  in  terms  of  a  strip  chart  recorder, 
an  obsolescing  device  consisting  of  a  pen  that  wiggles  back  and  forth 
as  a  roll  of  paper  is  fed  under  it.  It  can  be  used  to  record  a  per¬ 
son’s  electrocardiogram,  or  seismic  waves  too  small  to  be  felt  as  a 
noticeable  earthquake  but  detectable  by  a  seismometer.  Taking  the 
seismometer  as  an  example,  the  chart  is  essentially  a  record  of  the 
ground’s  wave  motion  as  a  function  of  time,  but  if  the  paper  was  set 
to  feed  at  the  same  velocity  as  the  motion  of  an  earthquake  wave,  it 
would  also  be  a  full-scale  representation  of  the  profile  of  the  actual 
wave  pattern  itself.  Assuming,  as  is  usually  the  case,  that  the  wave 
velocity  is  a  constant  number  regardless  of  the  wave’s  shape,  know¬ 
ing  the  wave  motion  as  a  function  of  time  is  equivalent  to  knowing 
it  as  a  function  of  position. 

Wavelength 

Any  wave  that  is  periodic  will  also  display  a  repeating  pattern 
when  graphed  as  a  function  of  position.  The  distance  spanned  by 
one  repetition  is  referred  to  as  one  wavelength.  The  usual  notation 
for  wavelength  is  A,  the  Greek  letter  lambda.  Wavelength  is  to  space 
as  period  is  to  time. 

Wave  velocity  related  to  frequency  and  wavelength 

Suppose  that  we  create  a  repetitive  disturbance  by  kicking  the 
surface  of  a  swimming  pool.  We  are  essentially  making  a  series  of 
wave  pulses.  The  wavelength  is  simply  the  distance  a  pulse  is  able  to 
travel  before  we  make  the  next  pulse.  The  distance  between  pulses 
is  A,  and  the  time  between  pulses  is  the  period,  T,  so  the  speed  of 
the  wave  is  the  distance  divided  by  the  time, 

v  =  X/T. 

This  important  and  useful  relationship  is  more  commonly  written 
in  terms  of  the  frequency, 


v  =  /A. 


Wavelength  of  radio  waves  example  5 

>  The  speed  of  light  is  3.0  x  108  m/s.  What  is  the  wavelength  of 
the  radio  waves  emitted  by  KMHD,  a  station  whose  frequency  is 
89.1  MHz? 

>  Solving  for  wavelength,  we  have 

A  =  v/f 

=  (3.0  x  108  m/s)  /(89.1  x  106  s^1) 

=  3.4  m 
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The  size  of  a  radio  antenna  is  closely  related  to  the  wavelength 
of  the  waves  it  is  intended  to  receive.  The  match  need  not  be  exact 
(since  after  all  one  antenna  can  receive  more  than  one  wavelength!), 
but  the  ordinary  “whip”  antenna  such  as  a  car’s  is  1/4  of  a  wave¬ 
length.  An  antenna  optimized  to  receive  KMHD’s  signal  would  have 
a  length  of  (3.4  m)/4  =  0.85  m. 

The  equation  v  =  f  A  defines  a  fixed  relationship  between  any  two 
of  the  variables  if  the  other  is  held  fixed.  The  speed  of  radio  waves 
in  air  is  almost  exactly  the  same  for  all  wavelengths  and  frequencies 
(it  is  exactly  the  same  if  they  are  in  a  vacuum),  so  there  is  a  fixed 
relationship  between  their  frequency  and  wavelength.  Thus  we  can 
say  either  “Are  we  on  the  same  wavelength?”  or  “Are  we  on  the 
same  frequency?” 

A  different  example  is  the  behavior  of  a  wave  that  travels  from 
a  region  where  the  medium  has  one  set  of  properties  to  an  area 
where  the  medium  behaves  differently.  The  frequency  is  now  fixed, 
because  otherwise  the  two  portions  of  the  wave  would  otherwise 
get  out  of  step,  causing  a  kink  or  discontinuity  at  the  boundary, 
which  would  be  unphysical.  (A  more  careful  argument  is  that  a 
kink  or  discontinuity  would  have  infinite  curvature,  and  waves  tend 
to  flatten  out  their  curvature.  An  infinite  curvature  would  flatten 
out  infinitely  fast,  i.e.,  it  could  never  occur  in  the  first  place.)  Since 
the  frequency  must  stay  the  same,  any  change  in  the  velocity  that 
results  from  the  new  medium  must  cause  a  change  in  wavelength. 

The  velocity  of  water  waves  depends  on  the  depth  of  the  water, 
so  based  on  A  =  v/f,  we  see  that  water  waves  that  move  into  a 
region  of  different  depth  must  change  their  wavelength,  as  shown  in 
figure  u.  This  effect  can  be  observed  when  ocean  waves  come  up  to 
the  shore.  If  the  deceleration  of  the  wave  pattern  is  sudden  enough, 
the  tip  of  the  wave  can  curl  over,  resulting  in  a  breaking  wave. 

A  note  on  dispersive  waves 

The  discussion  of  wave  velocity  given  here  is  actually  a  little  bit 
of  an  oversimplification  for  a  wave  whose  velocity  depends  on  its 
frequency  and  wavelength.  Such  a  wave  is  called  a  dispersive  wave. 
Nearly  all  the  waves  we  deal  with  in  this  course  are  nondispersive, 
but  the  issue  becomes  important  in  chapter  13,  where  it  is  discussed 
in  detail. 

Sinusoidal  waves 

Sinusoidal  waves  are  the  most  important  special  case  of  periodic 
waves.  In  fact,  many  scientists  and  engineers  would  be  uncomfort¬ 
able  with  defining  a  waveform  like  the  “ah”  vowel  sound  as  having 
a  definite  frequency  and  wavelength,  because  they  consider  only 
sine  waves  to  be  pure  examples  of  a  certain  frequency  and  wave¬ 
lengths.  Their  bias  is  not  unreasonable,  since  the  French  mathe¬ 
matician  Fourier  showed  that  any  periodic  wave  with  frequency  / 


t  /  Ultrasound,  i.e.,  sound  with 
frequencies  higher  than  the  range 
of  human  hearing,  was  used  to 
make  this  image  of  a  fetus.  The 
resolution  of  the  image  is  related 
to  the  wavelength,  since  details 
smaller  than  about  one  wave¬ 
length  cannot  be  resolved.  High 
resolution  therefore  requires  a 
short  wavelength,  corresponding 
to  a  high  frequency. 


u  /  A  water  wave  traveling 
into  a  region  with  different  depth 
will  change  its  wavelength. 
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can  be  constructed  as  a  superposition  of  sine  waves  with  frequencies 
/,  2/,  3/,  ...  In  this  sense,  sine  waves  are  the  basic,  pure  building 
blocks  of  all  waves.  (Fourier’s  result  so  surprised  the  mathematical 
community  of  France  that  he  was  ridiculed  the  first  time  he  publicly 
presented  his  theorem.) 

However,  what  definition  to  use  is  really  a  matter  of  convenience. 
Our  sense  of  hearing  perceives  any  two  sounds  having  the  same 
period  as  possessing  the  same  pitch,  regardless  of  whether  they  are 
sine  waves  or  not.  This  is  undoubtedly  because  our  ear-brain  system 
evolved  to  be  able  to  interpret  human  speech  and  animal  noises, 
which  are  periodic  but  not  sinusoidal.  Our  eyes,  on  the  other  hand, 
judge  a  color  as  pure  (belonging  to  the  rainbow  set  of  colors)  only 
if  it  is  a  sine  wave. 

Discussion  Questions 

A  Suppose  we  superimpose  two  sine  waves  with  equal  amplitudes 
but  slightly  different  frequencies,  as  shown  in  the  figure.  What  will  the 
superposition  look  like?  What  would  this  sound  like  if  they  were  sound 
waves? 


vwwwvwww 

m/wvwwm 

v  /  Discussion  question  A. 


w  /  The  pattern  of  waves  made 
by  a  point  source  moving  to  the 
right  across  the  water.  Note 
the  shorter  wavelength  of  the 
forward-emitted  waves  and 
the  longer  wavelength  of  the 
backward-going  ones. 


6.1.5  The  Doppler  effect 

Figure  7.2.8  shows  the  wave  pattern  made  by  the  tip  of  a  vi¬ 
brating  rod  which  is  moving  across  the  water.  If  the  rod  had  been 
vibrating  in  one  place,  we  would  have  seen  the  familiar  pattern  of 
concentric  circles,  all  centered  on  the  same  point.  But  since  the 
source  of  the  waves  is  moving,  the  wavelength  is  shortened  on  one 
side  and  lengthened  on  the  other.  This  is  known  as  the  Doppler 
effect. 

Note  that  the  velocity  of  the  waves  is  a  fixed  property  of  the 
medium,  so  for  example  the  forward-going  waves  do  not  get  an  extra 
boost  in  speed  as  would  a  material  object  like  a  bullet  being  shot 
forward  from  an  airplane. 

We  can  also  infer  a  change  in  frequency.  Since  the  velocity  is 
constant,  the  equation  v  =  f  A  tells  us  that  the  change  in  wave¬ 
length  must  be  matched  by  an  opposite  change  in  frequency:  higher 
frequency  for  the  waves  emitted  forward,  and  lower  for  the  ones 
emitted  backward.  The  frequency  Doppler  effect  is  the  reason  for 
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the  familiar  dropping-pitch  sound  of  a  race  car  going  by.  As  the  car 
approaches  us,  we  hear  a  higher  pitch,  but  after  it  passes  us  we  hear 
a  frequency  that  is  lower  than  normal. 

The  Doppler  effect  will  also  occur  if  the  observer  is  moving  but 
the  source  is  stationary.  For  instance,  an  observer  moving  toward  a 
stationary  source  will  perceive  one  crest  of  the  wave,  and  will  then  be 
surrounded  by  the  next  crest  sooner  than  she  otherwise  would  have, 
because  she  has  moved  toward  it  and  hastened  her  encounter  with 
it.  Roughly  speaking,  the  Doppler  effect  depends  only  the  relative 
motion  of  the  source  and  the  observer,  not  on  their  absolute  state 
of  motion  (which  is  not  a  well-defined  notion  in  physics)  or  on  their 
velocity  relative  to  the  medium. 

Restricting  ourselves  to  the  case  of  a  moving  source,  and  to  waves 
emitted  either  directly  along  or  directly  against  the  direction  of  mo¬ 
tion,  we  can  easily  calculate  the  wavelength,  or  equivalently  the 
frequency,  of  the  Doppler-shifted  waves.  Let  u  be  the  velocity  of  the 
source.  The  wavelength  of  the  forward-emitted  waves  is  shortened 
by  an  amount  uT  equal  to  the  distance  traveled  by  the  source  over 
the  course  of  one  period.  Using  the  definition  f  =  1/T  and  the  equa¬ 
tion  v  =  /A,  we  find  for  the  wavelength  X'  of  the  Doppler-shifted 
wave  the  equation 


A  similar  equation  can  be  used  for  the  backward-emitted  waves,  but 
with  a  plus  sign  rather  than  a  minus  sign. 

Doppler-shifted  sound  from  a  race  car  example  6 

>  If  a  race  car  moves  at  a  velocity  of  50  m/s,  and  the  velocity  of 
sound  is  340  m/s,  by  what  percentage  are  the  wavelength  and 
frequency  of  its  sound  waves  shifted  for  an  observer  lying  along 
its  line  of  motion? 

>  For  an  observer  whom  the  car  is  approaching,  we  find 

1  -  -  =  0.85, 
v 

so  the  shift  in  wavelength  is  15%.  Since  the  frequency  is  inversely 
proportional  to  the  wavelength  for  a  fixed  value  of  the  speed  of 
sound,  the  frequency  is  shifted  upward  by 


1/0.85  =  1.18, 


i.e. ,  a  change  of  18%.  (For  velocities  that  are  small  compared 
to  the  wave  velocities,  the  Doppler  shifts  of  the  wavelength  and 
frequency  are  about  the  same.) 

Doppler  shift  of  the  light  emitted  by  a  race  car  example  7 

>  What  is  the  percent  shift  in  the  wavelength  of  the  light  waves 
emitted  by  a  race  car’s  headlights? 
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x/The  galaxy  Ml  00  in  the 
constellation  Coma  Berenices. 
Under  higher  magnification,  the 
milky  clouds  reveal  themselves  to 
be  composed  of  trillions  of  stars. 


y  /  The  telescope  at  Mount 
Wilson  used  by  Hubble. 


>  Looking  up  the  speed  of  light  in  the  back  of  the  book,  v  =  3.0  x 
108  m/s,  we  find 

1  -  -=  0.99999983, 
v 

i.e. ,  the  percentage  shift  is  only  0.000017%. 

The  second  example  shows  that  under  ordinary  earthbound  cir¬ 
cumstances,  Doppler  shifts  of  light  are  negligible  because  ordinary 
things  go  so  much  slower  than  the  speed  of  light.  It’s  a  different 
story,  however,  when  it  comes  to  stars  and  galaxies,  and  this  leads 
us  to  a  story  that  has  profound  implications  for  our  understanding 
of  the  origin  of  the  universe. 

The  Big  Bang 

As  soon  as  astronomers  began  looking  at  the  sky  through  tele¬ 
scopes,  they  began  noticing  certain  objects  that  looked  like  clouds 
in  deep  space.  The  fact  that  they  looked  the  same  night  after  night 
meant  that  they  were  beyond  the  earth’s  atmosphere.  Not  know¬ 
ing  what  they  really  were,  but  wanting  to  sound  official,  they  called 
them  “nebulae,”  a  Latin  word  meaning  “clouds”  but  sounding  more 
impressive.  In  the  early  20th  century,  astronomers  realized  that  al¬ 
though  some  really  were  clouds  of  gas  (e.g.,  the  middle  “star”  of 
Orion’s  sword,  which  is  visibly  fuzzy  even  to  the  naked  eye  when 
conditions  are  good),  others  were  what  we  now  call  galaxies:  virtual 
island  universes  consisting  of  trillions  of  stars  (for  example  the  An¬ 
dromeda  Galaxy,  which  is  visible  as  a  fuzzy  patch  through  binoc¬ 
ulars).  Three  hundred  years  after  Galileo  had  resolved  the  Milky 
Way  into  individual  stars  through  his  telescope,  astronomers  real¬ 
ized  that  the  universe  is  made  of  galaxies  of  stars,  and  the  Milky 
Way  is  simply  the  visible  part  of  the  flat  disk  of  our  own  galaxy, 
seen  from  inside. 

This  opened  up  the  scientific  study  of  cosmology,  the  structure 
and  history  of  the  universe  as  a  whole,  a  field  that  had  not  been 
seriously  attacked  since  the  days  of  Newton.  Newton  had  realized 
that  if  gravity  was  always  attractive,  never  repulsive,  the  universe 
would  have  a  tendency  to  collapse.  His  solution  to  the  problem  was 
to  posit  a  universe  that  was  infinite  and  uniformly  populated  with 
matter,  so  that  it  would  have  no  geometrical  center.  The  gravita¬ 
tional  forces  in  such  a  universe  would  always  tend  to  cancel  out  by 
symmetry,  so  there  would  be  no  collapse.  By  the  20th  century,  the 
belief  in  an  unchanging  and  infinite  universe  had  become  conven¬ 
tional  wisdom  in  science,  partly  as  a  reaction  against  the  time  that 
had  been  wasted  trying  to  find  explanations  of  ancient  geological 
phenomena  based  on  catastrophes  suggested  by  biblical  events  like 
Noah’s  flood. 

In  the  1920’s  astronomer  Edwin  Hubble  began  studying  the 
Doppler  shifts  of  the  light  emitted  by  galaxies.  A  former  college 
football  player  with  a  serious  nicotine  addiction,  Hubble  did  not 
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set  out  to  change  our  image  of  the  beginning  of  the  universe.  His 
autobiography  seldom  even  mentions  the  cosmological  discovery  for 
which  he  is  now  remembered.  When  astronomers  began  to  study  the 
Doppler  shifts  of  galaxies,  they  expected  that  each  galaxy’s  direction 
and  velocity  of  motion  would  be  essentially  random.  Some  would  be 
approaching  us,  and  their  light  would  therefore  be  Doppler-shifted 
to  the  blue  end  of  the  spectrum,  while  an  equal  number  would  be 
expected  to  have  red  shifts.  What  Hubble  discovered  instead  was 
that  except  for  a  few  very  nearby  ones,  all  the  galaxies  had  red 
shifts,  indicating  that  they  were  receding  from  us  at  a  hefty  frac¬ 
tion  of  the  speed  of  light.  Not  only  that,  but  the  ones  farther  away 
were  receding  more  quickly.  The  speeds  were  directly  proportional 
to  their  distance  from  us. 

Did  this  mean  that  the  earth  (or  at  least  our  galaxy)  was  the 
center  of  the  universe?  No,  because  Doppler  shifts  of  light  only 
depend  on  the  relative  motion  of  the  source  and  the  observer.  If 
we  see  a  distant  galaxy  moving  away  from  us  at  10%  of  the  speed 
of  light,  we  can  be  assured  that  the  astronomers  who  live  in  that 
galaxy  will  see  ours  receding  from  them  at  the  same  speed  in  the 
opposite  direction.  The  whole  universe  can  be  envisioned  as  a  rising 
loaf  of  raisin  bread.  As  the  bread  expands,  there  is  more  and  more 
space  between  the  raisins.  The  farther  apart  two  raisins  are,  the 
greater  the  speed  with  which  they  move  apart. 

The  universe’s  expansion  is  presumably  decelerating  because  of 
gravitational  attraction  among  the  galaxies.  We  do  not  presently 
know  whether  there  is  enough  mass  in  the  universe  to  cause  enough 
attraction  to  halt  the  expansion  eventually.  But  perhaps  more  in¬ 
teresting  than  the  distant  future  of  the  universe  is  what  its  present 
expansion  implies  about  its  past.  Extrapolating  backward  in  time 
using  the  known  laws  of  physics,  the  universe  must  have  been  denser 
and  denser  at  earlier  and  earlier  times.  At  some  point,  it  must  have 
been  extremely  dense  and  hot,  and  we  can  even  detect  the  radia¬ 
tion  from  this  early  fireball,  in  the  form  of  microwave  radiation  that 
permeates  space.  The  phrase  Big  Bang  was  originally  coined  by  the 
doubters  of  the  theory  to  make  it  sound  ridiculous,  but  it  stuck, 
and  today  essentially  all  astronomers  accept  the  Big  Bang  theory 
based  on  the  very  direct  evidence  of  the  red  shifts  and  the  cosmic 
microwave  background  radiation. 

Finally  it  should  be  noted  what  the  Big  Bang  theory  is  not.  It  is 
not  an  explanation  of  why  the  universe  exists.  Such  questions  belong 
to  the  realm  of  religion,  not  science.  Science  can  find  ever  simpler 
and  ever  more  fundamental  explanations  for  a  variety  of  phenom¬ 
ena,  but  ultimately  science  takes  the  universe  as  it  is  according  to 
observations. 

Furthermore,  there  is  an  unfortunate  tendency,  even  among  many 
scientists,  to  speak  of  the  Big  Bang  theory  as  a  description  of  the 


z  /  How  do  astronomers  know 
what  mixture  of  wavelengths  a 
star  emitted  originally,  so  that 
they  can  tell  how  much  the 
Doppler  shift  was?  This  image 
(obtained  by  the  author  with 
equipment  costing  about  $5,  and 
no  telescope)  shows  the  mixture 
of  colors  emitted  by  the  star 
Sirius.  (If  you  have  the  book  in 
black  and  white,  blue  is  on  the  left 
and  red  on  the  right.)  The  star 
appears  white  or  bluish-white  to 
the  eye,  but  any  light  looks  white 
if  it  contains  roughly  an  equal 
mixture  of  the  rainbow  colors, 
i.e.,  of  all  the  pure  sinusoidal 
waves  with  wavelengths  lying  in 
the  visible  range.  Note  the  black 
“gap  teeth.”  These  are  the  fin¬ 
gerprint  of  hydrogen  in  the  outer 
atmosphere  of  Sirius.  These 
wavelengths  are  selectively  ab¬ 
sorbed  by  hydrogen.  Sirius  is  in 
our  own  galaxy,  but  similar  stars 
in  other  galaxies  would  have 
the  whole  pattern  shifted  toward 
the  red  end,  indicating  they  are 
moving  away  from  us. 
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aa  /  Shock  waves  are  cre¬ 
ated  by  the  X-15  rocket  plane, 
flying  at  3.5  times  the  speed  of 
sound. 


ab/This  fighter  jet  has  just 
accelerated  past  the  speed  of 
sound.  The  sudden  decom¬ 
pression  of  the  air  causes  water 
droplets  to  condense,  forming  a 
cloud. 


very  first  event  in  the  universe,  which  caused  everything  after  it. 
Although  it  is  true  that  time  may  have  had  a  beginning  (Einstein’s 
theory  of  general  relativity  admits  such  a  possibility),  the  methods 
of  science  can  only  work  within  a  certain  range  of  conditions  such 
as  temperature  and  density.  Beyond  a  temperature  of  about  109  K, 
the  random  thermal  motion  of  subatomic  particles  becomes  so  rapid 
that  its  velocity  is  comparable  to  the  speed  of  light.  Early  enough  in 
the  history  of  the  universe,  when  these  temperatures  existed,  New¬ 
tonian  physics  becomes  less  accurate,  and  we  must  describe  nature 
using  the  more  general  description  given  by  Einstein’s  theory  of 
relativity,  which  encompasses  Newtonian  physics  as  a  special  case. 
At  even  higher  temperatures,  beyond  about  1033  degrees,  physicists 
know  that  Einstein’s  theory  as  well  begins  to  fall  apart,  but  we  don’t 
know  how  to  construct  the  even  more  general  theory  of  nature  that 
would  work  at  those  temperatures.  No  matter  how  far  physics  pro¬ 
gresses,  we  will  never  be  able  to  describe  nature  at  infinitely  high 
temperatures,  since  there  is  a  limit  to  the  temperatures  we  can  ex¬ 
plore  by  experiment  and  observation  in  order  to  guide  us  to  the 
right  theory.  We  are  confident  that  we  understand  the  basic  physics 
involved  in  the  evolution  of  the  universe  starting  a  few  minutes  after 
the  Big  Bang,  and  we  may  be  able  to  push  back  to  milliseconds  or 
microseconds  after  it,  but  we  cannot  use  the  methods  of  science  to 
deal  with  the  beginning  of  time  itself. 

A  note  on  Doppler  shifts  of  light 

If  Doppler  shifts  depend  only  on  the  relative  motion  of  the  source 
and  receiver,  then  there  is  no  way  for  a  person  moving  with  the 
source  and  another  person  moving  with  the  receiver  to  determine 
who  is  moving  and  who  isn’t.  Either  can  blame  the  Doppler  shift 
entirely  on  the  other’s  motion  and  claim  to  be  at  rest  herself.  This  is 
entirely  in  agreement  with  the  principle  stated  originally  by  Galileo 
that  all  motion  is  relative. 

On  the  other  hand,  a  careful  analysis  of  the  Doppler  shifts  of 
water  or  sound  waves  shows  that  it  is  only  approximately  true,  at 
low  speeds,  that  the  shifts  just  depend  on  the  relative  motion  of  the 
source  and  observer.  For  instance,  it  is  possible  for  a  jet  plane  to 
keep  up  with  its  own  sound  waves,  so  that  the  sound  waves  appear 
to  stand  still  to  the  pilot  of  the  plane.  The  pilot  then  knows  she 
is  moving  at  exactly  the  speed  of  sound.  The  reason  this  doesn’t 
disprove  the  relativity  of  motion  is  that  the  pilot  is  not  really  de¬ 
termining  her  absolute  motion  but  rather  her  motion  relative  to  the 
air,  which  is  the  medium  of  the  sound  waves. 

Einstein  realized  that  this  solved  the  problem  for  sound  or  water 
waves,  but  would  not  salvage  the  principle  of  relative  motion  in  the 
case  of  light  waves,  since  light  is  not  a  vibration  of  any  physical 
medium  such  as  water  or  air.  Beginning  by  imagining  what  a  beam 
of  light  would  look  like  to  a  person  riding  a  motorcycle  alongside  it, 
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Einstein  eventually  came  up  with  a  radical  new  way  of  describing 
the  universe,  in  which  space  and  time  are  distorted  as  measured 
by  observers  in  different  states  of  motion.  As  a  consequence  of  this 
Theory  of  Relativity,  he  showed  that  light  waves  would  have  Doppler 
shifts  that  would  exactly,  not  just  approximately,  depend  only  on 
the  relative  motion  of  the  source  and  receiver. 

Discussion  Questions 

A  If  an  airplane  travels  at  exactly  the  speed  of  sound,  what  would  be 
the  wavelength  of  the  forward-emitted  part  of  the  sound  waves  it  emitted? 
How  should  this  be  interpreted,  and  what  would  actually  happen?  What 
happens  if  it’s  going  faster  than  the  speed  of  sound?  Can  you  use  this  to 
explain  what  you  see  in  figures  aa  and  ab? 

B  If  bullets  go  slower  than  the  speed  of  sound,  why  can  a  supersonic 
fighter  plane  catch  up  to  its  own  sound,  but  not  to  its  own  bullets? 

C  If  someone  inside  a  plane  is  talking  to  you,  should  their  speech  be 
Doppler  shifted? 
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Speech  is  what  separates  humans  most  decisively  from  animals.  No 
other  species  can  master  syntax,  and  even  though  chimpanzees  can 
learn  a  vocabulary  of  hand  signs,  there  is  an  unmistakable  difference 
between  a  human  infant  and  a  baby  chimp:  starting  from  birth,  the 
human  experiments  with  the  production  of  complex  speech  sounds. 

Since  speech  sounds  are  instinctive  for  us,  we  seldom  think  about 
them  consciously.  How  do  we  control  sound  waves  so  skillfully? 
Mostly  we  do  it  by  changing  the  shape  of  a  connected  set  of  hollow 
cavities  in  our  chest,  throat,  and  head.  Somehow  by  moving  the 
boundaries  of  this  space  in  and  out,  we  can  produce  all  the  vowel 
sounds.  Up  until  now,  we  have  been  studying  only  those  properties 
of  waves  that  can  be  understood  as  if  they  existed  in  an  infinite, 
open  space  with  no  boundaries.  In  this  chapter  we  address  what 
happens  when  a  wave  is  confined  within  a  certain  space,  or  when  a 
wave  pattern  encounters  the  boundary  between  two  different  media, 
such  as  when  a  light  wave  moving  through  air  encounters  a  glass 
windowpane. 

6.2.1  Reflection,  transmission,  and  absorption 

Reflection  and  transmission 


a  /  A  cross-sectional  view  of 
a  human  body,  showing  the  vocal 
tract. 


b  /  Circular  water  waves  are 
reflected  from  a  boundary  on  the 
left.  ( PSSC  Physics ) 


Sound  waves  can  echo  back  from  a  cliff,  and  light  waves  are 
reflected  from  the  surface  of  a  pond.  We  use  the  word  reflection, 
normally  applied  only  to  light  waves  in  ordinary  speech,  to  describe 
any  such  case  of  a  wave  rebounding  from  a  barrier.  Figure  (a)  shows 
a  circular  water  wave  being  reflected  from  a  straight  wall.  In  this 
chapter,  we  will  concentrate  mainly  on  reflection  of  waves  that  move 
in  one  dimension,  as  in  figure  c/1. 

Wave  reflection  does  not  surprise  us.  After  all,  a  material  object 
such  as  a  rubber  ball  would  bounce  back  in  the  same  way.  But  waves 
are  not  objects,  and  there  are  some  surprises  in  store. 

First,  only  part  of  the  wave  is  usually  reflected.  Looking  out 
through  a  window,  we  see  light  waves  that  passed  through  it,  but  a 
person  standing  outside  would  also  be  able  to  see  her  reflection  in 
the  glass.  A  light  wave  that  strikes  the  glass  is  partly  reflected  and 
partly  transmitted  (passed)  by  the  glass.  The  energy  of  the  original 
wave  is  split  between  the  two.  This  is  different  from  the  behavior  of 
the  rubber  ball,  which  must  go  one  way  or  the  other,  not  both. 

Second,  consider  what  you  see  if  you  are  swimming  underwater 
and  you  look  up  at  the  surface.  You  see  your  own  reflection.  This 
is  utterly  counterintuitive,  since  we  would  expect  the  light  waves  to 
burst  forth  to  freedom  in  the  wide-open  air.  A  material  projectile 
shot  up  toward  the  surface  would  never  rebound  from  the  water-air 
boundary! 

What  is  it  about  the  difference  between  two  media  that  causes 
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waves  to  be  partly  reflected  at  the  boundary  between  them?  Is 
it  their  density?  Their  chemical  composition?  Ultimately  all  that 
matters  is  the  speed  of  the  wave  in  the  two  media.  A  wave  is  partially 
reflected  and  partially  transmitted  at  the  boundary  between  media 
in  which  it  has  different  speeds.  For  example,  the  speed  of  light 
waves  in  window  glass  is  about  30%  less  than  in  air,  which  explains 
why  windows  always  make  reflections.  Figure  c  shows  examples  of 
wave  pulses  being  reflected  at  the  boundary  between  two  coil  springs 
of  different  weights,  in  which  the  wave  speed  is  different. 


Reflections  such  as  b  and  c/1,  where  a  wave  encounters  a  massive 
fixed  object,  can  usually  be  understood  on  the  same  basis  as  cases 
like  c/2  and  c/3  where  two  media  meet.  Example  c/1,  for  instance, 
is  like  a  more  extreme  version  of  example  c/2.  If  the  heavy  coil 
spring  in  c/2  was  made  heavier  and  heavier,  it  would  end  up  acting 
like  the  fixed  wall  to  which  the  light  spring  in  c/1  has  been  attached. 

self-check  B 

In  figure  c/1 ,  the  reflected  pulse  is  upside-down,  but  its  depth  is  just  as 
big  as  the  original  pulse’s  height.  How  does  the  energy  of  the  reflected 
pulse  compare  with  that  of  the  original?  o  Answer,  p.  964 

Fish  have  internal  ears.  example  8 

Why  don’t  fish  have  ear-holes?  The  speed  of  sound  waves  in 
a  fish’s  body  is  not  much  different  from  their  speed  in  water,  so 
sound  waves  are  not  strongly  reflected  from  a  fish’s  skin.  They 
pass  right  through  its  body,  so  fish  can  have  internal  ears. 
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c/1.  A  wave  on  a  coil  spring, 
initially  traveling  to  the  left,  is  re¬ 
flected  from  the  fixed  end.  2.  A 
wave  in  the  lighter  spring,  where 
the  wave  speed  is  greater,  trav¬ 
els  to  the  left  and  is  then  partly 
reflected  and  partly  transmitted  at 
the  boundary  with  the  heavier  coil 
spring,  which  has  a  lower  wave 
speed.  The  reflection  is  inverted. 
3.  A  wave  moving  to  the  right 
in  the  heavier  spring  is  partly  re¬ 
flected  at  the  boundary  with  the 
lighter  spring.  The  reflection  is 
uninverted.  (PSSC  Physics) 
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d  /  An  uninverted  reflection.  The 
reflected  pulse  is  reversed  front 
to  back,  but  is  not  upside-down. 


e/An  inverted  reflection.  The 
reflected  pulse  is  reversed  both 
front  to  back  and  top  to  bottom. 


Whale  songs  traveling  long  distances  example  9 

Sound  waves  travel  at  drastically  different  speeds  through  rock, 
water,  and  air.  Whale  songs  are  thus  strongly  reflected  both  at 
both  the  bottom  and  the  surface.  The  sound  waves  can  travel 
hundreds  of  miles,  bouncing  repeatedly  between  the  bottom  and 
the  surface,  and  still  be  detectable.  Sadly,  noise  pollution  from 
ships  has  nearly  shut  down  this  cetacean  version  of  the  inter¬ 
net. 

Long-distance  radio  communication  example  1 0 

Radio  communication  can  occur  between  stations  on  opposite 
sides  of  the  planet.  The  mechanism  is  entirely  similar  to  the  one 
explained  in  the  previous  example,  but  the  three  media  involved 
are  the  earth,  the  atmosphere,  and  the  ionosphere. 

self-check  C 

Sonar  is  a  method  for  ships  and  submarines  to  detect  each  other  by 
producing  sound  waves  and  listening  for  echoes.  What  properties  would 
an  underwater  object  have  to  have  in  order  to  be  invisible  to  sonar?  > 
Answer,  p.  964 

The  use  of  the  word  “reflection”  naturally  brings  to  mind  the  cre¬ 
ation  of  an  image  by  a  mirror,  but  this  might  be  confusing,  because 
we  do  not  normally  refer  to  “reflection”  when  we  look  at  surfaces 
that  are  not  shiny.  Nevertheless,  reflection  is  how  we  see  the  surfaces 
of  all  objects,  not  just  polished  ones.  When  we  look  at  a  sidewalk, 
for  example,  we  are  actually  seeing  the  reflecting  of  the  sun  from 
the  concrete.  The  reason  we  don’t  see  an  image  of  the  sun  at  our 
feet  is  simply  that  the  rough  surface  blurs  the  image  so  drastically. 

Inverted  and  uninverted  reflections 

Notice  how  the  pulse  reflected  back  to  the  right  in  example  c/2 
comes  back  upside-down,  whereas  the  one  reflected  back  to  the  left 
in  c/3  returns  in  its  original  upright  form.  This  is  true  for  other 
waves  as  well.  In  general,  there  are  two  possible  types  of  reflections, 
a  reflection  back  into  a  faster  medium  and  a  reflection  back  into  a 
slower  medium.  One  type  will  always  be  an  inverting  reflection  and 
one  noninverting. 

It’s  important  to  realize  that  when  we  discuss  inverted  and  un¬ 
inverted  reflections  on  a  string,  we  are  talking  about  whether  the 
wave  is  flipped  across  the  direction  of  motion  (i.e. ,  upside-down  in 
these  drawings).  The  reflected  pulse  will  always  be  reversed  front 
to  back,  as  shown  in  figures  d  and  e.  This  is  because  it  is  traveling 
in  the  other  direction.  The  leading  edge  of  the  pulse  is  what  gets 
reflected  first,  so  it  is  still  ahead  when  it  starts  back  to  the  left  - 
it’s  just  that  “ahead”  is  now  in  the  opposite  direction. 
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Absorption 

So  far  we  have  tacitly  assumed  that  wave  energy  remains  as  wave 
energy,  and  is  not  converted  to  any  other  form.  If  this  was  true,  then 
the  world  would  become  more  and  more  full  of  sound  waves,  which 
could  never  escape  into  the  vacuum  of  outer  space.  In  reality,  any 
mechanical  wave  consists  of  a  traveling  pattern  of  vibrations  of  some 
physical  medium,  and  vibrations  of  matter  always  produce  heat,  as 
when  you  bend  a  coathangar  back  and  forth  and  it  becomes  hot. 
We  can  thus  expect  that  in  mechanical  waves  such  as  water  waves, 
sound  waves,  or  waves  on  a  string,  the  wave  energy  will  gradually 
be  converted  into  heat.  This  is  referred  to  as  absorption.  The 
reduction  in  the  wave’s  energy  can  also  be  described  as  a  reduction  in 
amplitude,  the  relationship  between  them  being,  as  with  a  vibrating 
object,  E  oc  A2. 

The  wave  suffers  a  decrease  in  amplitude,  as  shown  in  figure  f. 
The  decrease  in  amplitude  amounts  to  the  same  fractional  change 
for  each  unit  of  distance  covered.  For  example,  if  a  wave  decreases 
from  amplitude  2  to  amplitude  1  over  a  distance  of  1  meter,  then 
after  traveling  another  meter  it  will  have  an  amplitude  of  1/2.  That 
is,  the  reduction  in  amplitude  is  exponential.  This  can  be  proved 
as  follows.  By  the  principle  of  superposition,  we  know  that  a  wave 
of  amplitude  2  must  behave  like  the  superposition  of  two  identical 
waves  of  amplitude  1.  If  a  single  amplitude- 1  wave  would  die  down  to 
amplitude  1/2  over  a  certain  distance,  then  two  amplitude- 1  waves 
superposed  on  top  of  one  another  to  make  amplitude  1+1=2  must 
die  down  to  amplitude  l/2+l/2=l  over  the  same  distance. 

self-check  D 

As  a  wave  undergoes  absorption,  it  loses  energy.  Does  this  mean  that 
it  slows  down?  >  Answer,  p.  964 

In  many  cases,  this  frictional  heating  effect  is  quite  weak.  Sound 
waves  in  air,  for  instance,  dissipate  into  heat  extremely  slowly,  and 
the  sound  of  church  music  in  a  cathedral  may  reverberate  for  as  much 
as  3  or  4  seconds  before  it  becomes  inaudible.  During  this  time  it 
has  traveled  over  a  kilometer!  Even  this  very  gradual  dissipation 
of  energy  occurs  mostly  as  heating  of  the  church’s  walls  and  by  the 
leaking  of  sound  to  the  outside  (where  it  will  eventually  end  up  as 
heat).  Under  the  right  conditions  (humid  air  and  low  frequency),  a 
sound  wave  in  a  straight  pipe  could  theoretically  travel  hundreds  of 
kilometers  before  being  noticeable  attenuated. 

In  general,  the  absorption  of  mechanical  waves  depends  a  great 
deal  on  the  chemical  composition  and  microscopic  structure  of  the 
medium.  Ripples  on  the  surface  of  antifreeze,  for  instance,  die  out 
extremely  rapidly  compared  to  ripples  on  water.  For  sound  waves 
and  surface  waves  in  liquids  and  gases,  what  matters  is  the  viscosity 
of  the  substance,  i.e.,  whether  it  flows  easily  like  water  or  mercury 
or  more  sluggishly  like  molasses  or  antifreeze.  This  explains  why 
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f/A  pulse  traveling  through 
a  highly  absorptive  medium. 
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our  intuitive  expectation  of  strong  absorption  of  sound  in  water  is 
incorrect.  Water  is  a  very  weak  absorber  of  sound  (viz.  whale  songs 
and  sonar),  and  our  incorrect  intuition  arises  from  focusing  on  the 
wrong  property  of  the  substance:  water’s  high  density,  which  is 
irrelevant,  rather  than  its  low  viscosity,  which  is  what  matters. 

Light  is  an  interesting  case,  since  although  it  can  travel  through 
matter,  it  is  not  itself  a  vibration  of  any  material  substance.  Thus 
we  can  look  at  the  star  Sirius,  1014  km  away  from  us,  and  be  as¬ 
sured  that  none  of  its  light  was  absorbed  in  the  vacuum  of  outer 
space  during  its  9-year  journey  to  us.  The  Hubble  Space  Telescope 
routinely  observes  light  that  has  been  on  its  way  to  us  since  the 
early  history  of  the  universe,  billions  of  years  ago.  Of  course  the 
energy  of  light  can  be  dissipated  if  it  does  pass  through  matter  (and 
the  light  from  distant  galaxies  is  often  absorbed  if  there  happen  to 
be  clouds  of  gas  or  dust  in  between). 

Soundproofing  example  1 1 

Typical  amateur  musicians  setting  out  to  soundproof  their  garages 
tend  to  think  that  they  should  simply  cover  the  walls  with  the 
densest  possible  substance.  In  fact,  sound  is  not  absorbed  very 
strongly  even  by  passing  through  several  inches  of  wood.  A  better 
strategy  for  soundproofing  is  to  create  a  sandwich  of  alternating 
layers  of  materials  in  which  the  speed  of  sound  is  very  different, 
to  encourage  reflection. 

The  classic  design  is  alternating  layers  of  fiberglass  and  plywood. 
The  speed  of  sound  in  plywood  is  very  high,  due  to  its  stiffness, 
while  its  speed  in  fiberglass  is  essentially  the  same  as  its  speed 
in  air.  Both  materials  are  fairly  good  sound  absorbers,  but  sound 
waves  passing  through  a  few  inches  of  them  are  still  not  going 
to  be  absorbed  sufficiently.  The  point  of  combining  them  is  that 
a  sound  wave  that  tries  to  get  out  will  be  strongly  reflected  at 
each  of  the  fiberglass-plywood  boundaries,  and  will  bounce  back 
and  forth  many  times  like  a  ping  pong  ball.  Due  to  all  the  back- 
and-forth  motion,  the  sound  may  end  up  traveling  a  total  distance 
equal  to  ten  times  the  actual  thickness  of  the  soundproofing  be¬ 
fore  it  escapes.  This  is  the  equivalent  of  having  ten  times  the 
thickness  of  sound-absorbing  material. 

Radio  transmission  example  12 

A  radio  transmitting  station  must  have  a  length  of  wire  or  cable 
connecting  the  amplifier  to  the  antenna.  The  cable  and  the  an¬ 
tenna  act  as  two  different  media  for  radio  waves,  and  there  will 
therefore  be  partial  reflection  of  the  waves  as  they  come  from  the 
cable  to  the  antenna.  If  the  waves  bounce  back  and  forth  many 
times  between  the  amplifier  and  the  antenna,  a  great  deal  of  their 
energy  will  be  absorbed.  There  are  two  ways  to  attack  the  prob¬ 
lem.  One  possibility  is  to  design  the  antenna  so  that  the  speed  of 
the  waves  in  it  is  as  close  as  possible  to  the  speed  of  the  waves 
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in  the  cable;  this  minimizes  the  amount  of  reflection.  The  other 
method  is  to  connect  the  amplifier  to  the  antenna  using  a  type 
of  wire  or  cable  that  does  not  strongly  absorb  the  waves.  Partial 
reflection  then  becomes  irrelevant,  since  all  the  wave  energy  will 
eventually  exit  through  the  antenna. 

Discussion  Questions 

A  A  sound  wave  that  underwent  a  pressure-inverting  reflection  would 
have  its  compressions  converted  to  expansions  and  vice  versa.  How 
would  its  energy  and  frequency  compare  with  those  of  the  original  sound? 
Would  it  sound  any  different?  What  happens  if  you  swap  the  two  wires 
where  they  connect  to  a  stereo  speaker,  resulting  in  waves  that  vibrate  in 
the  opposite  way? 

6.2.2  Quantitative  treatment  of  reflection 

In  this  subsection  we  analyze  the  reasons  why  reflections  occur  at 
a  speed-changing  boundary,  predict  quantitatively  the  intensities  of 
reflection  and  transmission,  and  discuss  how  to  predict  for  any  type 
of  wave  which  reflections  are  inverting  and  which  are  uninverting. 

Why  reflection  occurs 

To  understand  the  fundamental  reasons  for  what  does  occur  at 
the  boundary  between  media,  let’s  first  discuss  what  doesn’t  happen. 
For  the  sake  of  concreteness,  consider  a  sinusoidal  wave  on  a  string. 
If  the  wave  progresses  from  a  heavier  portion  of  the  string,  in  which 
its  velocity  is  low,  to  a  lighter-weight  part,  in  which  it  is  high,  then 
the  equation  v  =  f  A  tells  us  that  it  must  change  its  frequency,  or 
its  wavelength,  or  both.  If  only  the  frequency  changed,  then  the 
parts  of  the  wave  in  the  two  different  portions  of  the  string  would 
quickly  get  out  of  step  with  each  other,  producing  a  discontinuity  in 
the  wave,  g/1.  This  is  unphysical,  so  we  know  that  the  wavelength 
must  change  while  the  frequency  remains  constant,  g/2. 

But  there  is  still  something  unphysical  about  figure  g/2.  The 
sudden  change  in  the  shape  of  the  wave  has  resulted  in  a  sharp  kink 
at  the  boundary.  This  can’t  really  happen,  because  the  medium 
tends  to  accelerate  in  such  a  way  as  to  eliminate  curvature.  A  sharp 
kink  corresponds  to  an  infinite  curvature  at  one  point,  which  would 
produce  an  infinite  acceleration,  which  would  not  be  consistent  with 
the  smooth  pattern  of  wave  motion  envisioned  in  fig.  g/2.  Waves 
can  have  kinks,  but  not  stationary  kinks. 

We  conclude  that  without  positing  partial  reflection  of  the  wave, 
we  cannot  simultaneously  satisfy  the  requirements  of  (1)  continuity 
of  the  wave,  and  (2)  no  sudden  changes  in  the  slope  of  the  wave.  In 
other  words,  we  assume  that  both  the  wave  and  its  derivative  are 
continuous  functions.) 

Does  this  amount  to  a  proof  that  reflection  occurs?  Not  quite. 
We  have  only  proved  that  certain  types  of  wave  motion  are  not 
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g/1.  A  change  in  frequency 
without  a  change  in  wavelength 
would  produce  a  discontinuity  in 
the  wave.  2.  A  simple  change  in 
wavelength  without  a  reflection 
would  result  in  a  sharp  kink  in  the 
wave. 
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h/A  pulse  being  partially  re¬ 
flected  and  partially  transmitted 
at  the  boundary  between  two 
strings  in  which  the  wave  speed 
is  different.  The  top  drawing 
shows  the  pulse  heading  to  the 
right,  toward  the  heaver  string. 
For  clarity,  all  but  the  first  and 
last  drawings  are  schematic. 
Once  the  reflected  pulse  begins 
to  emerge  from  the  boundary, 
it  adds  together  with  the  trailing 
parts  of  the  incident  pulse.  Their 
sum,  shown  as  a  wider  line,  is 
what  is  actually  observed. 


valid  solutions.  In  the  following  subsection,  we  prove  that  a  valid 
solution  can  always  be  found  in  which  a  reflection  occurs.  Now  in 
physics,  we  normally  assume  (but  seldom  prove  formally)  that  the 
equations  of  motion  have  a  unique  solution,  since  otherwise  a  given 
set  of  initial  conditions  could  lead  to  different  behavior  later  on, 
but  the  Newtonian  universe  is  supposed  to  be  deterministic.  Since 
the  solution  must  be  unique,  and  we  derive  below  a  valid  solution 
involving  a  reflected  pulse,  we  will  have  ended  up  with  what  amounts 
to  a  proof  of  reflection. 

Intensity  of  reflection 

I  will  now  show,  in  the  case  of  waves  on  a  string,  that  it  is  possible 
to  satisfy  the  physical  requirements  given  above  by  constructing  a 
reflected  wave,  and  as  a  bonus  this  will  produce  an  equation  for 
the  proportions  of  reflection  and  transmission  and  a  prediction  as 
to  which  conditions  will  lead  to  inverted  and  which  to  uninverted 
reflection.  We  assume  only  that  the  principle  of  superposition  holds, 
which  is  a  good  approximation  for  waves  on  a  string  of  sufficiently 
small  amplitude. 

Let  the  unknown  amplitudes  of  the  reflected  and  transmitted 
waves  be  R  and  T,  respectively.  An  inverted  reflection  would  be 
represented  by  a  negative  value  of  R.  We  can  without  loss  of  gen¬ 
erality  take  the  incident  (original)  wave  to  have  unit  amplitude. 
Superposition  tells  us  that  if,  for  instance,  the  incident  wave  had 
double  this  amplitude,  we  could  immediately  find  a  corresponding 
solution  simply  by  doubling  R  and  T. 

Just  to  the  left  of  the  boundary,  the  height  of  the  wave  is  given 
by  the  height  1  of  the  incident  wave,  plus  the  height  R  of  the  part 
of  the  reflected  wave  that  has  just  been  created  and  begun  heading 
back,  for  a  total  height  of  1  +  R.  On  the  right  side  immediately  next 
to  the  boundary,  the  transmitted  wave  has  a  height  T.  To  avoid  a 
discontinuity,  we  must  have 

1  +  R  =  T. 

Next  we  turn  to  the  requirement  of  equal  slopes  on  both  sides  of 
the  boundary.  Let  the  slope  of  the  incoming  wave  be  s  immediately 
to  the  left  of  the  junction.  If  the  wave  was  100%  reflected,  and 
without  inversion,  then  the  slope  of  the  reflected  wave  would  be  —  s, 
since  the  wave  has  been  reversed  in  direction.  In  general,  the  slope 
of  the  reflected  wave  equals  —  sR,  and  the  slopes  of  the  superposed 
waves  on  the  left  side  add  up  to  s  —  sR.  On  the  right,  the  slope 
depends  on  the  amplitude,  T,  but  is  also  changed  by  the  stretching 
or  compression  of  the  wave  due  to  the  change  in  speed.  If,  for 
example,  the  wave  speed  is  twice  as  great  on  the  right  side,  then 
the  slope  is  cut  in  half  by  this  effect.  The  slope  on  the  right  is 
therefore  s(v\/v2)T,  where  v\  is  the  velocity  in  the  original  medium 
and  V2  the  velocity  in  the  new  medium.  Equality  of  slopes  gives 
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s  —  sR  =  s(vi/v2)T,  or 


The  first  equation  shows  that  there  is  no  reflection  unless  the 
two  wave  speeds  are  different,  and  that  the  reflection  is  inverted  in 
reflection  back  into  a  fast  medium. 

The  energies  of  the  transmitted  and  reflected  wavers  always  add 
up  to  the  same  as  the  energy  of  the  original  wave.  There  is  never 
any  abrupt  loss  (or  gain)  in  energy  when  a  wave  crosses  a  boundary; 
conversion  of  wave  energy  to  heat  occurs  for  many  types  of  waves, 
but  it  occurs  throughout  the  medium.  The  equation  for  T,  surpris¬ 
ingly,  allows  the  amplitude  of  the  transmitted  wave  to  be  greater 
than  1,  i.e.,  greater  than  that  of  the  incident  wave.  This  does  not 
violate  conservation  of  energy,  because  this  occurs  when  the  second 
string  is  less  massive,  reducing  its  kinetic  energy,  and  the  trans¬ 
mitted  pulse  is  broader  and  less  strongly  curved,  which  lessens  its 
potential  energy. 

Inverted  and  uninverted  reflections  in  general  (optional) 

For  waves  on  a  string,  reflections  back  into  a  faster  medium  are 
inverted,  while  those  back  into  a  slower  medium  are  uninverted.  Is 
this  true  for  all  types  of  waves?  The  rather  subtle  answer  is  that  it 
depends  on  what  property  of  the  wave  you  are  discussing. 

Let’s  start  by  considering  wave  disturbances  of  freeway  traffic. 
Anyone  who  has  driven  frequently  on  crowded  freeways  has  observed 
the  phenomenon  in  which  one  driver  taps  the  brakes,  starting  a  chain 
reaction  that  travels  backward  down  the  freeway  as  each  person  in 
turn  exercises  caution  in  order  to  avoid  rear-ending  anyone.  The 
reason  why  this  type  of  wave  is  relevant  is  that  it  gives  a  simple, 
easily  visualized  example  of  how  our  description  of  a  wave  depends 
on  which  aspect  of  the  wave  we  have  in  mind.  In  steadily  flowing 
freeway  traffic,  both  the  density  of  cars  and  their  velocity  are  con¬ 
stant  all  along  the  road.  Since  there  is  no  disturbance  in  this  pattern 
of  constant  velocity  and  density,  we  say  that  there  is  no  wave.  Now 
if  a  wave  is  touched  off  by  a  person  tapping  the  brakes,  we  can  either 
describe  it  as  a  region  of  high  density  or  as  a  region  of  decreasing 
velocity. 


velocity 


position 


excess 
density 

position 

i  /  A  wave  pattern  in  freeway 
traffic. 
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j/ln  the  mirror  image,  the 
areas  of  positive  excess  traffic 
density  are  still  positive,  but 
the  velocities  of  the  cars  have 
all  been  reversed,  so  areas  of 
positive  excess  velocity  have 
been  turned  into  negative  ones. 


l 


k  /  A  pulse  encounters  two 
boundaries. 


I  /  A  sine  wave  has  been  reflected 
at  two  different  boundaries,  and 
the  two  reflections  interfere. 


The  freeway  traffic  wave  is  in  fact  a  good  model  of  a  sound  wave, 
and  a  sound  wave  can  likewise  be  described  either  by  the  density 
(or  pressure)  of  the  air  or  by  its  speed.  Likewise  many  other  types 
of  waves  can  be  described  by  either  of  two  functions,  one  of  which 
is  often  the  derivative  of  the  other  with  respect  to  position. 

Now  let’s  consider  reflections.  If  we  observe  the  freeway  wave  in 
a  mirror,  the  high-density  area  will  still  appear  high  in  density,  but 
velocity  in  the  opposite  direction  will  now  be  described  by  a  neg¬ 
ative  number.  A  person  observing  the  mirror  image  will  draw  the 
same  density  graph,  but  the  velocity  graph  will  be  flipped  across  the 
x  axis,  and  its  original  region  of  negative  slope  will  now  have  posi¬ 
tive  slope.  Although  I  don’t  know  any  physical  situation  that  would 
correspond  to  the  reflection  of  a  traffic  wave,  we  can  immediately  ap¬ 
ply  the  same  reasoning  to  sound  waves,  which  often  do  get  reflected, 
and  determine  that  a  reflection  can  either  be  density-inverting  and 
velocity-uninverting  or  density-uninverting  and  velocity-inverting. 

This  same  type  of  situation  will  occur  over  and  over  as  one  en¬ 
counters  new  types  of  waves,  and  to  apply  the  analogy  we  need 
only  determine  which  quantities,  like  velocity,  become  negated  in  a 
mirror  image  and  which,  like  density,  stay  the  same. 

A  light  wave,  for  instance,  consists  of  a  traveling  pattern  of  elec¬ 
tric  and  magnetic  fields.  All  you  need  to  know  in  order  to  analyze  the 
reflection  of  light  waves  is  how  electric  and  magnetic  fields  behave 
under  reflection;  you  don’t  need  to  know  any  of  the  detailed  physics 
of  electricity  and  magnetism.  An  electric  held  can  be  detected,  for 
example,  by  the  way  one’s  hair  stands  on  end.  The  direction  of  the 
hair  indicates  the  direction  of  the  electric  held.  In  a  mirror  image, 
the  hair  points  the  other  way,  so  the  electric  held  is  apparently  re¬ 
versed  in  a  mirror  image.  The  behavior  of  magnetic  helds,  however, 
is  a  little  tricky.  The  magnetic  properties  of  a  bar  magnet,  for  in¬ 
stance,  are  caused  by  the  aligned  rotation  of  the  outermost  orbiting 
electrons  of  the  atoms.  In  a  mirror  image,  the  direction  of  rotation 
is  reversed,  say  from  clockwise  to  counterclockwise,  and  so  the  mag¬ 
netic  held  is  reversed  twice:  once  simply  because  the  whole  picture 
is  hipped  and  once  because  of  the  reversed  rotation  of  the  electrons. 
In  other  words,  magnetic  helds  do  not  reverse  themselves  in  a  mirror 
image.  We  can  thus  predict  that  there  will  be  two  possible  types  of 
reflection  of  light  waves.  In  one,  the  electric  held  is  inverted  and  the 
magnetic  held  uninverted  (example  23,  p.  705).  In  the  other,  the 
electric  held  is  uninverted  and  the  magnetic  held  inverted. 

6.2.3  Interference  effects 

If  you  look  at  the  front  of  a  pair  of  high-quality  binoculars,  you 
will  notice  a  greenish-blue  coating  on  the  lenses.  This  is  advertised 
as  a  coating  to  prevent  reflection.  Now  rehection  is  clearly  undesir¬ 
able  —  we  want  the  light  to  go  in  the  binoculars  —  but  so  far  I’ve 
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described  reflection  as  an  unalterable  fact  of  nature,  depending  only 
on  the  properties  of  the  two  wave  media.  The  coating  can’t  change 
the  speed  of  light  in  air  or  in  glass,  so  how  can  it  work?  The  key  is 
that  the  coating  itself  is  a  wave  medium.  In  other  words,  we  have 
a  three-layer  sandwich  of  materials:  air,  coating,  and  glass.  We  will 
analyze  the  way  the  coating  works,  not  because  optical  coatings  are 
an  important  part  of  your  education  but  because  it  provides  a  good 
example  of  the  general  phenomenon  of  wave  interference  effects. 

There  are  two  different  interfaces  between  media:  an  air-coating 
boundary  and  a  coating-glass  boundary.  Partial  reflection  and  par¬ 
tial  transmission  will  occur  at  each  boundary.  For  ease  of  visual¬ 
ization  let’s  start  by  considering  an  equivalent  system  consisting  of 
three  dissimilar  pieces  of  string  tied  together,  and  a  wave  pattern 
consisting  initially  of  a  single  pulse.  Figure  k/1  shows  the  incident 
pulse  moving  through  the  heavy  rope,  in  which  its  velocity  is  low. 
When  it  encounters  the  lighter- weight  rope  in  the  middle,  a  faster 
medium,  it  is  partially  reflected  and  partially  transmitted.  (The 
transmitted  pulse  is  bigger,  but  nevertheless  has  only  part  of  the 
original  energy.)  The  pulse  transmitted  by  the  first  interface  is  then 
partially  reflected  and  partially  transmitted  by  the  second  bound¬ 
ary,  k/3.  In  figure  k/4,  two  pulses  are  on  the  way  back  out  to  the 
left,  and  a  single  pulse  is  heading  off  to  the  right.  (There  is  still  a 
weak  pulse  caught  between  the  two  boundaries,  and  this  will  rattle 
back  and  forth,  rapidly  getting  too  weak  to  detect  as  it  leaks  energy 
to  the  outside  with  each  partial  reflection.) 

Note  how,  of  the  two  reflected  pulses  in  k/4,  one  is  inverted  and 
one  uninverted.  One  underwent  reflection  at  the  first  boundary  (a 
reflection  back  into  a  slower  medium  is  uninverted),  but  the  other 
was  reflected  at  the  second  boundary  (reflection  back  into  a  faster 
medium  is  inverted). 

Now  let’s  imagine  what  would  have  happened  if  the  incoming 
wave  pattern  had  been  a  long  sinusoidal  wave  train  instead  of  a 
single  pulse.  The  first  two  waves  to  reemerge  on  the  left  could  be 
in  phase,  1/1,  or  out  of  phase,  1/2,  or  anywhere  in  between.  The 
amount  of  lag  between  them  depends  entirely  on  the  width  of  the 
middle  segment  of  string.  If  we  choose  the  width  of  the  middle  string 
segment  correctly,  then  we  can  arrange  for  destructive  interference 
to  occur,  1/2,  with  cancellation  resulting  in  a  very  weak  reflected 
wave. 

This  whole  analysis  applies  directly  to  our  original  case  of  optical 
coatings.  Visible  light  from  most  sources  does  consist  of  a  stream 
of  short  sinusoidal  wave-trains  such  as  the  ones  drawn  above.  The 
only  real  difference  between  the  waves-on-a-rope  example  and  the 
case  of  an  optical  coating  is  that  the  first  and  third  media  are  air 
and  glass,  in  which  light  does  not  have  the  same  speed.  However, 
the  general  result  is  the  same  as  long  as  the  air  and  the  glass  have 
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m  /  A  pulse  bounces  back 
and  forth. 


n/We  model  a  guitar  string 
attached  to  the  guitar’s  body  at 
both  ends  as  a  light-weight  string 
attached  to  extremely  heavy 
strings  at  its  ends. 


light-wave  speeds  that  are  either  both  greater  than  the  coating’s  or 
both  less  than  the  coating’s. 

The  business  of  optical  coatings  turns  out  to  be  a  very  arcane 
one,  with  a  plethora  of  trade  secrets  and  “black  magic”  techniques 
handed  down  from  master  to  apprentice.  Nevertheless,  the  ideas 
you  have  learned  about  waves  in  general  are  sufficient  to  allow  you 
to  come  to  some  definite  conclusions  without  any  further  technical 
knowledge.  The  self-check  and  discussion  questions  will  direct  you 
along  these  lines  of  thought. 

self-check  E 

Color  corresponds  to  wavelength  of  light  waves.  Is  it  possible  to  choose 
a  thickness  for  an  optical  coating  that  will  produce  destructive  interfer¬ 
ence  for  all  colors  of  light?  >  Answer,  p. 

964 

This  example  was  typical  of  a  wide  variety  of  wave  interference 
effects.  With  a  little  guidance,  you  are  now  ready  to  figure  out 
for  yourself  other  examples  such  as  the  rainbow  pattern  made  by  a 
compact  disc  or  by  a  layer  of  oil  on  a  puddle. 

Discussion  Questions 

A  Is  it  possible  to  get  complete  destructive  interference  in  an  optical 
coating,  at  least  for  light  of  one  specific  wavelength? 

B  Sunlight  consists  of  sinusoidal  wave-trains  containing  on  the  order 
of  a  hundred  cycles  back-to-back,  for  a  length  of  something  like  a  tenth  of 
a  millimeter.  What  happens  if  you  try  to  make  an  optical  coating  thicker 
than  this? 

C  Suppose  you  take  two  microscope  slides  and  lay  one  on  top  of  the 
other  so  that  one  of  its  edges  is  resting  on  the  corresponding  edge  of  the 
bottom  one.  If  you  insert  a  sliver  of  paper  or  a  hair  at  the  opposite  end, 
a  wedge-shaped  layer  of  air  will  exist  in  the  middle,  with  a  thickness  that 
changes  gradually  from  one  end  to  the  other.  What  would  you  expect  to 
see  if  the  slides  were  illuminated  from  above  by  light  of  a  single  color? 
How  would  this  change  if  you  gradually  lifted  the  lower  edge  of  the  top 
slide  until  the  two  slides  were  finally  parallel? 

D  An  observation  like  the  one  described  in  discussion  question  C  was 
used  by  Newton  as  evidence  against  the  wave  theory  of  light!  If  Newton 
didn’t  know  about  inverting  and  noninverting  reflections,  what  would  have 
seemed  inexplicable  to  him  about  the  region  where  the  air  layer  had  zero 
or  nearly  zero  thickness? 

6.2.4  Waves  bounded  on  both  sides 

In  the  example  of  the  previous  section,  it  was  theoretically  true 
that  a  pulse  would  be  trapped  permanently  in  the  middle  medium, 
but  that  pulse  was  not  central  to  our  discussion,  and  in  any  case  it 
was  weakening  severely  with  each  partial  reflection.  Now  consider 
a  guitar  string.  At  its  ends  it  is  tied  to  the  body  of  the  instrument 
itself,  and  since  the  body  is  very  massive,  the  behavior  of  the  waves 
when  they  reach  the  end  of  the  string  can  be  understood  in  the 
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same  way  as  if  the  actual  guitar  string  was  attached  on  the  ends  to 
strings  that  were  extremely  massive.  Reflections  are  most  intense 
when  the  two  media  are  very  dissimilar.  Because  the  wave  speed  in 
the  body  is  so  radically  different  from  the  speed  in  the  string,  we 
should  expect  nearly  100%  reflection. 

Although  this  may  seem  like  a  rather  bizarre  physical  model  of 
the  actual  guitar  string,  it  already  tells  us  something  interesting 
about  the  behavior  of  a  guitar  that  we  would  not  otherwise  have 
understood.  The  body,  far  from  being  a  passive  frame  for  attaching 
the  strings  to,  is  actually  the  exit  path  for  the  wave  energy  in  the 
strings.  With  every  reflection,  the  wave  pattern  on  the  string  loses 
a  tiny  fraction  of  its  energy,  which  is  then  conducted  through  the 
body  and  out  into  the  air.  (The  string  has  too  little  cross-section  to 
make  sound  waves  efficiently  by  itself.)  By  changing  the  properties 
of  the  body,  moreover,  we  should  expect  to  have  an  effect  on  the 
manner  in  which  sound  escapes  from  the  instrument.  This  is  clearly 
demonstrated  by  the  electric  guitar,  which  has  an  extremely  massive, 
solid  wooden  body.  Here  the  dissimilarity  between  the  two  wave 
media  is  even  more  pronounced,  with  the  result  that  wave  energy 
leaks  out  of  the  string  even  more  slowly.  This  is  why  an  electric 
guitar  with  no  electric  pickup  can  hardly  be  heard  at  all,  and  it  is 
also  the  reason  why  notes  on  an  electric  guitar  can  be  sustained  for 
longer  than  notes  on  an  acoustic  guitar. 

If  we  initially  create  a  disturbance  on  a  guitar  string,  how  will 
the  reflections  behave?  In  reality,  the  finger  or  pick  will  give  the 
string  a  triangular  shape  before  letting  it  go,  and  we  may  think  of 
this  triangular  shape  as  a  very  broad  “dent”  in  the  string  which 
will  spread  out  in  both  directions.  For  simplicity,  however,  let’s  just 
imagine  a  wave  pattern  that  initially  consists  of  a  single,  narrow 
pulse  traveling  up  the  neck,  m/1.  After  reflection  from  the  top  end, 
it  is  inverted,  m/3.  Now  something  interesting  happens:  figure  m/5 
is  identical  to  figure  m/1.  After  two  reflections,  the  pulse  has  been 
inverted  twice  and  has  changed  direction  twice.  It  is  now  back  where 
it  started.  The  motion  is  periodic.  This  is  why  a  guitar  produces 
sounds  that  have  a  definite  sensation  of  pitch. 

self-check  F 

Notice  that  from  m/1  to  m/5,  the  pulse  has  passed  by  every  point  on  the 
string  exactly  twice.  This  means  that  the  total  distance  it  has  traveled 
equals  2  L,  where  L  is  the  length  of  the  string.  Given  this  fact,  what  are 
the  period  and  frequency  of  the  sound  it  produces,  expressed  in  terms 
of  L  and  v,  the  velocity  of  the  wave?  >  Answer,  p.  964 

Note  that  if  the  waves  on  the  string  obey  the  principle  of  super¬ 
position,  then  the  velocity  must  be  independent  of  amplitude,  and 
the  guitar  will  produce  the  same  pitch  regardless  of  whether  it  is 
played  loudly  or  softly.  In  reality,  waves  on  a  string  obey  the  prin¬ 
ciple  of  superposition  approximately,  but  not  exactly.  The  guitar, 
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o/The  period  of  this  double¬ 
pulse  pattern  is  half  of  what  we’d 
otherwise  expect. 


p  /  Any  wave  can  be  made 
by  superposing  sine  waves. 
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q  /  Graphs  of  loudness  ver¬ 
sus  frequency  for  the  vowel  “ah,” 
sung  as  three  different  musical 
notes.  G  is  consonant  with  D, 
since  every  overtone  of  G  that 
is  close  to  an  overtone  of  D 
(marked  “*”)  is  at  exactly  the 
same  frequency.  G  and  Cft  are 
dissonant  together,  since  some 
of  the  overtones  of  G  (marked 
“x”)  are  close  to,  but  not  right  on 
top  of,  those  of  C(t- 


like  just  about  any  acoustic  instrument,  is  a  little  out  of  tune  when 
played  loudly.  (The  effect  is  more  pronounced  for  wind  instruments 
than  for  strings,  but  wind  players  are  able  to  compensate  for  it.) 

Now  there  is  only  one  hole  in  our  reasoning.  Suppose  we  some¬ 
how  arrange  to  have  an  initial  setup  consisting  of  two  identical  pulses 
heading  toward  each  other,  as  in  figure  (g).  They  will  pass  through 
each  other,  undergo  a  single  inverting  reflection,  and  come  back  to 
a  configuration  in  which  their  positions  have  been  exactly  inter¬ 
changed.  This  means  that  the  period  of  vibration  is  half  as  long. 
The  frequency  is  twice  as  high. 

This  might  seem  like  a  purely  academic  possibility,  since  nobody 
actually  plays  the  guitar  with  two  picks  at  once!  But  in  fact  it  is  an 
example  of  a  very  general  fact  about  waves  that  are  bounded  on  both 
sides.  A  mathematical  theorem  called  Fourier’s  theorem  states  that 
any  wave  can  be  created  by  superposing  sine  waves.  Figure  p  shows 
how  even  by  using  only  four  sine  waves  with  appropriately  chosen 
amplitudes,  we  can  arrive  at  a  sum  which  is  a  decent  approximation 
to  the  realistic  triangular  shape  of  a  guitar  string  being  plucked. 
The  one- hump  wave,  in  which  half  a  wavelength  fits  on  the  string, 
will  behave  like  the  single  pulse  we  originally  discussed.  We  call 
its  frequency  fD.  The  two- hump  wave,  with  one  whole  wavelength, 
is  very  much  like  the  two-pulse  example.  For  the  reasons  discussed 
above,  its  frequency  is  2  f0.  Similarly,  the  three- hump  and  four- hump 
waves  have  frequencies  of  3 f0  and  4 fQ. 

Theoretically  we  would  need  to  add  together  infinitely  many 
such  wave  patterns  to  describe  the  initial  triangular  shape  of  the 
string  exactly,  although  the  amplitudes  required  for  the  very  high 
frequency  parts  would  be  very  small,  and  an  excellent  approximation 
could  be  achieved  with  as  few  as  ten  waves. 

We  thus  arrive  at  the  following  very  general  conclusion.  When¬ 
ever  a  wave  pattern  exists  in  a  medium  bounded  on  both  sides  by 
media  in  which  the  wave  speed  is  very  different,  the  motion  can  be 
broken  down  into  the  motion  of  a  (theoretically  infinite)  series  of  sine 
waves,  with  frequencies  /Q,  2/0,  3/0,  . . .  Except  for  some  technical 
details,  to  be  discussed  below,  this  analysis  applies  to  a  vast  range  of 
sound-producing  systems,  including  the  air  column  within  the  hu¬ 
man  vocal  tract.  Because  sounds  composed  of  this  kind  of  pattern 
of  frequencies  are  so  common,  our  ear-brain  system  has  evolved  so 
as  to  perceive  them  as  a  single,  fused  sensation  of  tone. 

Musical  applications 

Many  musicians  claim  to  be  able  to  pick  out  by  ear  several  of  the 
frequencies  2/0,  3/0,  ...,  called  overtones  or  harmonics  of  the  funda¬ 
mental  /0,  but  they  are  kidding  themselves.  In  reality,  the  overtone 
series  has  two  important  roles  in  music,  neither  of  which  depends 
on  this  fictitious  ability  to  “hear  out”  the  individual  overtones. 


376 


Chapter  6  Waves 


s  /  Standing  waves  on  a  rope.  (PSSC  Physics.) 


First,  the  relative  strengths  of  the  overtones  is  an  important 
part  of  the  personality  of  a  sound,  called  its  timbre  (rhymes  with 
“amber”).  The  characteristic  tone  of  the  brass  instruments,  for  ex¬ 
ample,  is  a  sound  that  starts  out  with  a  very  strong  harmonic  series 
extending  up  to  very  high  frequencies,  but  whose  higher  harmonics 
die  down  drastically  as  the  attack  changes  to  the  sustained  portion 
of  the  note. 

Second,  although  the  ear  cannot  separate  the  individual  harmon¬ 
ics  of  a  single  musical  tone,  it  is  very  sensitive  to  clashes  between 
the  overtones  of  notes  played  simultaneously,  i.e.,  in  harmony.  We 
tend  to  perceive  a  combination  of  notes  as  being  dissonant  if  they 
have  overtones  that  are  close  but  not  the  same.  Roughly  speaking, 
strong  overtones  whose  frequencies  differ  by  more  than  1%  and  less 
than  10%  cause  the  notes  to  sound  dissonant.  It  is  important  to  re¬ 
alize  that  the  term  “dissonance”  is  not  a  negative  one  in  music.  No 
matter  how  long  you  search  the  radio  dial,  you  will  never  hear  more 
than  three  seconds  of  music  without  at  least  one  dissonant  combi¬ 
nation  of  notes.  Dissonance  is  a  necessary  ingredient  in  the  creation 
of  a  musical  cycle  of  tension  and  release.  Musically  knowledgeable 
people  do  not  usually  use  the  word  “dissonant”  as  a  criticism  of  mu¬ 
sic,  and  if  they  do,  what  they  are  really  saying  is  that  the  dissonance 
has  been  used  in  a  clumsy  way,  or  without  providing  any  contrast 
between  dissonance  and  consonance. 


r  /  If  you  take  a  sine  wave 
and  make  a  copy  of  it  shifted 
over,  their  sum  is  still  a  sine  wave. 
The  same  is  not  true  for  a  square 
wave. 
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t  /  Surprisingly,  sound  waves 
undergo  partial  reflection  at  the 
open  ends  of  tubes  as  well  as 
closed  ones. 


u  /  Graphs  of  excess  density 
versus  position  for  the  lowest- 
frequency  standing  waves  of 
three  types  of  air  columns.  Points 
on  the  axis  have  normal  air 
density. 


Standing  waves 

Figure  s  shows  sinusoidal  wave  patterns  made  by  shaking  a  rope. 
I  used  to  enjoy  doing  this  at  the  bank  with  the  pens  on  chains,  back 
in  the  days  when  people  actually  went  to  the  bank.  You  might  think 
that  I  and  the  person  in  the  photos  had  to  practice  for  a  long  time 
in  order  to  get  such  nice  sine  waves.  In  fact,  a  sine  wave  is  the  only 
shape  that  can  create  this  kind  of  wave  pattern,  called  a  standing 
wave,  which  simply  vibrates  back  and  forth  in  one  place  without 
moving.  The  sine  wave  just  creates  itself  automatically  when  you 
find  the  right  frequency,  because  no  other  shape  is  possible. 

If  you  think  about  it,  it’s  not  even  obvious  that  sine  waves  should 
be  able  to  do  this  trick.  After  all,  waves  are  supposed  to  travel  at  a 
set  speed,  aren’t  they?  The  speed  isn’t  supposed  to  be  zero!  Well,  we 
can  actually  think  of  a  standing  wave  as  a  superposition  of  a  moving 
sine  wave  with  its  own  reflection,  which  is  moving  the  opposite  way. 
Sine  waves  have  the  unique  mathematical  property  that  the  sum  of 
sine  waves  of  equal  wavelength  is  simply  a  new  sine  wave  with  the 
same  wavelength.  As  the  two  sine  waves  go  back  and  forth,  they 
always  cancel  perfectly  at  the  ends,  and  their  sum  appears  to  stand 
still. 

Standing  wave  patterns  are  rather  important,  since  atoms  are 
really  standing-wave  patterns  of  electron  waves.  You  are  a  standing 
wave! 

Standing-wave  patterns  of  air  columns 

The  air  column  inside  a  wind  instrument  behaves  very  much 
like  the  wave-on-a-string  example  we’ve  been  concentrating  on  so 
far,  the  main  difference  being  that  we  may  have  either  inverting  or 
noninverting  reflections  at  the  ends. 

Some  organ  pipes  are  closed  at  both  ends.  The  speed  of  sound 
is  different  in  metal  than  in  air,  so  there  is  a  strong  reflection  at 
the  closed  ends,  and  we  can  have  standing  waves.  These  reflections 
are  both  density-noninverting,  so  we  get  symmetric  standing-wave 
patterns,  such  as  the  one  shown  in  figure  u/1. 

Figure  t  shows  the  sound  waves  in  and  around  a  bamboo  Japanese 
flute  called  a  shakuhachi,  which  is  open  at  both  ends  of  the  air  col¬ 
umn.  We  can  only  have  a  standing  wave  pattern  if  there  are  re¬ 
flections  at  the  ends,  but  that  is  very  counterintuitive  —  why  is 
there  any  reflection  at  all,  if  the  sound  wave  is  free  to  emerge  into 
open  space,  and  there  is  no  change  in  medium?  Recall  the  reason 
why  we  got  reflections  at  a  change  in  medium:  because  the  wave¬ 
length  changes,  so  the  wave  has  to  readjust  itself  from  one  pattern 
to  another,  and  the  only  way  it  can  do  that  without  developing  a 
kink  is  if  there  is  a  reflection.  Something  similar  is  happening  here. 
The  only  difference  is  that  the  wave  is  adjusting  from  being  a  plane 
wave  to  being  a  spherical  wave.  The  reflections  at  the  open  ends 
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are  density-inverting,  u/2,  so  the  wave  pattern  is  pinched  off  at  the 
ends.  Comparing  panels  1  and  2  of  the  figure,  we  see  that  although 
the  wave  pattens  are  different,  in  both  cases  the  wavelength  is  the 
same:  in  the  lowest-frequency  standing  wave,  half  a  wavelength  fits 
inside  the  tube.  Thus,  it  isn’t  necessary  to  memorize  which  type  of 
reflection  is  inverting  and  which  is  uninverting.  It’s  only  necessary 
to  know  that  the  tubes  are  symmetric. 

Finally,  we  can  have  an  asymmetric  tube:  closed  at  one  end  and 
open  at  the  other.  A  common  example  is  the  pan  pipes,  v,  which  are 
closed  at  the  bottom  and  open  at  the  top.  The  standing  wave  with 
the  lowest  frequency  is  therefore  one  in  which  1/4  of  a  wavelength 
fits  along  the  length  of  the  tube,  as  shown  in  figure  u/3. 

Sometimes  an  instrument’s  physical  appearance  can  be  mislead¬ 
ing.  A  concert  flute,  w,  is  closed  at  the  mouth  end  and  open  at 
the  other,  so  we  would  expect  it  to  behave  like  an  asymmetric  air 
column;  in  reality,  it  behaves  like  a  symmetric  air  column  open  at 
both  ends,  because  the  embouchure  hole  (the  hole  the  player  blows 
over)  acts  like  an  open  end.  The  clarinet  and  the  saxophone  look 
similar,  having  a  mouthpiece  and  reed  at  one  end  and  an  open  end 
at  the  other,  but  they  act  different.  In  fact  the  clarinet’s  air  col¬ 
umn  has  patterns  of  vibration  that  are  asymmetric,  the  saxophone 
symmetric.  The  discrepancy  comes  from  the  difference  between  the 
conical  tube  of  the  sax  and  the  cylindrical  tube  of  the  clarinet.  The 
adjustment  of  the  wave  pattern  from  a  plane  wave  to  a  spherical 
wave  is  more  gradual  at  the  flaring  bell  of  the  saxophone. 

self-check  G 

Draw  a  graph  of  pressure  versus  position  for  the  first  overtone  of  the  air 
column  in  a  tube  open  at  one  end  and  closed  at  the  other.  This  will  be 
the  next-to-longest  possible  wavelength  that  allows  for  a  point  of  maxi¬ 
mum  vibration  at  one  end  and  a  point  of  no  vibration  at  the  other.  How 
many  times  shorter  will  its  wavelength  be  compared  to  the  frequency 
of  the  lowest-frequency  standing  wave,  shown  in  the  figure?  Based  on 
this,  how  many  times  greater  will  its  frequency  be?  i>  Answer,  p.  964 

The  speed  of  sound  example  13 

We  can  get  a  rough  and  ready  derivation  of  the  equation  for  the 
speed  of  sound  by  analyzing  the  standing  waves  in  a  cylindrical 
air  column  as  a  special  type  of  Helmholtz  resonance  (example 
25  on  page  336),  in  which  the  cavity  happens  to  have  the  same 
cross-sectional  area  as  the  neck.  Roughly  speaking,  the  regions 
of  maximum  density  variation  act  like  the  cavity.  The  regions  of 
minimum  density  variation,  on  the  other  hand,  are  the  places 
where  the  velocity  of  the  air  is  varying  the  most;  these  regions 
throttle  back  the  speed  of  the  vibration,  because  of  the  inertia  of 
the  moving  air.  If  the  cylinder  has  cross-sectional  area  A,  then  the 
“cavity”  and  “neck”  parts  of  the  wave  both  have  lengths  of  some¬ 
thing  like  A/2,  and  the  volume  of  the  “cavity”  is  about  A\/2.  We 
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v  /  A  pan  pipe  is  an  asym¬ 
metric  air  column,  open  at  the  top 
and  closed  at  the  bottom. 


w  /  A  concert  flute  looks  like 
an  asymmetric  air  column,  open 
at  the  mouth  end  and  closed  at 
the  other.  However,  its  patterns  of 
vibration  are  symmetric,  because 
the  embouchure  hole  acts  like  an 
open  end. 
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get  v  =  f\  =  {..  .)\/yP0/p,  where  the  factor  (. . .)  represents  nu¬ 
merical  stuff  that  we  can’t  possibly  hope  to  have  gotten  right  with 
such  a  crude  argument.  The  correct  result  is  in  fact  v  =  y/yP0/p- 
Isaac  Newton  attempted  the  same  calculation,  but  didn’t  under¬ 
stand  the  thermodynamic  effects  involved,  and  therefore  got  a 
result  that  didn’t  have  the  correct  factor  of  y/y. 

This  chapter  is  summarized  on  page  996.  Notation  and  terminology 
are  tabulated  on  pages  984-985. 
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Problems 


The  symbols  - ,  etc.  are  explained  on  page  384. 

1  The  musical  note  middle  C  has  a  frequency  of  262  Hz.  What 
are  its  period  and  wavelength? 

2  The  following  is  a  graph  of  the  height  of  a  water  wave  as  a 
function  of  position,  at  a  certain  moment  in  time. 


Trace  this  graph  onto  another  piece  of  paper,  and  then  sketch 
below  it  the  corresponding  graphs  that  would  be  obtained  if 

(a)  the  amplitude  and  frequency  were  doubled  while  the  velocity 
remained  the  same; 

(b)  the  frequency  and  velocity  were  both  doubled  while  the  am¬ 
plitude  remained  unchanged; 

(c)  the  wavelength  and  amplitude  were  reduced  by  a  factor  of 
three  while  the  velocity  was  doubled. 

Explain  all  your  answers.  [Problem  by  Arnold  Arons.] 

3  (a)  The  graph  shows  the  height  of  a  water  wave  pulse  as  a 
function  of  position.  Draw  a  graph  of  height  as  a  function  of  time 
for  a  specific  point  on  the  water.  Assume  the  pulse  is  traveling  to 
the  right. 

(b)  Repeat  part  a,  but  assume  the  pulse  is  traveling  to  the  left. 

(c)  Now  assume  the  original  graph  was  of  height  as  a  function  of 
time,  and  draw  a  graph  of  height  as  a  function  of  position,  assuming 
the  pulse  is  traveling  to  the  right. 

(d)  Repeat  part  c,  but  assume  the  pulse  is  traveling  to  the  left. 
Explain  all  your  answers.  [Problem  by  Arnold  Arons.] 

4  At  a  particular  moment  in  time,  a  wave  on  a  string  has  a  shape 
described  by  y  =  3.5  cos(0.737nc  +  0.457rt  +  0.377t).  The  stuff  inside 
the  cosine  is  in  radians.  Assume  that  the  units  of  the  numerical 
constants  are  such  that  x,  y,  and  t  are  in  SI  units.  >  Hint,  p.  960 

(a)  Is  the  wave  moving  in  the  positive  x  or  the  negative  x  direction? 

(b)  Find  the  wave’s  period,  frequency,  wavelength. 

(c)  Find  the  wave’s  velocity. 

(d)  Find  the  maximum  velocity  of  any  point  on  the  string,  and 
compare  with  the  magnitude  and  direction  of  the  wave’s  velocity. 

V  I 


Problem  3 
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Problem  5. 


Problem  8. 


5  The  figure  shows  one  wavelength  of  a  steady  sinusoidal  wave 
traveling  to  the  right  along  a  string.  Define  a  coordinate  system 
in  which  the  positive  x  axis  points  to  the  right  and  the  positive  y 
axis  up,  such  that  the  flattened  string  would  have  y  =  0.  Copy 
the  figure,  and  label  with  y  =  0  all  the  appropriate  parts  of  the 
string.  Similarly,  label  with  v  =  (1  all  parts  of  the  string  whose 
velocities  are  zero,  and  with  a  =  0  all  parts  whose  accelerations 
are  zero.  There  is  more  than  one  point  whose  velocity  is  of  the 
greatest  magnitude.  Pick  one  of  these,  and  indicate  the  direction  of 
its  velocity  vector.  Do  the  same  for  a  point  having  the  maximum 
magnitude  of  acceleration.  Explain  all  your  answers. 

[Problem  by  Arnold  Arons.]  ■ 

6  (a)  Find  an  equation  for  the  relationship  between  the  Doppler- 
shifted  frequency  of  a  wave  and  the  frequency  of  the  original  wave, 
for  the  case  of  a  stationary  observer  and  a  source  moving  directly 
toward  or  away  from  the  observer. 

(b)  Check  that  the  units  of  your  answer  make  sense. 

(c)  Check  that  the  dependence  on  vs  makes  sense. 

7  Suggest  a  quantitative  experiment  to  look  for  any  deviation 
from  the  principle  of  superposition  for  surface  waves  in  water.  Try 
to  make  your  experiment  simple  and  practical. 

8  The  simplest  trick  with  a  lasso  is  to  spin  a  flat  loop  in  a 
horizontal  plane.  The  whirling  loop  of  a  lasso  is  kept  under  tension 
mainly  due  to  its  own  rotation.  Although  the  spoke’s  force  on  the 
loop  has  an  inward  component,  we’ll  ignore  it.  The  purpose  of  this 
problem,  which  is  based  on  one  by  A.P.  French,  is  to  prove  a  cute 
fact  about  wave  disturbances  moving  around  the  loop.  As  far  as  I 
know,  this  fact  has  no  practical  implications  for  trick  roping!  Let  the 
loop  have  radius  r  and  mass  per  unit  length  //,  and  let  its  angular 
velocity  be  ui. 

(a)  Find  the  tension,  T,  in  the  loop  in  terms  of  r,  /j,  and  oj.  Assume 
the  loop  is  a  perfect  circle,  with  no  wave  disturbances  on  it  yet. 

>  Hint,  p.  960  >  Answer,  p.  970 

(b)  Find  the  velocity  of  a  wave  pulse  traveling  around  the  loop. 
Discuss  what  happens  when  the  pulse  moves  is  in  the  same  direction 
as  the  rotation,  and  when  it  travels  contrary  to  the  rotation.  ^ 

9  A  string  hangs  vertically,  free  at  the  bottom  and  attached  at 
the  top. 

(a)  Find  the  velocity  of  waves  on  the  string  as  a  function  of  the 
distance  from  the  bottom 

(b)  Find  the  acceleration  of  waves  on  the  string.  >  Answer,  p.  970 

(c)  Interpret  your  answers  to  parts  a  and  b  for  the  case  where  a 

pulse  comes  down  and  reaches  the  end  of  the  string.  What  happens 
next?  Check  your  answer  against  experiment  and  conservation  of 
energy.  ■ 
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10  Singing  that  is  off-pitch  by  more  than  about  1%  sounds  bad. 
How  fast  would  a  singer  have  to  be  moving  relative  to  the  rest  of 
a  band  to  make  this  much  of  a  change  in  pitch  due  to  the  Doppler 
effect? 


11  Light  travels  faster  in  warmer  air.  Use  this  fact  to  explain 
the  formation  of  a  mirage  appearing  like  the  shiny  surface  of  a  pool 
of  water  when  there  is  a  layer  of  hot  air  above  a  road. 


12  (a)  Compute  the  amplitude  of  light  that  is  reflected  back 

into  air  at  an  air-water  interface,  relative  to  the  amplitude  of  the 
incident  wave.  Assume  that  the  light  arrives  in  the  direction  directly 
perpendicular  to  the  surface. The  speeds  of  light  in  air  and  water  are 
3.0  x  108  and  2.2  x  108  m/s,  respectively. 

(b)  Find  the  energy  of  the  reflected  wave  as  a  fraction  of  the  incident 
energy.  >  Hint,  p.  960  V 


13  A  concert  flute  produces  its  lowest  note,  at  about  262  Hz, 
when  half  of  a  wavelength  fits  inside  its  tube.  Compute  the  length 
of  the  flute.  >  Answer,  p.  970  - 


14  (a)  A  good  tenor  saxophone  player  can  play  all  of  the  fol¬ 

lowing  notes  without  changing  her  fingering,  simply  by  altering  the 
tightness  of  her  lips:  Eb  (150  Hz),  Eb  (300  Hz),  Bb  (450  Hz),  and 
Eb  (600  Hz).  How  is  this  possible?  (I’m  not  asking  you  to  analyze 
the  coupling  between  the  lips,  the  reed,  the  mouthpiece,  and  the  air 
column,  which  is  very  complicated.) 

(b)  Some  saxophone  players  are  known  for  their  ability  to  use  this 


technique  to  play  “freak  notes,”  i.e.,  notes  above  the  normal  range 

C 

261.6  Hz 

of  the  instrument.  Why  isn’t  it  possible  to  play  notes  below  the 

D 

293.7 

normal  range  using  this  technique?  ■ 

E 

329.6 

F 

349.2 

G 

392.0 

A 

440.0 

15  The  table  gives  the  frequencies  of  the  notes  that  make  up 

the  key  of  F  major,  starting  from  middle  C  and  going  up  through 

Bb 

466.2 

all  seven  notes. 

Problem  15. 

(a)  Calculate  the  first  four  or  five  harmonics  of  C  and  G,  and  deter¬ 
mine  whether  these  two  notes  will  be  consonant  or  dissonant.  (Re¬ 
call  that  harmonics  that  differ  by  about  1-10%  cause  dissonance.) 

(b)  Do  the  same  for  C  and  Bb.  - 


Problems 


383 


Problem  16. 


16  A  Fabry-Perot  interferometer,  shown  in  the  figure  being  used 
to  measure  the  diameter  of  a  thin  filament,  consists  of  two  glass 
plates  with  an  air  gap  between  them.  As  the  top  plate  is  moved 
up  or  down  with  a  screw,  the  light  passing  through  the  plates  goes 
through  a  cycle  of  constructive  and  destructive  interference,  which  is 
mainly  due  to  interference  between  rays  that  pass  straight  through 
and  those  that  are  reflected  twice  back  into  the  air  gap.  (Although 
the  dimensions  in  this  drawing  are  distorted  for  legibility,  the  glass 
plates  would  really  be  much  thicker  than  the  length  of  the  wave- 
trains  of  light,  so  no  interference  effects  would  be  observed  due  to 
reflections  within  the  glass.) 

(a)  If  the  top  plate  is  cranked  down  so  that  the  thickness,  d,  of 
the  air  gap  is  much  less  than  the  wavelength  A  of  the  light,  i.e.,  in 
the  limit  d  — >  0,  what  is  the  phase  relationship  between  the  two 
rays?  (Recall  that  the  phase  can  be  inverted  by  a  reflection.)  Is  the 
interference  constructive,  or  destructive? 

(b)  If  d  is  now  slowly  increased,  what  is  the  first  value  of  d  for  which 
the  interference  is  the  same  as  at  d  — >  0?  Express  your  answer  in 
terms  of  A. 

(c)  Suppose  the  apparatus  is  first  set  up  as  shown  in  the  figure.  The 
filament  is  then  removed,  and  n  cycles  of  brightening  and  dimming 
are  counted  while  the  top  plate  is  brought  down  to  d  =  0.  What  is 
the  thickness  of  the  filament,  in  terms  of  n  and  A? 

Based  on  a  problem  by  D.J.  Raymond. 

17  (a)  A  wave  pulse  moves  into  a  new  medium,  where  its  ve¬ 

locity  is  greater  by  a  factor  a.  Find  an  expression  for  the  fraction, 
/,  of  the  wave  energy  that  is  transmitted,  in  terms  of  a.  Note  that, 
as  discussed  in  the  text,  you  cannot  simply  find  /  by  squaring  the 
amplitude  of  the  transmitted  wave.  >  Answer,  p.  970 

(b)  Suppose  we  wish  to  transmit  a  pulse  from  one  medium  to  an¬ 
other,  maximizing  the  fraction  of  the  wave  energy  transmitted.  To 
do  so,  we  sandwich  another  layer  in  between  them,  so  that  the  wave 
moves  from  the  initial  medium,  where  its  velocity  is  v±,  through  the 
intermediate  layer,  where  it  is  r>2,  and  on  into  the  final  layer,  where 
it  becomes  v^.  What  is  the  optimal  value  of  V2  ?  (Assume  that  the 
middle  layer  is  thicker  than  the  length  of  the  pulse,  so  there  are  no 
interference  effects.  Also,  although  there  will  be  later  echoes  that 
are  transmitted  after  multiple  reflections  back  and  forth  across  the 
middle  layer,  you  are  only  to  optimize  the  strength  of  the  transmit¬ 
ted  pulse  that  is  first  to  emerge.  In  other  words,  it’s  simply  a  matter 
of  applying  your  answer  from  part  a  twice  to  find  the  amount  that 
finally  gets  through.)  >  Answer,  p.  970  ^ 


Key  to  symbols: 

easy  -  typical  fi  challenging  ■  difficult  B  very  difficult 
V  An  answer  check  is  available  at  www.lightandmatter.com. 
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Chapter  7 

Relativity 


7.1  Time  Is  Not  Absolute 

When  Einstein  first  began  to  develop  the  theory  of  relativity,  around 
1905,  the  only  real-world  observations  he  could  draw  on  were  am¬ 
biguous  and  indirect.  Today,  the  evidence  is  part  of  everyday  life. 
For  example,  every  time  you  use  a  GPS  receiver,  a,  you’re  using 
Einstein’s  theory  of  relativity.  Somewhere  between  1905  and  today, 
technology  became  good  enough  to  allow  conceptually  simple  ex¬ 
periments  that  students  in  the  early  20th  century  could  only  discuss 
in  terms  like  “Imagine  that  we  could. . .  ”  A  good  jumping-on  point 
is  1971.  In  that  year,  J.C.  Hafele  and  R.E.  Keating  brought  atomic 
clocks  aboard  commercial  airliners,  b,  and  went  around  the  world, 
once  from  east  to  west  and  once  from  west  to  east.  Hafele  and 
Keating  observed  that  there  was  a  discrepancy  between  the  times 
measured  by  the  traveling  clocks  and  the  times  measured  by  similar 
clocks  that  stayed  home  at  the  U.S.  Naval  Observatory  in  Wash¬ 
ington.  The  east-going  clock  lost  time,  ending  up  off  by  —59  ±  10 
nanoseconds,  while  the  west-going  one  gained  273  ±  7  ns. 

7.1.1  The  correspondence  principle 

This  establishes  that  time  doesn’t  work  the  way  Newton  believed 
it  did  when  he  wrote  that  “Absolute,  true,  and  mathematical  time, 
of  itself,  and  from  its  own  nature  flows  equably  without  regard  to 
anything  external. . .  ”  We  are  used  to  thinking  of  time  as  absolute 
and  universal,  so  it  is  disturbing  to  find  that  it  can  flow  at  a  different 
rate  for  observers  in  different  frames  of  reference.  Nevertheless,  the 
effects  that  Hafele  and  Keating  observed  were  small.  This  makes 
sense:  Newton’s  laws  have  already  been  thoroughly  tested  by  ex¬ 
periments  under  a  wide  variety  of  conditions,  so  a  new  theory  like 
relativity  must  agree  with  Newton’s  to  a  good  approximation,  within 
the  Newtonian  theory’s  realm  of  applicability.  This  requirement  of 
backward-compatibility  is  known  as  the  correspondence  principle. 

7.1.2  Causality 

It’s  also  reassuring  that  the  effects  on  time  were  small  compared 
to  the  three-day  lengths  of  the  plane  trips.  There  was  therefore  no 
opportunity  for  paradoxical  scenarios  such  as  one  in  which  the  east¬ 
going  experimenter  arrived  back  in  Washington  before  he  left  and 
then  convinced  himself  not  to  take  the  trip.  A  theory  that  maintains 


a /This  Global  Positioning 
System  (GPS)  system,  running 
on  a  smartphone  attached  to  a 
bike’s  handlebar,  depends  on 
Einstein’s  theory  of  relativity. 
Time  flows  at  a  different  rate 
aboard  a  GPS  satellite  than  it 
does  on  the  bike,  and  the  GPS 
software  has  to  take  this  into 
account. 


b  /  The  clock  took  up  two  seats, 
and  two  tickets  were  bought  for  it 
under  the  name  of  “Mr.  Clock.” 
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c/ Newton’s  laws  do  not  dis¬ 
tinguish  past  from  future.  The 
football  could  travel  in  either 
direction  while  obeying  Newton’s 
laws. 


T 


d  /  All  three  clocks  are  mov¬ 
ing  to  the  east.  Even  though  the 
west-going  plane  is  moving  to  the 
west  relative  to  the  air,  the  air 
is  moving  to  the  east  due  to  the 
earth’s  rotation. 


this  kind  of  orderly  relationship  between  cause  and  effect  is  said  to 
satisfy  causality. 

Causality  is  like  a  water-hungry  front-yard  lawn  in  Los  Angeles: 
we  know  we  want  it,  but  it’s  not  easy  to  explain  why.  Even  in  plain 
old  Newtonian  physics,  there  is  no  clear  distinction  between  past 
and  future.  In  figure  c,  number  18  throws  the  football  to  number 
25,  and  the  ball  obeys  Newton’s  laws  of  motion.  If  we  took  a  video 
of  the  pass  and  played  it  backward,  we  would  see  the  ball  flying  from 
25  to  18,  and  Newton’s  laws  would  still  be  satisfied.  Nevertheless, 
we  have  a  strong  psychological  impression  that  there  is  a  forward 
arrow  of  time.  I  can  remember  what  the  stock  market  did  last  year, 
but  I  can’t  remember  what  it  will  do  next  year.  Joan  of  Arc’s  mil¬ 
itary  victories  against  England  caused  the  English  to  burn  her  at 
the  stake;  it’s  hard  to  accept  that  Newton’s  laws  provide  an  equally 
good  description  of  a  process  in  which  her  execution  in  1431  caused 
her  to  win  a  battle  in  1429.  There  is  no  consensus  at  this  point 
among  physicists  on  the  origin  and  significance  of  time’s  arrow,  and 
for  our  present  purposes  we  don’t  need  to  solve  this  mystery.  In¬ 
stead,  we  merely  note  the  empirical  fact  that,  regardless  of  what 
causality  really  means  and  where  it  really  comes  from,  its  behavior 
is  consistent.  Specifically,  experiments  show  that  if  an  observer  in  a 
certain  frame  of  reference  observes  that  event  A  causes  event  B,  then 
observers  in  other  frames  agree  that  A  causes  B,  not  the  other  way 
around.  This  is  merely  a  generalization  about  a  large  body  of  ex¬ 
perimental  results,  not  a  logically  necessary  assumption.  If  Keating 
had  gone  around  the  world  and  arrived  back  in  Washington  before 
he  left,  it  would  have  disproved  this  statement  about  causality. 

7.1.3  Time  distortion  arising  from  motion  and  gravity 

Hafele  and  Keating  were  testing  specific  quantitative  predictions 
of  relativity,  and  they  verified  them  to  within  their  experiment’s 
error  bars.  Let’s  work  backward  instead,  and  inspect  the  empirical 
results  for  clues  as  to  how  time  works. 

The  two  traveling  clocks  experienced  effects  in  opposite  direc¬ 
tions,  and  this  suggests  that  the  rate  at  which  time  flows  depends 
on  the  motion  of  the  observer.  The  east-going  clock  was  moving  in 
the  same  direction  as  the  earth’s  rotation,  so  its  velocity  relative  to 
the  earth’s  center  was  greater  than  that  of  the  clock  that  remained 
in  Washington,  while  the  west-going  clock’s  velocity  was  correspond¬ 
ingly  reduced.  The  fact  that  the  east-going  clock  fell  behind,  and 
the  west-going  one  got  ahead,  shows  that  the  effect  of  motion  is  to 
make  time  go  more  slowly.  This  effect  of  motion  on  time  was  pre¬ 
dicted  by  Einstein  in  his  original  1905  paper  on  relativity,  written 
when  he  was  26. 

If  this  had  been  the  only  effect  in  the  Hafele-Keating  experiment, 
then  we  would  have  expected  to  see  effects  on  the  two  flying  clocks 
that  were  equal  in  size.  Making  up  some  simple  numbers  to  keep  the 
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arithmetic  transparent,  suppose  that  the  earth  rotates  from  west  to 
east  at  1000  km/hr,  and  that  the  planes  fly  at  300  km/hr.  Then  the 
speed  of  the  clock  on  the  ground  is  1000  km/hr,  the  speed  of  the 
clock  on  the  east-going  plane  is  1300  km/hr,  and  that  of  the  west¬ 
going  clock  700  km/hr.  Since  the  speeds  of  700,  1000,  and  1300 
km/hr  have  equal  spacing  on  either  side  of  1000,  we  would  expect 
the  discrepancies  of  the  moving  clocks  relative  to  the  one  in  the  lab 
to  be  equal  in  size  but  opposite  in  sign. 
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e  /  A  graph  showing  the  time  difference  between  two  atomic  clocks. 
One  clock  was  kept  at  Mitaka  Observatory,  at  58  m  above  sea  level. 
The  other  was  moved  back  and  forth  to  a  second  observatory,  Norikura 
Corona  Station,  at  the  peak  of  the  Norikura  volcano,  2876  m  above  sea 
level.  The  plateaus  on  the  graph  are  data  from  the  periods  when  the 
clocks  were  compared  side  by  side  at  Mitaka.  The  difference  between 
one  plateau  and  the  next  shows  a  gravitational  effect  on  the  rate  of  flow 
of  time,  accumulated  during  the  period  when  the  mobile  clock  was  at  the 
top  of  Norikura.  Cf.  problem  25,  p.  448. 
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In  fact,  the  two  effects  are  unequal  in  size:  —59  ns  and  273  ns. 
This  implies  that  there  is  a  second  effect  involved,  simply  due  to 
the  planes’  being  up  in  the  air.  This  was  verified  more  directly 
in  a  1978  experiment  by  Iijima  and  Fujiwara,  figure  e,  in  which 
identical  atomic  clocks  were  kept  at  rest  at  the  top  and  bottom  of  a 
mountain  near  Tokyo.  This  experiment,  unlike  the  Hafele-Keating 
one,  isolates  one  effect  on  time,  the  gravitational  one:  time’s  rate 
of  flow  increases  with  height  in  a  gravitational  field.  Einstein  didn’t 
figure  out  how  to  incorporate  gravity  into  relativity  until  1915,  after 
much  frustration  and  many  false  starts.  The  simpler  version  of  the 
theory  without  gravity  is  known  as  special  relativity,  the  full  version 
as  general  relativity.  We’ll  restrict  ourselves  to  special  relativity 
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f  /  The  correspondence  prin¬ 
ciple  requires  that  the  relativistic 
distortion  of  time  become  small 
for  small  velocities. 


until  section  7.4,  and  that  means  that  what  we  want  to  focus  on 
right  now  is  the  distortion  of  time  due  to  motion,  not  gravity. 

We  can  now  see  in  more  detail  how  to  apply  the  correspondence 
principle.  The  behavior  of  the  three  clocks  in  the  Hafele-Keating 
experiment  shows  that  the  amount  of  time  distortion  increases  as 
the  speed  of  the  clock’s  motion  increases.  Newton  lived  in  an  era 
when  the  fastest  mode  of  transportation  was  a  galloping  horse,  and 
the  best  pendulum  clocks  would  accumulate  errors  of  perhaps  a 
minute  over  the  course  of  several  days.  A  horse  is  much  slower 
than  a  jet  plane,  so  the  distortion  of  time  would  have  had  a  relative 
size  of  only  ~  10“ 15  —  much  smaller  than  the  clocks  were  capable 
of  detecting.  At  the  speed  of  a  passenger  jet,  the  effect  is  about 
1CH12,  and  state-of-the-art  atomic  clocks  in  1971  were  capable  of 
measuring  that.  A  GPS  satellite  travels  much  faster  than  a  jet  air¬ 
plane,  and  the  effect  on  the  satellite  turns  out  to  be  10-10.  The 
general  idea  here  is  that  all  physical  laws  are  approximations,  and 
approximations  aren’t  simply  right  or  wrong  in  different  situations. 
Approximations  are  better  or  worse  in  different  situations,  and  the 
question  is  whether  a  particular  approximation  is  good  enough  in  a 
given  situation  to  serve  a  particular  purpose.  The  faster  the  motion, 
the  worse  the  Newtonian  approximation  of  absolute  time.  Whether 
the  approximation  is  good  enough  depends  on  what  you’re  trying 
to  accomplish.  The  correspondence  principle  says  that  the  approxi¬ 
mation  must  have  been  good  enough  to  explain  all  the  experiments 
done  in  the  centuries  before  Einstein  came  up  with  relativity. 

By  the  way,  don’t  get  an  inflated  idea  of  the  importance  of  the 
Hafele-Keating  experiment.  Special  relativity  had  already  been  con¬ 
firmed  by  a  vast  and  varied  body  of  experiments  decades  before  1971. 
The  only  reason  I’m  giving  such  a  prominent  role  to  this  experiment, 
which  was  actually  more  important  as  a  test  of  general  relativity,  is 
that  it  is  conceptually  very  direct. 


7.2  Distortion  of  Space  and  Time 

7.2.1  The  Lorentz  transformation 

Relativity  says  that  when  two  observers  are  in  different  frames  of 
reference,  each  observer  considers  the  other  one’s  perception  of  time 
to  be  distorted.  We’ll  also  see  that  something  similar  happens  to 
their  observations  of  distances,  so  both  space  and  time  are  distorted. 
What  exactly  is  this  distortion?  How  do  we  even  conceptualize  it? 

The  idea  isn’t  really  as  radical  as  it  might  seem  at  first.  We 
can  visualize  the  structure  of  space  and  time  using  a  graph  with 
position  and  time  on  its  axes.  These  graphs  are  familiar  by  now, 
but  we’re  going  to  look  at  them  in  a  slightly  different  way.  Before,  we 
used  them  to  describe  the  motion  of  objects.  The  grid  underlying 
the  graph  was  merely  the  stage  on  which  the  actors  played  their 
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parts.  Now  the  background  comes  to  the  foreground:  it’s  time  and 
space  themselves  that  we’re  studying.  We  don’t  necessarily  need 
to  have  a  line  or  a  curve  drawn  on  top  of  the  grid  to  represent  a 
particular  object.  We  may,  for  example,  just  want  to  talk  about 
events,  depicted  as  points  on  the  graph  as  in  figure  a.  A  distortion 
of  the  Cartesian  grid  underlying  the  graph  can  arise  for  perfectly 
ordinary  reasons  that  Isaac  Newton  would  have  readily  accepted. 
For  example,  we  can  simply  change  the  units  used  to  measure  time 
and  position,  as  in  figure  b. 

We’re  going  to  have  quite  a  few  examples  of  this  type,  so  I’ll 
adopt  the  convention  shown  in  figure  c  for  depicting  them.  Figure 
c  summarizes  the  relationship  between  figures  a  and  b  in  a  more 
compact  form.  The  gray  rectangle  represents  the  original  coordinate 
grid  of  figure  a,  while  the  grid  of  black  lines  represents  the  new 
version  from  figure  b.  Omitting  the  grid  from  the  gray  rectangle 
makes  the  diagram  easier  to  decode  visually. 

Our  goal  of  unraveling  the  mysteries  of  special  relativity  amounts 
to  nothing  more  than  finding  out  how  to  draw  a  diagram  like  c 
in  the  case  where  the  two  different  sets  of  coordinates  represent 
measurements  of  time  and  space  made  by  two  different  observers, 
each  in  motion  relative  to  the  other.  Galileo  and  Newton  thought 
they  knew  the  answer  to  this  question,  but  their  answer  turned 
out  to  be  only  approximately  right.  To  avoid  repeating  the  same 
mistakes,  we  need  to  clearly  spell  out  what  we  think  are  the  basic 
properties  of  time  and  space  that  will  be  a  reliable  foundation  for 
our  reasoning.  I  want  to  emphasize  that  there  is  no  purely  logical 
way  of  deciding  on  this  list  of  properties.  The  ones  I’ll  list  are  simply 
a  summary  of  the  patterns  observed  in  the  results  from  a  large  body 
of  experiments.  Furthermore,  some  of  them  are  only  approximate. 
For  example,  property  1  below  is  only  a  good  approximation  when 
the  gravitational  field  is  weak,  so  it  is  a  property  that  applies  to 
special  relativity,  not  to  general  relativity. 

Experiments  show  that: 

1.  No  point  in  time  or  space  has  properties  that  make  it  different 
from  any  other  point. 

2.  Likewise,  all  directions  in  space  have  the  same  properties. 

3.  Motion  is  relative,  i.e.,  all  inertial  frames  of  reference  are 
equally  valid. 

4.  Causality  holds,  in  the  sense  described  on  page  385. 

5.  Time  depends  on  the  state  of  motion  of  the  observer. 

Most  of  these  are  not  very  subversive.  Properties  1  and  2  date 
back  to  the  time  when  Galileo  and  Newton  started  applying  the 


a /Two  events  are  given  as 
points  on  a  graph  of  position 
versus  time.  Joan  of  Arc  helps  to 
restore  Charles  VII  to  the  throne. 
At  a  later  time  and  a  different 
position,  Joan  of  Arc  is  sentenced 
to  death. 


b/A  change  of  units  distorts 
an  x-t  graph.  This  graph  depicts 
exactly  the  same  events  as  figure 
a.  The  only  change  is  that  the  x 
and  t  coordinates  are  measured 
using  different  units,  so  the  grid  is 
compressed  in  t  and  expanded  in 
x. 


c  /  A  convention  we’ll  use  to 
represent  a  distortion  of  time  and 
space. 
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d/A  Galilean  version  of  the 
relationship  between  two  frames 
of  reference.  As  in  all  such 
graphs  in  this  chapter,  the  original 
coordinates,  represented  by  the 
gray  rectangle,  have  a  time  axis 
that  goes  to  the  right,  and  a 
position  axis  that  goes  straight 
up. 


same  universal  laws  of  motion  to  the  solar  system  and  to  the  earth; 
this  contradicted  Aristotle,  who  believed  that,  for  example,  a  rock 
would  naturally  want  to  move  in  a  certain  special  direction  (down) 
in  order  to  reach  a  certain  special  location  (the  earth’s  surface). 
Property  3  is  the  reason  that  Einstein  called  his  theory  “relativity,” 
but  Galileo  and  Newton  believed  exactly  the  same  thing  to  be  true, 
as  dramatized  by  Galileo’s  run-in  with  the  Church  over  the  question 
of  whether  the  earth  could  really  be  in  motion  around  the  sun. 
Property  4  would  probably  surprise  most  people  only  because  it 
asserts  in  such  a  weak  and  specialized  way  something  that  they  feel 
deeply  must  be  true.  The  only  really  strange  item  on  the  list  is  5, 
but  the  Hafele-Keating  experiment  forces  it  upon  us. 

If  it  were  not  for  property  5,  we  could  imagine  that  figure  d 
would  give  the  correct  transformation  between  frames  of  reference 
in  motion  relative  to  one  another.  Let’s  say  that  observer  1,  whose 
grid  coincides  with  the  gray  rectangle,  is  a  hitch-hiker  standing  by 
the  side  of  a  road.  Event  A  is  a  raindrop  hitting  his  head,  and 
event  B  is  another  raindrop  hitting  his  head.  He  says  that  A  and  B 
occur  at  the  same  location  in  space.  Observer  2  is  a  motorist  who 
drives  by  without  stopping;  to  him,  the  passenger  compartment  of 
his  car  is  at  rest,  while  the  asphalt  slides  by  underneath.  He  says 
that  A  and  B  occur  at  different  points  in  space,  because  during  the 
time  between  the  first  raindrop  and  the  second,  the  hitch-hiker  has 
moved  backward.  On  the  other  hand,  observer  2  says  that  events  A 
and  C  occur  in  the  same  place,  while  the  hitch-hiker  disagrees.  The 
slope  of  the  grid-lines  is  simply  the  velocity  of  the  relative  motion 
of  each  observer  relative  to  the  other. 

Figure  d  has  familiar,  comforting,  and  eminently  sensible  behav¬ 
ior,  but  it  also  happens  to  be  wrong,  because  it  violates  property 
5.  The  distortion  of  the  coordinate  grid  has  only  moved  the  vertical 
lines  up  and  down,  so  both  observers  agree  that  events  like  B  and 
C  are  simultaneous.  If  this  was  really  the  way  things  worked,  then 
all  observers  could  synchronize  all  their  clocks  with  one  another  for 
once  and  for  all,  and  the  clocks  would  never  get  out  of  sync.  This 
contradicts  the  results  of  the  Hafele-Keating  experiment,  in  which 
all  three  clocks  were  initially  synchronized  in  Washington,  but  later 
went  out  of  sync  because  of  their  different  states  of  motion. 

It  might  seem  as  though  we  still  had  a  huge  amount  of  wiggle 
room  available  for  the  correct  form  of  the  distortion.  It  turns  out, 
however,  that  properties  1-5  are  sufficient  to  prove  that  there  is  only 
one  answer,  which  is  the  one  found  by  Einstein  in  1905.  To  see  why 
this  is,  let’s  work  by  a  process  of  elimination. 

Figure  e  shows  a  transformation  that  might  seem  at  first  glance 
to  be  as  good  a  candidate  as  any  other,  but  it  violates  property 
3,  that  motion  is  relative,  for  the  following  reason.  In  observer  2’s 
frame  of  reference,  some  of  the  grid  lines  cross  one  another.  This 
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means  that  observers  1  and  2  disagree  on  whether  or  not  certain 
events  are  the  same.  For  instance,  suppose  that  event  A  marks  the 
arrival  of  an  arrow  at  the  bull’s-eye  of  a  target,  and  event  B  is  the 
location  and  time  when  the  bull’s-eye  is  punctured.  Events  A  and  B 
occur  at  the  same  location  and  at  the  same  time.  If  one  observer  says 
that  A  and  B  coincide,  but  another  says  that  they  don’t,  we  have 
a  direct  contradiction.  Since  the  two  frames  of  reference  in  figure 
e  give  contradictory  results,  one  of  them  is  right  and  one  is  wrong. 
This  violates  property  3,  because  all  inertial  frames  of  reference  are 
supposed  to  be  equally  valid.  To  avoid  problems  like  this,  we  clearly 
need  to  make  sure  that  none  of  the  grid  lines  ever  cross  one  another. 

The  next  type  of  transformation  we  want  to  kill  off  is  shown  in 
figure  f,  in  which  the  grid  lines  curve,  but  never  cross  one  another. 
The  trouble  with  this  one  is  that  it  violates  property  1,  the  unifor¬ 
mity  of  time  and  space.  The  transformation  is  unusually  “twisty” 
at  A,  whereas  at  B  it’s  much  more  smooth.  This  can’t  be  correct, 
because  the  transformation  is  only  supposed  to  depend  on  the  rela¬ 
tive  state  of  motion  of  the  two  frames  of  reference,  and  that  given 
information  doesn’t  single  out  a  special  role  for  any  particular  point 
in  spacetime.  If,  for  example,  we  had  one  frame  of  reference  rotating 
relative  to  the  other,  then  there  would  be  something  special  about 
the  axis  of  rotation.  But  we’re  only  talking  about  inertial  frames  of 
reference  here,  as  specified  in  property  3,  so  we  can’t  have  rotation; 
each  frame  of  reference  has  to  be  moving  in  a  straight  line  at  con¬ 
stant  speed.  For  frames  related  in  this  way,  there  is  nothing  that 
could  single  out  an  event  like  A  for  special  treatment  compared  to 
B,  so  transformation  f  violates  property  1. 

The  examples  in  figures  e  and  f  show  that  the  transformation 
we’re  looking  for  must  be  linear,  meaning  that  it  must  transform 
lines  into  lines,  and  furthermore  that  it  has  to  take  parallel  lines  to 
parallel  lines.  Einstein  wrote  in  his  1905  paper  that  “. . .  on  account 
of  the  property  of  homogeneity  [property  1]  which  we  ascribe  to  time 
and  space,  the  [transformation]  must  be  linear.”1  Applying  this  to 
our  diagrams,  the  original  gray  rectangle,  which  is  a  special  type 
of  parallelogram  containing  right  angles,  must  be  transformed  into 
another  parallelogram.  There  are  three  types  of  transformations, 
figure  g,  that  have  this  property.  Case  I  is  the  Galilean  transforma¬ 
tion  of  figure  d  on  page  390,  which  we’ve  already  ruled  out. 


Case  II  can  also  be  discarded.  Here  every  point  on  the  grid  ro¬ 
tates  counterclockwise.  What  physical  parameter  would  determine 
the  amount  of  rotation?  The  only  thing  that  could  be  relevant  would 
be  v,  the  relative  velocity  of  the  motion  of  the  two  frames  of  reference 
with  respect  to  one  another.  But  if  the  angle  of  rotation  was  pro- 

XA.  Einstein,  “On  the  Electrodynamics  of  Moving  Bodies,”  Annalen  der 
Physik  17  (1905),  p.  891,  tr.  Saha  and  Bose. 


e  /  A  transformation  that  leads 
to  disagreements  about  whether 
two  events  occur  at  the  same 
time  and  place.  This  is  not  just 
a  matter  of  opinion.  Either  the 
arrow  hit  the  bull’s-eye  or  it  didn’t. 


f  /  A  nonlinear  transformation. 


Section  7.2  Distortion  of  Space  and  Time 


391 


t 


h  /  In  the  units  that  are  most 
convenient  for  relativity,  the  trans¬ 
formation  has  symmetry  about  a 
45-degree  diagonal  line. 


t 


i  /  Interpretation  of  the  Lorentz 
transformation.  The  slope  in¬ 
dicated  in  the  figure  gives  the 
relative  velocity  of  the  two  frames 
of  reference.  Events  A  and  B  that 
were  simultaneous  in  frame  1 
are  not  simultaneous  in  frame  2, 
where  event  A  occurs  to  the  right 
of  the  f  =  0  line  represented  by 
the  left  edge  of  the  grid,  but  event 
B  occurs  to  its  left. 


g/ Three  types  of  transformations  that  preserve  parallelism.  Their 
distinguishing  feature  is  what  they  do  to  simultaneity,  as  shown  by  what 
happens  to  the  left  edge  of  the  original  rectangle.  In  I,  the  left  edge 
remains  vertical,  so  simultaneous  events  remain  simultaneous.  In  II,  the 
left  edge  turns  counterclockwise.  In  III,  it  turns  clockwise. 


portional  to  v,  then  for  large  enough  velocities  the  grid  would  have 
left  and  right  reversed,  and  this  would  violate  property  4,  causality: 
one  observer  would  say  that  event  A  caused  a  later  event  B,  but 
another  observer  would  say  that  B  came  first  and  caused  A. 

The  only  remaining  possibility  is  case  III,  which  I’ve  redrawn 
in  figure  h  with  a  couple  of  changes.  This  is  the  one  that  Einstein 
predicted  in  1905.  The  transformation  is  known  as  the  Lorentz 
transformation,  after  Hendrik  Lorentz  (1853-1928),  who  partially 
anticipated  Einstein’s  work,  without  arriving  at  the  correct  inter¬ 
pretation.  The  distortion  is  a  kind  of  smooshing  and  stretching, 
as  suggested  by  the  hands.  Also,  we’ve  already  seen  in  figures  a-c 
on  page  389  that  we’re  free  to  stretch  or  compress  everything  as 
much  as  we  like  in  the  horizontal  and  vertical  directions,  because 
this  simply  corresponds  to  choosing  different  units  of  measurement 
for  time  and  distance.  In  figure  h  I’ve  chosen  units  that  give  the 
whole  drawing  a  convenient  symmetry  about  a  45-degree  diagonal 
line.  Ordinarily  it  wouldn’t  make  sense  to  talk  about  a  45-degree 
angle  on  a  graph  whose  axes  had  different  units.  But  in  relativity, 
the  symmetric  appearance  of  the  transformation  tells  us  that  space 
and  time  ought  to  be  treated  on  the  same  footing,  and  measured  in 
the  same  units. 

As  in  our  discussion  of  the  Galilean  transformation,  slopes  are 
interpreted  as  velocities,  and  the  slope  of  the  near- horizontal  lines 
in  figure  i  is  interpreted  as  the  relative  velocity  of  the  two  observers. 
The  difference  between  the  Galilean  version  and  the  relativistic  one 
is  that  now  there  is  smooshing  happening  from  the  other  side  as 
well.  Lines  that  were  vertical  in  the  original  grid,  representing  si¬ 
multaneous  events,  now  slant  over  to  the  right.  This  tells  us  that,  as 
required  by  property  5,  different  observers  do  not  agree  on  whether 
events  that  occur  in  different  places  are  simultaneous.  The  Hafele- 
Keating  experiment  tells  us  that  this  non-simultaneity  effect  is  fairly 
small,  even  when  the  velocity  is  as  big  as  that  of  a  passenger  jet, 
and  this  is  what  we  would  have  anticipated  by  the  correspondence 
principle.  The  way  that  this  is  expressed  in  the  graph  is  that  if  we 
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pick  the  time  unit  to  be  the  second,  then  the  distance  unit  turns  out 
to  be  hundreds  of  thousands  of  miles.  In  these  units,  the  velocity 
of  a  passenger  jet  is  an  extremely  small  number,  so  the  slope  v  in 
figure  i  is  extremely  small,  and  the  amount  of  distortion  is  tiny  — 
it  would  be  much  too  small  to  see  on  this  scale. 

The  only  thing  left  to  determine  about  the  Lorentz  transforma¬ 
tion  is  the  size  of  the  transformed  parallelogram  relative  to  the  size 
of  the  original  one.  Although  the  drawing  of  the  hands  in  figure  h 
may  suggest  that  the  grid  deforms  like  a  framework  made  of  rigid 
coat-hanger  wire,  that  is  not  the  case.  If  you  look  carefully  at  the 
figure,  you’ll  see  that  the  edges  of  the  smooshed  parallelogram  are 
actually  a  little  longer  than  the  edges  of  the  original  rectangle.  In 
fact  what  stays  the  same  is  not  lengths  but  areas ,  as  proved  in  the 
caption  to  figure  j . 


j  /  Proof  that  Lorentz  transformations  don’t  change  area:  We  first  subject  a  square  to  a  transformation 
with  velocity  v,  and  this  increases  its  area  by  a  factor  R(v),  which  we  want  to  prove  equals  1.  We  chop  the 
resulting  parallelogram  up  into  little  squares  and  finally  apply  a  —v  transformation;  this  changes  each  little 
square’s  area  by  a  factor  R(—v),  so  the  whole  figure’s  area  is  also  scaled  by  R{-v).  The  final  result  is  to  restore 
the  square  to  its  original  shape  and  area,  so  R(v)R(—v)  =  1.  But  R(v)  =  R(—v)  by  property  2  of  spacetime  on 
page  389,  which  states  that  all  directions  in  space  have  the  same  properties,  so  R(v)  =  1 . 


7.2.2  The  7  factor 

With  a  little  algebra  and  geometry  (homework  problem  7,  page 
444),  one  can  use  the  equal-area  property  to  show  that  the  factor  7 
(Greek  letter  gamma)  defined  in  figure  k  is  given  by  the  equation 

7  =  x/T^' 

If  you’ve  had  good  training  in  physics,  the  first  thing  you  probably 
think  when  you  look  at  this  equation  is  that  it  must  be  nonsense, 
because  its  units  don’t  make  sense.  How  can  we  take  something 
with  units  of  velocity  squared,  and  subtract  it  from  a  unitless  1? 
But  remember  that  this  is  expressed  in  our  special  relativistic  units, 
in  which  the  same  units  are  used  for  distance  and  time.  In  this 
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k  /  The  y  factor. 


I /The  ruler  is  moving  in  frame 
1,  represented  by  a  square,  but 
at  rest  in  frame  2,  shown  as  a 
parallelogram.  Each  picture  of 
the  ruler  is  a  snapshot  taken 
at  a  certain  moment  as  judged 
according  to  frame  2’s  notion 
of  simultaneity.  An  observer  in 
frame  1  judges  the  ruler’s  length 
instead  according  to  frame  1’s 
definition  of  simultaneity,  i.e., 
using  points  that  are  lined  up 
vertically  on  the  graph.  The  ruler 
appears  shorter  in  the  frame  in 
which  it  is  moving.  As  proved 
in  figure  m,  the  length  contracts 
from  L  to  L/y. 


system,  velocities  are  always  unitless.  This  sort  of  thing  happens 
frequently  in  physics.  For  instance,  before  James  Joule  discovered 
conservation  of  energy,  nobody  knew  that  heat  and  mechanical  en¬ 
ergy  were  different  forms  of  the  same  thing,  so  instead  of  measuring 
them  both  in  units  of  joules  as  we  would  do  now,  they  measured 
heat  in  one  unit  (such  as  calories)  and  mechanical  energy  in  another 
(such  as  foot-pounds).  In  ordinary  metric  units,  we  just  need  an 
extra  conversion  factor  c,  and  the  equation  becomes 


1 


Here’s  why  we  care  about  7.  Figure  k  defines  it  as  the  ratio  of  two 
times:  the  time  between  two  events  as  expressed  in  one  coordinate 
system,  and  the  time  between  the  same  two  events  as  measured  in 
the  other  one.  The  interpretation  is: 

Time  dilation 

A  clock  runs  fastest  in  the  frame  of  reference  of  an  observer 
who  is  at  rest  relative  to  the  clock.  An  observer  in  motion 
relative  to  the  clock  at  speed  v  perceives  the  clock  as  running 
more  slowly  by  a  factor  of  7. 


x  slice 


sliced 


m/This  figure  proves,  as  claimed  in  figure  I,  that  the  length  con¬ 
traction  is  x  =  1  /y.  First  we  slice  the  parallelogram  vertically  like  a  salami 
and  slide  the  slices  down,  making  the  top  and  bottom  edges  horizontal. 
Then  we  do  the  same  in  the  horizontal  direction,  forming  a  rectangle  with 
sides  y  and  x.  Since  both  the  Lorentz  transformation  and  the  slicing 
processes  leave  areas  unchanged,  the  area  yx  of  the  rectangle  must 
equal  the  area  of  the  original  square,  which  is  1 . 


As  proved  in  figures  1  and  m,  lengths  are  also  distorted: 

Length  contraction 

A  meter-stick  appears  longest  to  an  observer  who  is  at  rest 
relative  to  it.  An  observer  moving  relative  to  the  meter-stick 
at  v  observes  the  stick  to  be  shortened  by  a  factor  of  7. 

self-check  A 

What  is  y  when  v  =  0?  What  does  this  mean?  >  Answer,  p.  964 
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An  interstellar  road  trip  example  1 

Alice  stays  on  earth  while  her  twin  Betty  heads  off  in  a  spaceship 
for  Tau  Ceti,  a  nearby  star.  Tau  Ceti  is  12  light-years  away,  so 
even  though  Betty  travels  at  87%  of  the  speed  of  light,  it  will  take 
her  a  long  time  to  get  there:  14  years,  according  to  Alice. 


Betty  experiences  time  dilation.  At  this  speed,  her  y  is  2.0,  so  that 
the  voyage  will  only  seem  to  her  to  last  7  years.  But  there  is  per¬ 
fect  symmetry  between  Alice’s  and  Betty’s  frames  of  reference,  so 
Betty  agrees  with  Alice  on  their  relative  speed;  Betty  sees  herself 
as  being  at  rest,  while  the  sun  and  Tau  Ceti  both  move  backward 
at  87%  of  the  speed  of  light.  How,  then,  can  she  observe  Tau  Ceti 
to  get  to  her  in  only  7  years,  when  it  should  take  1 4  years  to  travel 
12  light-years  at  this  speed? 

We  need  to  take  into  account  length  contraction.  Betty  sees  the 
distance  between  the  sun  and  Tau  Ceti  to  be  shrunk  by  a  factor  of 
2.  The  same  thing  occurs  for  Alice,  who  observes  Betty  and  her 
spaceship  to  be  foreshortened. 

Large  time  dilation  example  2 

The  time  dilation  effect  in  the  Hafele-Keating  experiment  was  very 
small.  If  we  want  to  see  a  large  time  dilation  effect,  we  can’t  do 
it  with  something  the  size  of  the  atomic  clocks  they  used;  the  ki¬ 
netic  energy  would  be  greater  than  the  total  megatonnage  of  all 
the  world’s  nuclear  arsenals.  We  can,  however,  accelerate  sub¬ 
atomic  particles  to  speeds  at  which  y  is  large.  For  experimental 
particle  physicists,  relativity  is  something  you  do  all  day  before 
heading  home  and  stopping  off  at  the  store  for  milk.  An  early,  low- 
precision  experiment  of  this  kind  was  performed  by  Rossi  and  Hall 
in  1941,  using  naturally  occurring  cosmic  rays.  Figure  p  shows  a 
1974  experiment2  of  a  similar  type  which  verified  the  time  dilation 
predicted  by  relativity  to  a  precision  of  about  one  part  per  thou¬ 
sand. 


Particles  called  muons  (named  after  the  Greek  letter  p,  “myoo”) 
were  produced  by  an  accelerator  at  CERN,  near  Geneva.  A  muon 

2Bailey  at  al.,  Nucl.  Pliys.  B150(1979)  1 
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o  /  Muons  accelerated  to  nearly  c 
undergo  radioactive  decay  much 
more  slowly  than  they  would 
according  to  an  observer  at  rest 
with  respect  to  the  muons.  The 
first  two  data-points  (unfilled 
circles)  were  subject  to  large 
systematic  errors. 


Section  7.2  Distortion  of  Space  and  Time 


395 


p  /  Apparatus  used  for  the  test 
of  relativistic  time  dilation  de¬ 
scribed  in  example  2.  The  promi¬ 
nent  black  and  white  blocks  are 
large  magnets  surrounding  a  cir¬ 
cular  pipe  with  a  vacuum  inside, 
(c)  1974  by  CERN. 


is  essentially  a  heavier  version  of  the  electron.  Muons  undergo 
radioactive  decay,  lasting  an  average  of  only  2.197  ^s  before  they 
evaporate  into  an  electron  and  two  neutrinos.  The  1974  experi¬ 
ment  was  actually  built  in  order  to  measure  the  magnetic  proper¬ 
ties  of  muons,  but  it  produced  a  high-precision  test  of  time  dilation 
as  a  byproduct.  Because  muons  have  the  same  electric  charge 
as  electrons,  they  can  be  trapped  using  magnetic  fields.  Muons 
were  injected  into  the  ring  shown  in  figure  p,  circling  around  it  un¬ 
til  they  underwent  radioactive  decay.  At  the  speed  at  which  these 
muons  were  traveling,  they  had  y  =  29.33,  so  on  the  average  they 
lasted  29.33  times  longer  than  the  normal  lifetime.  In  other  words, 
they  were  like  tiny  alarm  clocks  that  self-destructed  at  a  randomly 
selected  time.  Figure  o  shows  the  number  of  radioactive  decays 
counted,  as  a  function  of  the  time  elapsed  after  a  given  stream  of 
muons  was  injected  into  the  storage  ring.  The  two  dashed  lines 
show  the  rates  of  decay  predicted  with  and  without  relativity.  The 
relativistic  line  is  the  one  that  agrees  with  experiment. 


q  /  Colliding  nuclei  show  relativis¬ 
tic  length  contraction. 
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bus  garage 


An  example  of  length  contraction  example  3 

Figure  q  shows  an  artist’s  rendering  of  the  length  contraction  for 
the  collision  of  two  gold  nuclei  at  relativistic  speeds  in  the  RHIC 
accelerator  in  Long  Island,  New  York,  which  went  on  line  in  2000. 
The  gold  nuclei  would  appear  nearly  spherical  (or  just  slightly 
lengthened  like  an  American  football)  in  frames  moving  along  with 
them,  but  in  the  laboratory’s  frame,  they  both  appear  drastically 
foreshortened  as  they  approach  the  point  of  collision.  The  later 
pictures  show  the  nuclei  merging  to  form  a  hot  soup,  in  which 
experimenters  hope  to  observe  a  new  form  of  matter. 


r/ Example  4:  In  the  garage’s  frame  of  reference,  the  bus  is  moving,  and  can  fit  in  the  garage  due  to 
its  length  contraction.  In  the  bus’s  frame  of  reference,  the  garage  is  moving,  and  can’t  hold  the  bus  due  to  its 
length  contraction. 

The  garage  paradox  example  4 

One  of  the  most  famous  of  all  the  so-called  relativity  paradoxes 
has  to  do  with  our  incorrect  feeling  that  simultaneity  is  well  de¬ 
fined.  The  idea  is  that  one  could  take  a  schoolbus  and  drive  it  at 
relativistic  speeds  into  a  garage  of  ordinary  size,  in  which  it  nor¬ 
mally  would  not  fit.  Because  of  the  length  contraction,  the  bus 
would  supposedly  fit  in  the  garage.  The  driver,  however,  will  per¬ 
ceive  the  garage  as  being  contracted  and  thus  even  less  able  to 
contain  the  bus. 

The  paradox  is  resolved  when  we  recognize  that  the  concept  of 
fitting  the  bus  in  the  garage  “all  at  once”  contains  a  hidden  as¬ 
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s/A  proof  that  causality  im¬ 
poses  a  universal  speed  limit.  In 
the  original  frame  of  reference, 
represented  by  the  square,  event 
A  happens  a  little  before  event  B. 
In  the  new  frame,  shown  by  the 
parallelogram,  A  happens  after 
t  =  0,  but  B  happens  before  t  =  0; 
that  is,  B  happens  before  A.  The 
time  ordering  of  the  two  events 
has  been  reversed.  This  can  only 
happen  because  events  A  and  B 
are  very  close  together  in  time 
and  fairly  far  apart  in  space.  The 
line  segment  connecting  A  and 
B  has  a  slope  greater  than  1, 
meaning  that  if  we  wanted  to  be 
present  at  both  events,  we  would 
have  to  travel  at  a  speed  greater 
than  c  (which  equals  1  in  the 
units  used  on  this  graph).  You  will 
find  that  if  you  pick  any  two  points 
for  which  the  slope  of  the  line 
segment  connecting  them  is  less 
than  1 ,  you  can  never  get  them  to 
straddle  the  new  t  =  0  line  in  this 
funny,  time-reversed  way.  Since 
different  observers  disagree  on 
the  time  order  of  events  like  A 
and  B,  causality  requires  that 
information  never  travel  from 
A  to  B  or  from  B  to  A;  if  it  did, 
then  we  would  have  time-travel 
paradoxes.  The  conclusion  is  that 
c  is  the  maximum  speed  of  cause 
and  effect  in  relativity. 


sumption,  the  assumption  that  it  makes  sense  to  ask  whether  the 
front  and  back  of  the  bus  can  simultaneously  be  in  the  garage. 
Observers  in  different  frames  of  reference  moving  at  high  rela¬ 
tive  speeds  do  not  necessarily  agree  on  whether  things  happen 
simultaneously.  As  shown  in  figure  r,  the  person  in  the  garage’s 
frame  can  shut  the  door  at  an  instant  B  he  perceives  to  be  si¬ 
multaneous  with  the  front  bumper’s  arrival  A  at  the  back  wall  of 
the  garage,  but  the  driver  would  not  agree  about  the  simultaneity 
of  these  two  events,  and  would  perceive  the  door  as  having  shut 
long  after  she  plowed  through  the  back  wall. 

7.2.3  The  universal  speed  c 

Let’s  think  a  little  more  about  the  role  of  the  45-degree  diagonal 
in  the  Lorentz  transformation.  Slopes  on  these  graphs  are  inter¬ 
preted  as  velocities.  This  line  has  a  slope  of  1  in  relativistic  units, 
but  that  slope  corresponds  to  c  in  ordinary  metric  units.  We  al¬ 
ready  know  that  the  relativistic  distance  unit  must  be  extremely 
large  compared  to  the  relativistic  time  unit,  so  c  must  be  extremely 
large.  Now  note  what  happens  when  we  perform  a  Lorentz  transfor¬ 
mation:  this  particular  line  gets  stretched,  but  the  new  version  of 
the  line  lies  right  on  top  of  the  old  one,  and  its  slope  stays  the  same. 
In  other  words,  if  one  observer  says  that  something  has  a  velocity 
equal  to  c,  every  other  observer  will  agree  on  that  velocity  as  well. 
(The  same  thing  happens  with  — c.) 

Velocities  don’t  simply  add  and  subtract. 

This  is  counterintuitive,  since  we  expect  velocities  to  add  and 
subtract  in  relative  motion.  If  a  dog  is  running  away  from  me  at  5 
m/s  relative  to  the  sidewalk,  and  I  run  after  it  at  3  m/s,  the  dog’s 
velocity  in  my  frame  of  reference  is  2  m/s.  According  to  everything 
we  have  learned  about  motion,  the  dog  must  have  different  speeds 
in  the  two  frames:  5  m/s  in  the  sidewalk’s  frame  and  2  m/s  in 
mine.  But  velocities  are  measured  by  dividing  a  distance  by  a  time, 
and  both  distance  and  time  are  distorted  by  relativistic  effects,  so 
we  actually  shouldn’t  expect  the  ordinary  arithmetic  addition  of 
velocities  to  hold  in  relativity;  it’s  an  approximation  that’s  valid  at 
velocities  that  are  small  compared  to  c. 

A  universal  speed  limit 

For  example,  suppose  Janet  takes  a  trip  in  a  spaceship,  and 
accelerates  until  she  is  moving  at  0.6c  relative  to  the  earth.  She 
then  launches  a  space  probe  in  the  forward  direction  at  a  speed 
relative  to  her  ship  of  0.6c.  We  might  think  that  the  probe  was  then 
moving  at  a  velocity  of  1.2c,  but  in  fact  the  answer  is  still  less  than 
c  (problem  1,  page  443).  This  is  an  example  of  a  more  general  fact 
about  relativity,  which  is  that  c  represents  a  universal  speed  limit. 
This  is  required  by  causality,  as  shown  in  figure  s. 
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t  /  The  Michelson-Morley  experiment,  shown  in  photographs,  and 
drawings  from  the  original  1887  paper.  1.  A  simplified  drawing  of  the 
apparatus.  A  beam  of  light  from  the  source,  s,  is  partially  reflected  and 
partially  transmitted  by  the  half-silvered  mirror  hq .  The  two  half-intensity 
parts  of  the  beam  are  reflected  by  the  mirrors  at  a  and  b,  reunited,  and 
observed  in  the  telescope,  t.  If  the  earth’s  surface  was  supposed  to  be 
moving  through  the  ether,  then  the  times  taken  by  the  two  light  waves  to 
pass  through  the  moving  ether  would  be  unequal,  and  the  resulting  time 
lag  would  be  detectable  by  observing  the  interference  between  the  waves 
when  they  were  reunited.  2.  In  the  real  apparatus,  the  light  beams  were 
reflected  multiple  times.  The  effective  length  of  each  arm  was  increased 
to  11  meters,  which  greatly  improved  its  sensitivity  to  the  small  expected 
difference  in  the  speed  of  light.  3.  In  an  earlier  version  of  the  experiment, 
they  had  run  into  problems  with  its  “extreme  sensitiveness  to  vibration,” 
which  was  “so  great  that  it  was  impossible  to  see  the  interference  fringes 
except  at  brief  intervals  ...even  at  two  o’clock  in  the  morning.”  They 
therefore  mounted  the  whole  thing  on  a  massive  stone  floating  in  a  pool 
of  mercury,  which  also  made  it  possible  to  rotate  it  easily.  4.  A  photo  of 
the  apparatus. 


Light  travels  at  c. 

Now  consider  a  beam  of  light.  We’re  used  to  talking  casually 
about  the  “speed  of  light,”  but  what  does  that  really  mean?  Motion 
is  relative,  so  normally  if  we  want  to  talk  about  a  velocity,  we  have 
to  specify  what  it’s  measured  relative  to.  A  sound  wave  has  a  certain 
speed  relative  to  the  air,  and  a  water  wave  has  its  own  speed  relative 
to  the  water.  If  we  want  to  measure  the  speed  of  an  ocean  wave,  for 
example,  we  should  make  sure  to  measure  it  in  a  frame  of  reference 
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u/A  ring  laser  gyroscope. 


at  rest  relative  to  the  water.  But  light  isn’t  a  vibration  of  a  physical 
medium;  it  can  propagate  through  the  near-perfect  vacuum  of  outer 
space,  as  when  rays  of  sunlight  travel  to  earth.  This  seems  like  a 
paradox:  light  is  supposed  to  have  a  specific  speed,  but  there  is  no 
way  to  decide  what  frame  of  reference  to  measure  it  in.  The  way 
out  of  the  paradox  is  that  light  must  travel  at  a  velocity  equal  to  c. 
Since  all  observers  agree  on  a  velocity  of  c,  regardless  of  their  frame 
of  reference,  everything  is  consistent. 

The  Michelson-Morley  experiment 

The  constancy  of  the  speed  of  light  had  in  fact  already  been 
observed  when  Einstein  was  an  8- year-old  boy,  but  because  nobody 
could  figure  out  how  to  interpret  it,  the  result  was  largely  ignored. 
In  1887  Michelson  and  Morley  set  up  a  clever  apparatus  to  measure 
any  difference  in  the  speed  of  light  beams  traveling  east-west  and 
north-south.  The  motion  of  the  earth  around  the  sun  at  110,000 
km/hour  (about  0.01%  of  the  speed  of  light)  is  to  our  west  during  the 
day.  Michelson  and  Morley  believed  that  light  was  a  vibration  of  a 
mysterious  medium  called  the  ether,  so  they  expected  that  the  speed 
of  light  would  be  a  fixed  value  relative  to  the  ether.  As  the  earth 
moved  through  the  ether,  they  thought  they  would  observe  an  effect 
on  the  velocity  of  light  along  an  east-west  line.  For  instance,  if  they 
released  a  beam  of  light  in  a  westward  direction  during  the  day,  they 
expected  that  it  would  move  away  from  them  at  less  than  the  normal 
speed  because  the  earth  was  chasing  it  through  the  ether.  They  were 
surprised  when  they  found  that  the  expected  0.01%  change  in  the 
speed  of  light  did  not  occur. 

The  ring  laser  gyroscope  example  5 

If  you’ve  flown  in  a  jet  plane,  you  can  thank  relativity  for  help¬ 
ing  you  to  avoid  crashing  into  a  mountain  or  an  ocean.  Figure  u 
shows  a  standard  piece  of  navigational  equipment  called  a  ring 
laser  gyroscope.  A  beam  of  light  is  split  into  two  parts,  sent 
around  the  perimeter  of  the  device,  and  reunited.  Since  the  speed 
of  light  is  constant,  we  expect  the  two  parts  to  come  back  together 
at  the  same  time.  If  they  don’t,  it’s  evidence  that  the  device  has 
been  rotating.  The  plane’s  computer  senses  this  and  notes  how 
much  rotation  has  accumulated. 

No  frequency-dependence  example  6 

Relativity  has  only  one  universal  speed,  so  it  requires  that  all  light 
waves  travel  at  the  same  speed,  regardless  of  their  frequency 
and  wavelength.  Presently  the  best  experimental  tests  of  the  in¬ 
variance  of  the  speed  of  light  with  respect  to  wavelength  come 
from  astronomical  observations  of  gamma-ray  bursts,  which  are 
sudden  outpourings  of  high-frequency  light,  believed  to  originate 
from  a  supernova  explosion  in  another  galaxy.  One  such  obser¬ 
vation,  in  2009, 3  found  that  the  times  of  arrival  of  all  the  different 
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frequencies  in  the  burst  differed  by  no  more  than  2  seconds  out 
of  a  total  time  in  flight  on  the  order  of  ten  billion  years! 


Discussion  Questions 

A  A  person  in  a  spaceship  moving  at  99.99999999%  of  the  speed 
of  light  relative  to  Earth  shines  a  flashlight  forward  through  dusty  air,  so 
the  beam  is  visible.  What  does  she  see?  What  would  it  look  like  to  an 
observer  on  Earth? 

B  A  question  that  students  often  struggle  with  is  whether  time  and 
space  can  really  be  distorted,  or  whether  it  just  seems  that  way.  Compare 
with  optical  illusions  or  magic  tricks.  How  could  you  verify,  for  instance, 
that  the  lines  in  the  figure  are  actually  parallel?  Are  relativistic  effects  the 
same,  or  not? 

C  On  a  spaceship  moving  at  relativistic  speeds,  would  a  lecture  seem 
even  longer  and  more  boring  than  normal? 

D  Mechanical  clocks  can  be  affected  by  motion.  For  example,  it  was 
a  significant  technological  achievement  to  build  a  clock  that  could  sail 
aboard  a  ship  and  still  keep  accurate  time,  allowing  longitude  to  be  deter¬ 
mined.  How  is  this  similar  to  or  different  from  relativistic  time  dilation? 

E  Figure  q  from  page  396,  depicting  the  collision  of  two  nuclei  at  the 
RHIC  accelerator,  is  reproduced  below.  What  would  the  shapes  of  the  two 
nuclei  look  like  to  a  microscopic  observer  riding  on  the  left-hand  nucleus? 
To  an  observer  riding  on  the  right-hand  one?  Can  they  agree  on  what  is 
happening?  If  not,  why  not  —  after  all,  shouldn’t  they  see  the  same  thing 
if  they  both  compare  the  two  nuclei  side-by-side  at  the  same  instant  in 
time? 


v/ Discussion  question  E:  colliding  nuclei  show  relativistic  length 
contraction. 

F  If  you  stick  a  piece  of  foam  rubber  out  the  window  of  your  car  while 
driving  down  the  freeway,  the  wind  may  compress  it  a  little.  Does  it  make 
sense  to  interpret  the  relativistic  length  contraction  as  a  type  of  strain 
that  pushes  an  object’s  atoms  together  like  this?  How  does  this  relate  to 
discussion  question  E? 


Discussion  question  B 
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G  The  machine-gunner  in  the  figure  sends  out  a  spray  of  bullets. 
Suppose  that  the  bullets  are  being  shot  into  outer  space,  and  that  the 
distances  traveled  are  trillions  of  miles  (so  that  the  human  figure  in  the 
diagram  is  not  to  scale).  After  a  long  time,  the  bullets  reach  the  points 
shown  with  dots  which  are  all  equally  far  from  the  gun.  Their  arrivals 
at  those  points  are  events  A  through  E,  which  happen  at  different  times. 
Sketch  these  events  on  a  position-time  graph.  The  chain  of  impacts  ex¬ 
tends  across  space  at  a  speed  greater  than  c.  Does  this  violate  special 
relativity? 


Discussion  question  G. 


402 


Chapter  7  Relativity 


7.2.4  No  action  at  a  distance 

The  Newtonian  picture 

The  Newtonian  picture  of  the  universe  has  particles  interacting 
with  each  other  by  exerting  forces  from  a  distance,  and  these  forces 
are  imagined  to  occur  without  any  time  delay.  For  example,  suppose 
that  super-powerful  aliens,  angered  when  they  hear  disco  music  in 
our  AM  radio  transmissions,  come  to  our  solar  system  on  a  mission 
to  cleanse  the  universe  of  our  aesthetic  contamination.  They  apply 
a  force  to  our  sun,  causing  it  to  go  flying  out  of  the  solar  system  at 
a  gazillion  miles  an  hour.  According  to  Newton’s  laws,  the  gravita¬ 
tional  force  of  the  sun  on  the  earth  will  immediately  start  dropping 
off.  This  will  be  detectable  on  earth,  and  since  sunlight  takes  eight 
minutes  to  get  from  the  sun  to  the  earth,  the  change  in  gravitational 
force  will,  according  to  Newton,  be  the  first  way  in  which  earthlings 
learn  the  bad  news  —  the  sun  will  not  visibly  start  receding  until 
a  little  later.  Although  this  scenario  is  fanciful,  it  shows  a  real  fea¬ 
ture  of  Newton’s  laws:  that  information  can  be  transmitted  from 
one  place  in  the  universe  to  another  with  zero  time  delay,  so  that 
transmission  and  reception  occur  at  exactly  the  same  instant.  New¬ 
ton  was  sharp  enough  to  realize  that  this  required  a  nontrivial  as¬ 
sumption,  which  was  that  there  was  some  completely  objective  and 
well-defined  way  of  saying  whether  two  things  happened  at  exactly 
the  same  instant.  He  stated  this  assumption  explicitly:  “Absolute, 
true,  and  mathematical  time,  of  itself,  and  from  its  own  nature  flows 
at  a  constant  rate  without  regard  to  anything  external. . .  ” 

Time  delays  in  forces  exerted  at  a  distance 

Relativity  forbids  Newton’s  instantaneous  action  at  a  distance. 
For  suppose  that  instantaneous  action  at  a  distance  existed.  It 
would  then  be  possible  to  send  signals  from  one  place  in  the  uni¬ 
verse  to  another  without  any  time  lag.  This  would  allow  perfect 
synchronization  of  all  clocks.  But  the  Hafele-Keating  experiment 
demonstrates  that  clocks  A  and  B  that  have  been  initially  syn¬ 
chronized  will  drift  out  of  sync  if  one  is  in  motion  relative  to  the 
other.  With  instantaneous  transmission  of  signals,  we  could  deter¬ 
mine,  without  having  to  wait  for  A  and  B  to  be  reunited,  which  was 
ahead  and  which  was  behind.  Since  they  don’t  need  to  be  reunited, 
neither  one  needs  to  undergo  any  acceleration;  each  clock  can  fix  an 
inertial  frame  of  reference,  with  a  velocity  vector  that  changes  nei¬ 
ther  its  direction  nor  its  magnitude.  But  this  violates  the  principle 
that  constant-velocity  motion  is  relative,  because  each  clock  can  be 
considered  to  be  at  rest,  in  its  own  frame  of  reference.  Since  no  ex¬ 
periment  has  ever  detected  any  violation  of  the  relativity  of  motion, 
we  conclude  that  instantaneous  action  at  a  distance  is  impossible. 

Since  forces  can’t  be  transmitted  instantaneously,  it  becomes 
natural  to  imagine  force-effects  spreading  outward  from  their  source 
like  ripples  on  a  pond,  and  we  then  have  no  choice  but  to  impute 
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some  physical  reality  to  these  ripples.  We  call  them  fields,  and 
they  have  their  own  independent  existence.  Gravity  is  transmitted 
through  a  field  called  the  gravitational  held.  Besides  gravity,  there 
are  other  fundamental  fields  of  force  such  as  electricity  and  mag¬ 
netism  (ch.  10-11).  Ripples  of  the  electric  and  magnetic  fields  turn 
out  to  be  light  waves.  This  tells  us  that  the  speed  at  which  electric 
and  magnetic  held  ripples  spread  must  be  c,  and  by  an  argument 
similar  to  the  one  in  subsection  7.2.3  the  same  must  hold  for  any 
other  fundamental  held,  including  the  gravitational  held. 

Fields  don’t  have  to  wiggle;  they  can  hold  still  as  well.  The 
earth’s  magnetic  held,  for  example,  is  nearly  constant,  which  is  why 
we  can  use  it  for  direction-finding. 

Even  empty  space,  then,  is  not  perfectly  featureless.  It  has  mea¬ 
surable  properties.  For  example,  we  can  drop  a  rock  in  order  to 
measure  the  direction  of  the  gravitational  held,  or  use  a  magnetic 
compass  to  hnd  the  direction  of  the  magnetic  held.  This  concept 
made  a  deep  impression  on  Einstein  as  a  child.  He  recalled  that  as 
a  hve-year-old,  the  gift  of  a  magnetic  compass  convinced  him  that 
there  was  “something  behind  things,  something  deeply  hidden.” 

More  evidence  that  Gelds  of  force  are  real:  they  carry  energy. 

The  smoking-gun  argument  for  this  strange  notion  of  traveling 
force  ripples  comes  from  the  fact  that  they  carry  energy.  In  hgure 
x/1,  Alice  and  Betty  hold  balls  A  and  B  at  some  distance  from 
one  another.  These  balls  make  a  force  on  each  other;  it  doesn’t 
really  matter  for  the  sake  of  our  argument  whether  this  force  is 
gravitational,  electrical,  or  magnetic.  Let’s  say  it’s  electrical,  i.e., 
that  the  balls  have  the  kind  of  electrical  charge  that  sometimes 
causes  your  socks  to  cling  together  when  they  come  out  of  the  clothes 
dryer.  We’ll  say  the  force  is  repulsive,  although  again  it  doesn’t 
really  matter. 

If  Alice  chooses  to  move  her  ball  closer  to  Betty’s,  x/2,  Alice  will 
have  to  do  some  mechanical  work  against  the  electrical  repulsion, 
burning  off  some  of  the  calories  from  that  chocolate  cheesecake  she 
had  at  lunch.  This  reduction  in  her  body’s  chemical  energy  is  offset 
by  a  corresponding  increase  in  the  electrical  interaction  energy.  Not 
only  that,  but  Alice  feels  the  resistance  stiffen  as  the  balls  get  closer 
together  and  the  repulsion  strengthens.  She  has  to  do  a  little  extra 
work,  but  this  is  all  properly  accounted  for  in  the  interaction  energy. 

x  /  Fields  carry  energy  But  now  suPPose>  x/3,  that  Betty  decides  to  play  a  trick  on  Al¬ 

ice  by  tossing  B  far  away  just  as  Alice  is  getting  ready  to  move  A. 
We  have  already  established  that  Alice  can’t  feel  B’s  motion  instan¬ 
taneously,  so  the  electric  forces  must  actually  be  propagated  by  an 
electric  field.  Of  course  this  experiment  is  utterly  impractical,  but 
suppose  for  the  sake  of  argument  that  the  time  it  takes  the  change 
in  the  electric  field  to  propagate  across  the  diagram  is  long  enough 
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so  that  Alice  can  complete  her  motion  before  she  feels  the  effect  of 
B’s  disappearance.  She  is  still  getting  stale  information  about  B’s 
position.  As  she  moves  A  to  the  right,  she  feels  a  repulsion,  because 
the  field  in  her  region  of  space  is  still  the  field  caused  by  B  in  its  old 
position.  She  has  burned  some  chocolate  cheesecake  calories,  and  it 
appears  that  conservation  of  energy  has  been  violated,  because  these 
calories  can’t  be  properly  accounted  for  by  any  interaction  with  B, 
which  is  long  gone. 

If  we  hope  to  preserve  the  law  of  conservation  of  energy,  then 
the  only  possible  conclusion  is  that  the  electric  field  itself  carries 
away  the  cheesecake  energy.  In  fact,  this  example  represents  an 
impractical  method  of  transmitting  radio  waves.  Alice  does  work 
on  charge  A,  and  that  energy  goes  into  the  radio  waves.  Even  if  B 
had  never  existed,  the  radio  waves  would  still  have  carried  energy, 
and  Alice  would  still  have  had  to  do  work  in  order  to  create  them. 

Discussion  Questions 

A  Amy  and  Bill  are  flying  on  spaceships  in  opposite  directions  at  such 
high  velocities  that  the  relativistic  effect  on  time’s  rate  of  flow  is  easily 
noticeable.  Motion  is  relative,  so  Amy  considers  herself  to  be  at  rest  and 
Bill  to  be  in  motion.  She  says  that  time  is  flowing  normally  for  her,  but 
Bill  is  slow.  But  Bill  can  say  exactly  the  same  thing.  How  can  they  both 
think  the  other  is  slow?  Can  they  settle  the  disagreement  by  getting  on 
the  radio  and  seeing  whose  voice  is  normal  and  whose  sounds  slowed 
down  and  Darth-Vadery? 


B  The  figure  shows  a  famous  thought  experiment  devised  by  Einstein. 
A  train  is  moving  at  constant  velocity  to  the  right  when  bolts  of  lightning 
strike  the  ground  near  its  front  and  back.  Alice,  standing  on  the  dirt  at 
the  midpoint  of  the  flashes,  observes  that  the  light  from  the  two  flashes 
arrives  simultaneously,  so  she  says  the  two  strikes  must  have  occurred 
simultaneously.  Bob,  meanwhile,  is  sitting  aboard  the  train,  at  its  middle. 
He  passes  by  Alice  at  the  moment  when  Alice  later  figures  out  that  the 
flashes  happened.  Later,  he  receives  flash  2,  and  then  flash  1.  He  infers 
that  since  both  flashes  traveled  half  the  length  of  the  train,  flash  2  must 
have  occurred  first.  How  can  this  be  reconciled  with  Alice’s  belief  that  the 
flashes  were  simultaneous?  Explain  using  a  graph. 
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aa  /  The  light  cone. 


C  Resolve  the  following  paradox  by  drawing  a  spacetime  diagram 
(i.e.,  a  graph  of  x  versus  t).  Andy  and  Beth  are  in  motion  relative  to  one 
another  at  a  significant  fraction  of  c.  As  they  pass  by  each  other,  they 
exchange  greetings,  and  Beth  tells  Andy  that  she  is  going  to  blow  up  a 
stick  of  dynamite  one  hour  later.  One  hour  later  by  Andy’s  clock,  she 
still  hasn’t  exploded  the  dynamite,  and  he  says  to  himself,  “She  hasn’t 
exploded  it  because  of  time  dilation.  It’s  only  been  40  minutes  for  her.” 
He  now  accelerates  suddenly  so  that  he’s  moving  at  the  same  velocity 
as  Beth.  The  time  dilation  no  longer  exists.  If  he  looks  again,  does  he 
suddenly  see  the  flash  from  the  explosion?  How  can  this  be?  Would  he 
see  her  go  through  20  minutes  of  her  life  in  fast-motion? 

D  Use  a  graph  to  resolve  the  following  relativity  paradox.  Relativity 
says  that  in  one  frame  of  reference,  event  A  could  happen  before  event 
B,  but  in  someone  else’s  frame  B  would  come  before  A.  How  can  this  be? 
Obviously  the  two  people  could  meet  up  at  A  and  talk  as  they  cruised 
past  each  other.  Wouldn’t  they  have  to  agree  on  whether  B  had  already 
happened? 

E  The  rod  in  the  figure  is  perfectly  rigid.  At  event  A,  the  hammer  strikes 
one  end  of  the  rod.  At  event  B,  the  other  end  moves.  Since  the  rod  is 
perfectly  rigid,  it  can’t  compress,  so  A  and  B  are  simultaneous.  In  frame 
2,  B  happens  before  A.  Did  the  motion  at  the  right  end  cause  the  person 
on  the  left  to  decide  to  pick  up  the  hammer  and  use  it? 

7.2.5  The  light  cone 

Given  an  event  P,  we  can  now  classify  all  the  causal  relationships 
in  which  P  can  participate.  In  Newtonian  physics,  these  relation¬ 
ships  fell  into  two  classes:  P  could  potentially  cause  any  event  that 
lay  in  its  future,  and  could  have  been  caused  by  any  event  in  its 
past.  In  relativity,  we  have  a  three-way  distinction  rather  than  a 
two-way  one.  There  is  a  third  class  of  events  that  are  too  far  away 
from  P  in  space,  and  too  close  in  time,  to  allow  any  cause  and  effect 
relationship,  since  causality’s  maximum  velocity  is  c.  Since  we’re 
working  in  units  in  which  c  =  1,  the  boundary  of  this  set  is  formed 
by  the  lines  with  slope  ±1  on  a  (t,  x )  plot.  This  is  referred  to  as  the 
light  cone,  for  reasons  that  become  more  visually  obvious  when  we 
consider  more  than  one  spatial  dimension,  figure  aa. 

Events  lying  inside  one  another’s  light  cones  are  said  to  have 
a  timelike  relationship.  Events  outside  each  other’s  light  cones  are 
spacelike  in  relation  to  one  another,  and  in  the  case  where  they  lie 
on  the  surfaces  of  each  other’s  light  cones  the  term  is  lightlike. 

7.2.6  *  The  spacetime  interval 

The  light  cone  is  an  object  of  central  importance  in  both  special 
and  general  relativity.  It  relates  the  geometry  of  spacetime  to  pos¬ 
sible  cause- and- effect  relationships  between  events.  This  is  funda¬ 
mentally  how  relativity  works:  it’s  a  geometrical  theory  of  causality. 

These  ideas  naturally  lead  us  to  ask  what  fruitful  analogies  we 
can  form  between  the  bizarre  geometry  of  spacetime  and  the  more 
familiar  geometry  of  the  Euclidean  plane.  The  light  cone  cuts  space- 
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time  into  different  regions  according  to  certain  measurements  of  re¬ 
lationships  between  points  (events).  Similarly,  a  circle  in  Euclidean 
geometry  cuts  the  plane  into  two  parts,  an  interior  and  an  exterior, 
according  to  the  measurement  of  the  distance  from  the  circle’s  cen¬ 
ter.  A  circle  stays  the  same  when  we  rotate  the  plane.  A  light  cone 
stays  the  same  when  we  change  frames  of  reference.  Let’s  build  up 
the  analogy  more  explicitly. 

Measurement  in  Euclidean  geometry 

We  say  that  two  line  segments  are  congruent,  AB  =  CD,  if  the 
distance  between  points  A  and  B  is  the  same  as  the  distance 
between  C  and  D,  as  measured  by  a  rigid  ruler. 

Measurement  in  spacetime 

We  define  AB  =  CD  if: 

1.  AB  and  CD  are  both  spacelike,  and  the  two  distances  are  equal 
as  measured  by  a  rigid  ruler,  in  a  frame  where  the  two  events 
touch  the  ruler  simultaneously. 

2.  AB  and  CD  are  both  timelike,  and  the  two  time  intervals  are 
equal  as  measured  by  clocks  moving  inertially. 

3.  AB  and  CD  are  both  lightlike. 

The  three  parts  of  the  relativistic  version  each  require  some  jus¬ 
tification. 

Case  1  has  to  be  the  way  it  is  because  space  is  part  of  space- 
time.  In  special  relativity,  this  space  is  Euclidean,  so  the  definition 
of  congruence  has  to  agree  with  the  Euclidean  definition,  in  the  case 
where  it  is  possible  to  apply  the  Euclidean  definition.  The  spacelike 
relation  between  the  points  is  both  necessary  and  sufficient  to  make 
this  possible.  If  points  A  and  B  are  spacelike  in  relation  to  one 
another,  then  a  frame  of  reference  exists  in  which  they  are  simulta¬ 
neous,  so  we  can  use  a  ruler  that  is  at  rest  in  that  frame  to  measure 
their  distance.  If  they  are  lightlike  or  timelike,  then  no  such  frame 
of  reference  exists.  For  example,  there  is  no  frame  of  reference  in 
which  Charles  VII ’s  restoration  to  the  throne  is  simultaneous  with 
Joan  of  Arc’s  execution,  so  we  can’t  arrange  for  both  of  these  events 
to  touch  the  same  ruler  at  the  same  time. 

The  definition  in  case  2  is  the  only  sensible  way  to  proceed  if 
we  are  to  respect  the  symmetric  treatment  of  time  and  space  in 
relativity.  The  timelike  relation  between  the  events  is  necessary  and 
sufficient  to  make  it  possible  for  a  clock  to  move  from  one  to  the 
other.  It  makes  a  difference  that  the  clocks  move  inertially,  because 
the  twins  in  example  1  on  p.  395  disagree  on  the  clock  time  between 
the  traveling  twin’s  departure  and  return. 

Case  3  may  seem  strange,  since  it  says  that  any  two  lightlike 
intervals  are  congruent.  But  this  is  the  only  possible  definition, 
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because  this  case  can  be  obtained  as  a  limit  of  the  timelike  one. 
Suppose  that  AB  is  a  timelike  interval,  but  in  the  planet  earth’s 
frame  of  reference  it  would  be  necessary  to  travel  at  almost  the 
speed  of  light  in  order  to  reach  B  from  A.  The  required  speed  is  less 
than  c  (i.e.,  less  than  1)  by  some  tiny  amount  e.  In  the  earth’s  frame, 
the  clock  referred  to  in  the  definition  suffers  extreme  time  dilation. 
The  time  elapsed  on  the  clock  is  very  small.  As  e  approaches  zero, 
and  the  relationship  between  A  and  B  approaches  a  lightlike  one, 
this  clock  time  approaches  zero.  In  this  sense,  the  relativistic  notion 
of  “distance”  is  very  different  from  the  Euclidean  one.  In  Euclidean 
geometry,  the  distance  between  two  points  can  only  be  zero  if  they 
are  the  same  point. 

The  case  splitting  involved  in  the  relativistic  definition  is  a  little 
ugly.  Having  worked  out  the  physical  interpretation,  we  can  now 
consolidate  the  definition  in  a  nicer  way  by  appealing  to  Cartesian 
coordinates. 

Cartesian  definition  of  distance  in  Euclidean  geometry 

Given  a  vector  (Ax,  Ay)  from  point  A  to  point  B,  the  square 
of  the  distance  between  them  is  defined  as  AB“  =  Ax2  +  Ay2. 

Definition  of  the  interval  in  relativity 

Given  points  separated  by  coordinate  differences  Ax,  Ay,  A z, 
and  At,  the  spacetime  interval  X  (cursive  letter  “I” )  between 
them  is  defined  as  X  =  At2  —  Ax2  —  Ay2  —  A  z2 . 

This  is  stated  in  natural  units,  so  all  four  terms  on  the  right-hand 
side  have  the  same  units;  in  metric  units  with  c  /  1,  appropriate 
factors  of  c  should  be  inserted  in  order  to  make  the  units  of  the 
terms  agree.  The  interval  X  is  positive  if  AB  is  timelike  (regardless 
of  which  event  comes  first),  zero  if  lightlike,  and  negative  if  spacelike. 
Since  X  can  be  negative,  we  can’t  in  general  take  its  square  root  and 
define  a  real  number  AB  as  in  the  Euclidean  case.  When  the  interval 
is  timelike,  we  can  interpret  \fZ  as  a  time,  and  when  it’s  spacelike 
we  can  take  \J — X  to  be  a  distance. 

The  Euclidean  definition  of  distance  (i.e.,  the  Pythagorean  the¬ 
orem)  is  useful  because  it  gives  the  same  answer  regardless  of  how 
we  rotate  the  plane.  Although  it  is  stated  in  terms  of  a  certain 
coordinate  system,  its  result  is  unambiguously  defined  because  it  is 
the  same  regardless  of  what  coordinate  system  we  arbitrarily  pick. 
Similarly,  X  is  useful  because,  as  proved  in  example  8  below,  it  is 
the  same  regardless  of  our  frame  of  reference,  i.e.,  regardless  of  our 
choice  of  coordinates. 

Pioneer  10  example  7 

>  The  Pioneer  1 0  space  probe  was  launched  in  1 972,  and  in  1 973 
was  the  first  craft  to  fly  by  the  planet  Jupiter.  It  crossed  the  orbit 
of  the  planet  Neptune  in  1983,  after  which  telemetry  data  were 
received  until  2002.  The  following  table  gives  the  spacecraft’s 
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position  relative  to  the  sun  at  exactly  midnight  on  January  1 ,  1983 
and  January  1 ,  1 995.  The  1 983  date  is  taken  to  be  t  =  0. 

t  (s)  x  y  z 

0  1.784  x  1012  m  3.951  x1012m  0.237  x  1012  m 

3.7869120000  x  108  s  2.420  x  1012  m  8.827  x  1012  m  0.488  x  1012  m 

Compare  the  time  elapsed  on  the  spacecraft  to  the  time  in  a  frame 
of  reference  tied  to  the  sun. 

>  We  can  convert  these  data  into  natural  units,  with  the  distance 
unit  being  the  second  (i.e.,  a  light-second,  the  distance  light  trav¬ 
els  in  one  second)  and  the  time  unit  being  seconds.  Converting 
and  carrying  out  this  subtraction,  we  have: 

At  (s)  Ax  Ay  A z 

3.78691 20000  x  1 08  s  0.2121  x  1 04  s  1.626  x  104  s  0.084  x  104  s 

Comparing  the  exponents  of  the  temporal  and  spatial  numbers, 
we  can  see  that  the  spacecraft  was  moving  at  a  velocity  on  the 
order  of  1 0~4  of  the  speed  of  light,  so  relativistic  effects  should  be 
small  but  not  completely  negligible. 

Since  the  interval  is  timelike,  we  can  take  its  square  root  and 
interpret  it  as  the  time  elapsed  on  the  spacecraft.  The  result  is 
\jx  =  3.786911996  x  108  s.  This  is  0.4  s  less  than  the  time 
elapsed  in  the  sun’s  frame  of  reference. 


Invariance  of  the  interval  example  8 

In  this  example  we  prove  that  the  interval  is  the  same  regard¬ 
less  of  what  frame  of  reference  we  compute  it  in.  This  is  called 
“Lorentz  invariance.”  The  proof  is  limited  to  the  timelike  case. 
Given  events  A  and  B,  construct  the  light-rectangle  as  defined  in 
figure  ab/1.  On  p.  393  we  proved  that  the  Lorentz  transformation 
doesn’t  change  the  area  of  a  shape  in  the  x-t  plane.  Therefore 
the  area  of  this  rectangle  is  unchanged  if  we  switch  to  the  frame 
of  reference  ab/2,  in  which  A  and  B  occurred  at  the  same  location 
and  were  separated  by  a  time  interval  r.  This  area  equals  half 
the  interval  1  between  A  and  B.  But  a  straightforward  calculation 
shows  that  the  rectangle  in  ab/1  also  has  an  area  equal  to  half 
the  interval  calculated  in  that  frame.  Since  the  area  in  any  frame 


ab  /  Light-rectangles,  example  8. 

1.  The  gray  light-rectangle  repre¬ 
sents  the  set  of  all  events  such  as 
P  that  could  be  visited  after  A  and 
before  B. 

2.  The  rectangle  becomes  a 
square  in  the  frame  in  which  A 
and  B  occur  at  the  same  location 
in  space. 

3.  The  area  of  the  dashed  square 
is  t2,  so  the  area  of  the  gray 
square  is  t2/2. 
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equals  half  the  interval,  and  the  area  is  the  same  in  all  frames, 
the  interval  is  equal  in  all  frames  as  well. 


A  numerical  example  of  invariance  example  9 

Figure  ac  shows  two  frames  of  reference  in  motion  relative  to 
one  another  at  v  =  3/5.  (For  this  velocity,  the  stretching  and 
squishing  of  the  main  diagonals  are  both  by  a  factor  of  2.)  Events 
are  marked  at  coordinates  that  in  the  frame  represented  by  the 
square  are 


(t,x)  =  (  0,0)  and 
(f,x)  =  (13,11). 


The  interval  between  these  events  is  1 32  —  1 1 2  =  48.  In  the 
frame  represented  by  the  parallelogram,  the  same  two  events  lie 
at  coordinates 


(t',x')  =  (  0,0)  and 

(f/,x/)  =  (8,4). 


Calculating  the  interval  using  these  values,  the  result  is 
82  -  42  =  48,  which  comes  out  the  same  as  in  the  other  frame. 
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7.2.7  *  Four-vectors  and  the  inner  product 

Example  7  makes  it  natural  that  we  define  a  type  of  vector  with 
four  components,  the  first  one  relating  to  time  and  the  others  being 
spatial.  These  are  known  as  four-vectors.  It’s  clear  how  we  should 
define  the  equivalent  of  a  dot  product  in  relativity: 

A  B  =  AtBt  -  AXBX  -  AyBy  -  AZBZ 

The  term  “dot  product”  has  connotations  of  referring  only  to  three- 
vectors,  so  the  operation  of  taking  the  scalar  product  of  two  four- 
vectors  is  usually  referred  to  instead  as  the  “inner  product.”  The 
spacetime  interval  can  then  be  thought  of  as  the  inner  product  of  a 
four- vector  with  itself.  We  care  about  the  relativistic  inner  product 
for  exactly  the  same  reason  we  care  about  its  Euclidean  version;  both 
are  scalars,  so  they  have  a  fixed  value  regardless  of  what  coordinate 
system  we  choose. 


The  twin  paradox  example  1 0 

Alice  and  Betty  are  identical  twins.  Betty  goes  on  a  space  voyage 
at  relativistic  speeds,  traveling  away  from  the  earth  and  then  turn¬ 
ing  around  and  coming  back.  Meanwhile,  Alice  stays  on  earth. 
When  Betty  returns,  she  is  younger  than  Alice  because  of  rela¬ 
tivistic  time  dilation  (example  1 ,  p.  395). 

But  isn’t  it  valid  to  say  that  Betty’s  spaceship  is  standing  still  and 
the  earth  moving?  In  that  description,  wouldn’t  Alice  end  up 
younger  and  Betty  older?  This  is  referred  to  as  the  “twin  paradox.” 
It  can’t  really  be  a  paradox,  since  it’s  exactly  what  was  observed 
in  the  Hafele-Keating  experiment  (p.  385). 

Betty’s  track  in  the  x-t  plane  (her  “world-line”  in  relativistic  jargon) 
consists  of  vectors  b  and  c  strung  end-to-end  (figure  ad).  We 
could  adopt  a  frame  of  reference  in  which  Betty  was  at  rest  during 
b  (i.e. ,  bx  =  0),  but  there  is  no  frame  in  which  b  and  c  are  parallel, 
so  there  is  no  frame  in  which  Betty  was  at  rest  during  both  b  and 
c.  This  resolves  the  paradox. 

We  have  already  established  by  other  methods  that  Betty  ages 
less  that  Alice,  but  let’s  see  how  this  plays  out  in  a  simple  numer¬ 
ical  example.  Omitting  units  and  making  up  simple  numbers,  let’s 
say  that  the  vectors  in  figure  ad  are 

a  =  (6,1) 
b  =  (3, 2) 
c  =  (3,-1), 

where  the  components  are  given  in  the  order  (t,x).  The  time 
experienced  by  Alice  is  then 

|a|  =  >/  62  —  1 2  =  5.9, 


ad  /  Example  10. 
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ae  /  Example  11. 


which  is  greater  than  the  Betty’s  elapsed  time 

|b|  +  |c|  =  V32  -22  +  V32-(— 1)2  =  5.1. 

Simultaneity  using  inner  products  example  1 1 

Suppose  that  an  observer  O  moves  inertially  along  a  vector  o, 
and  let  the  vector  separating  two  events  P  and  Q  be  s.  O  judges 
these  events  to  be  simultaneous  if  o  ■  s  =  0.  To  see  why  this  is 
true,  suppose  we  pick  a  coordinate  system  as  defined  by  O.  In 
this  coordinate  system,  O  considers  herself  to  be  at  rest,  so  she 
says  her  vector  has  only  a  time  component,  o  =  (At,  0,0,0).  If 
she  considers  P  and  Q  to  be  simultaneous,  then  the  vector  from 
P  to  Q  is  of  the  form  (0,  Ax,  Ay,  Az).  The  inner  product  is  then 
zero,  since  each  of  the  four  terms  vanishes.  Since  the  inner  prod¬ 
uct  is  independent  of  the  choice  of  coordinate  system,  it  doesn’t 
matter  that  we  chose  one  tied  to  O  herself.  Any  other  observer 
O'  can  look  at  O’s  motion,  note  that  o  ■  s  =  0,  and  infer  that  O 
must  consider  P  and  Q  to  be  simultaneous,  even  if  O'  says  they 
weren’t. 


af  /  The  pattern  of  waves  made 
by  a  point  source  moving  to  the 
right  across  the  water.  Note 
the  shorter  wavelength  of  the 
forward-emitted  waves  and 
the  longer  wavelength  of  the 
backward-going  ones. 


7.2.8  x  Doppler  shifts  of  light  and  addition  of  velocities 

When  Doppler  shifts  happen  to  ripples  on  a  pond  or  the  sound 
waves  from  an  airplane,  they  can  depend  on  the  relative  motion  of 
three  different  objects:  the  source,  the  receiver,  and  the  medium. 
But  light  waves  don’t  have  a  medium.  Therefore  Doppler  shifts 
of  light  can  only  depend  on  the  relative  motion  of  the  source  and 
observer. 

One  simple  case  is  the  one  in  which  the  relative  motion  of  the 
source  and  the  receiver  is  perpendicular  to  the  line  connecting  them. 
That  is,  the  motion  is  transverse.  Nonrelativistic  Doppler  shifts  hap¬ 
pen  because  the  distance  between  the  source  and  receiver  is  chang¬ 
ing,  so  in  nonrelativistic  physics  we  don’t  expect  any  Doppler  shift 
at  all  when  the  motion  is  transverse,  and  this  is  what  is  in  fact  ob¬ 
served  to  high  precision.  For  example,  the  photo  shows  shortened 
and  lengthened  wavelengths  to  the  right  and  left,  along  the  source’s 
line  of  motion,  but  an  observer  above  or  below  the  source  measures 
just  the  normal,  unshifted  wavelength  and  frequency.  But  relativis- 
tically,  we  have  a  time  dilation  effect,  so  for  light  waves  emitted 
transversely,  there  is  a  Doppler  shift  of  I/7  in  frequency  (or  7  in 
wavelength) . 

The  other  simple  case  is  the  one  in  which  the  relative  motion  of 
the  source  and  receiver  is  longitudinal,  i.e. ,  they  are  either  approach¬ 
ing  or  receding  from  one  another.  For  example,  distant  galaxies  are 
receding  from  our  galaxy  due  to  the  expansion  of  the  universe,  and 
this  expansion  was  originally  detected  because  Doppler  shifts  toward 
the  red  (low-frequency)  end  of  the  spectrum  were  observed. 

Nonrelativistically,  we  would  expect  the  light  from  such  a  galaxy 
to  be  Doppler  shifted  down  in  frequency  by  some  factor,  which 
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would  depend  on  the  relative  velocities  of  three  different  objects:  the 
source,  the  wave’s  medium,  and  the  receiver.  Relativistically,  things 
get  simpler,  because  light  isn’t  a  vibration  of  a  physical  medium,  so 
the  Doppler  shift  can  only  depend  on  a  single  velocity  v,  which  is 
the  rate  at  which  the  separation  between  the  source  and  the  receiver 
is  increasing. 

The  square  in  figure  ah  is  the  “graph  paper”  used  by  someone 
who  considers  the  source  to  be  at  rest,  while  the  parallelogram  plays 
a  similar  role  for  the  receiver.  The  figure  is  drawn  for  the  case  where 
v  =  3/5  (in  units  where  c  =  1),  and  in  this  case  the  stretch  factor 
of  the  long  diagonal  is  2.  To  keep  the  area  the  same,  the  short 
diagonal  has  to  be  squished  to  half  its  original  size.  But  now  it’s  a 
matter  of  simple  geometry  to  show  that  OP  equals  half  the  width 
of  the  square,  and  this  tells  us  that  the  Doppler  shift  is  a  factor  of 
1/2  in  frequency.  That  is,  the  squish  factor  of  the  short  diagonal  is 
interpreted  as  the  Doppler  shift.  To  get  this  as  a  general  equation  for 
velocities  other  than  3/5,  one  can  show  by  straightforward  fiddling 
with  the  result  of  part  c  of  problem  7  on  p.  444  that  the  Doppler 
shift  is 


D(v) 


./'Ey 

V  l+v 


Here  v  >  0  is  the  case  where  the  source  and  receiver  are  getting 
farther  apart,  v  <  0  the  case  where  they  are  approaching.  (This  is 
the  opposite  of  the  sign  convention  used  in  subsection  6.1.5.  It  is 
convenient  to  change  conventions  here  so  that  we  can  use  positive 
values  of  v  in  the  case  of  cosmological  red-shifts,  which  are  the  most 
important  application.) 

Suppose  that  Alice  stays  at  home  on  earth  while  her  twin  Betty 
takes  off  in  her  rocket  ship  at  3/5  of  the  speed  of  light.  When  I 
first  learned  relativity,  the  thing  that  caused  me  the  most  pain  was 
understanding  how  each  observer  could  say  that  the  other  was  the 
one  whose  time  was  slow.  It  seemed  to  me  that  if  I  could  take  a 
pill  that  would  speed  up  my  mind  and  my  body,  then  naturally  I 
would  see  everybody  else  as  being  slow.  Shouldn’t  the  same  apply 
to  relativity?  But  suppose  Alice  and  Betty  get  on  the  radio  and  try 
to  settle  who  is  the  fast  one  and  who  is  the  slow  one.  Each  twin’s 
voice  sounds  slooooowed  doooowwwwn  to  the  other.  If  Alice  claps 
her  hands  twice,  at  a  time  interval  of  one  second  by  her  clock,  Betty 
hears  the  hand-claps  coming  over  the  radio  two  seconds  apart,  but 
the  situation  is  exactly  symmetric,  and  Alice  hears  the  same  thing  if 
Betty  claps.  Each  twin  analyzes  the  situation  using  a  diagram  iden¬ 
tical  to  ah,  and  attributes  her  sister’s  observations  to  a  complicated 
combination  of  time  distortion,  the  time  taken  by  the  radio  signals 
to  propagate,  and  the  motion  of  her  twin  relative  to  her. 


ag  /  A  graphical  representa¬ 
tion  of  the  Lorentz  transformation 
for  a  velocity  of  (3/5 )c.  The  long 
diagonal  is  stretched  by  a  factor 
of  two,  the  short  one  is  half  its 
former  length,  and  the  area  is  the 
same  as  before. 


o  P 


ah  /  At  event  O,  the  source 
and  the  receiver  are  on  top  of 
each  other,  so  as  the  source 
emits  a  wave  crest,  it  is  received 
without  any  time  delay.  At  P,  the 
source  emits  another  wave  crest, 
and  at  Q  the  receiver  receives  it. 
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Turn  your  book  upside-down  and  reinterpret  figure  ah.  >  Answer,  p. 
964 


A  symmetry  property  of  the  Doppler  effect  example  12 

Suppose  that  A  and  B  are  at  rest  relative  to  one  another,  but  C  is 
moving  along  the  line  between  A  and  B.  A  transmits  a  signal  to  C, 
who  then  retransmits  it  to  B.  The  signal  accumulates  two  Doppler 
shifts,  and  the  result  is  their  product  D( v)D(-v).  But  this  product 
must  equal  1,  so  we  must  have  D(-v)D(v)  =  1,  which  can  be 
verified  directly  from  the  equation. 

The  Ives-Stilwell  experiment  example  13 

The  result  of  example  1 2  was  the  basis  of  one  of  the  earliest  labo¬ 
ratory  tests  of  special  relativity,  by  Ives  and  Stilwell  in  1938.  They 
observed  the  light  emitted  by  excited  by  a  beam  of  H|  and  HJ 
ions  with  speeds  of  a  few  tenths  of  a  percent  of  c.  Measuring 
the  light  from  both  ahead  of  and  behind  the  beams,  they  found 
that  the  product  of  the  Doppler  shifts  D(v)D(-v)  was  equal  to  1 , 
as  predicted  by  relativity.  If  relativity  had  been  false,  then  one 
would  have  expected  the  product  to  differ  from  1  by  an  amount 
that  would  have  been  detectable  in  their  experiment.  In  2003, 
Saathoff  et  al.  carried  out  an  extremely  precise  version  of  the 
Ives-Stilwell  technique  with  Li+  ions  moving  at  6.4%  of  c.  The 
frequencies  observed,  in  units  of  MHz,  were: 


fo 

W-v) 
f0D(v) 
\/ f0D(—v)  ■  f0D(v) 


=  546466918.8  ±0.4 

(unshifted  frequency) 
=  582490203.44  ±  .09 
(shifted  frequency,  forward) 
=  512671442.9  ±0.5 
(shifted  frequency,  backward) 
=  546466918.6  ±0.3 


The  results  show  incredibly  precise  agreement  between  f0  and 
\J f0D(-v )  •  f0D(v),  as  expected  relativistically  because  D(v)D(-v) 
is  supposed  to  equal  1.  The  agreement  extends  to  9  significant 
figures,  whereas  if  relativity  had  been  false  there  should  have 
been  a  relative  disagreement  of  about  v2  =  .004,  i.e.,  a  discrep¬ 
ancy  in  the  third  significant  figure.  The  spectacular  agreement 
with  theory  has  made  this  experiment  a  lightning  rod  for  anti¬ 
relativity  kooks. 


We  saw  on  p.  398  that  relativistic  velocities  should  not  be  ex¬ 
pected  to  be  exactly  additive,  and  problem  1  on  p.  443  verifies  this 
in  the  special  case  where  A  moves  relative  to  B  at  0.6c  and  B  relative 
to  C  at  0.6c  —  the  result  not  being  1.2c.  The  relativistic  Doppler 
shift  provides  a  simple  way  of  deriving  a  general  equation  for  the 
relativistic  combination  of  velocities;  problem  17  on  p.  447  guides 
you  through  the  steps  of  this  derivation,  and  the  result  is  given  on 
p.  978. 
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7.3  Dynamics 

So  far  we  have  said  nothing  about  how  to  predict  motion  in  relativ¬ 
ity.  Do  Newton’s  laws  still  work?  Do  conservation  laws  still  apply? 
The  answer  is  yes,  but  many  of  the  definitions  need  to  be  modified, 
and  certain  entirely  new  phenomena  occur,  such  as  the  equivalence 
of  energy  and  mass,  as  described  by  the  famous  equation  E  =  me2. 


7.3.1  Momentum 

Consider  the  following  scheme  for  traveling  faster  than  the  speed 
of  light.  The  basic  idea  can  be  demonstrated  by  dropping  a  ping- 
pong  ball  and  a  baseball  stacked  on  top  of  each  other  like  a  snowman. 
They  separate  slightly  in  mid-air,  and  the  baseball  therefore  has  time 
to  hit  the  floor  and  rebound  before  it  collides  with  the  ping-pong 
ball,  which  is  still  on  the  way  down.  The  result  is  a  surprise  if  you 
haven’t  seen  it  before:  the  ping-pong  ball  flies  off  at  high  speed  and 
hits  the  ceiling!  A  similar  fact  is  known  to  people  who  investigate 
the  scenes  of  accidents  involving  pedestrians.  If  a  car  moving  at 
90  kilometers  per  hour  hits  a  pedestrian,  the  pedestrian  flies  off  at 
nearly  double  that  speed,  180  kilometers  per  hour.  Now  suppose 
the  car  was  moving  at  90  percent  of  the  speed  of  light.  Would  the 
pedestrian  fly  off  at  180%  of  c? 


To  see  why  not,  we  have  to  back  up  a  little  and  think  about 
where  this  speed-doubling  result  comes  from.  For  any  collision,  there 
is  a  special  frame  of  reference,  the  center-of-mass  frame,  in  which 
the  two  colliding  objects  approach  each  other,  collide,  and  rebound 
with  their  velocities  reversed.  In  the  center-of-mass  frame,  the  total 
momentum  of  the  objects  is  zero  both  before  and  after  the  collision. 
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a  /  An  unequal  collision,  viewed  in 
the  center-of-mass  frame,  1,  and 
in  the  frame  where  the  small  ball 
is  initially  at  rest,  2.  The  motion 
is  shown  as  it  would  appear  on 
the  film  of  an  old-fashioned  movie 
camera,  with  an  equal  amount  of 
time  separating  each  frame  from 
the  next.  Film  1  was  made  by 
a  camera  that  tracked  the  center 
of  mass,  film  2  by  one  that  was 
initially  tracking  the  small  ball, 
and  kept  on  moving  at  the  same 
speed  after  the  collision. 


Figure  a/1  shows  such  a  frame  of  reference  for  objects  of  very 
unequal  mass.  Before  the  collision,  the  large  ball  is  moving  relatively 
slowly  toward  the  top  of  the  page,  but  because  of  its  greater  mass, 
its  momentum  cancels  the  momentum  of  the  smaller  ball,  which  is 
moving  rapidly  in  the  opposite  direction.  The  total  momentum  is 
zero.  After  the  collision,  the  two  balls  just  reverse  their  directions  of 
motion.  We  know  that  this  is  the  right  result  for  the  outcome  of  the 
collision  because  it  conserves  both  momentum  and  kinetic  energy, 
and  everything  not  forbidden  is  compulsory,  i.e.,  in  any  experiment, 
there  is  only  one  possible  outcome,  which  is  the  one  that  obeys  all 
the  conservation  laws. 

self-check  C 

How  do  we  know  that  momentum  and  kinetic  energy  are  conserved  in 
figure  a/1  ?  >  Answer,  p.  964 

Let’s  make  up  some  numbers  as  an  example.  Say  the  small  ball 
has  a  mass  of  1  kg,  the  big  one  8  kg.  In  frame  1,  let’s  make  the 
velocities  as  follows: 


before  the  collision 

after  the  collision 

o 

-0.8 

0.8 

O 

0.1 

-0.1 

Figure  a/2  shows  the  same  collision  in  a  frame  of  reference  where 
the  small  ball  was  initially  at  rest.  To  find  all  the  velocities  in  this 
frame,  we  just  add  0.8  to  all  the  ones  in  the  previous  table. 


before  the  collision 

after  the  collision 

o 

0 

1.6 

O 

0.9 

0.7 
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In  this  frame,  as  expected,  the  small  ball  flies  off  with  a  velocity, 

1.6,  that  is  almost  twice  the  initial  velocity  of  the  big  ball,  0.9. 

If  all  those  velocities  were  in  meters  per  second,  then  that’s  ex¬ 
actly  what  happened.  But  what  if  all  these  velocities  were  in  units 
of  the  speed  of  light?  Now  it’s  no  longer  a  good  approximation 
just  to  add  velocities.  We  need  to  combine  them  according  to  the 
relativistic  rules.  For  instance,  the  technique  used  in  problem  1  on 
p.  443  can  be  used  to  show  that  combining  a  velocity  of  0.8  times 
the  speed  of  light  with  another  velocity  of  0.8  results  in  0.98,  not 

1.6.  The  results  are  very  different: 


before  the  collision 

after  the  collision 

o 

0 

0.98 

O 

0.83 

0.76 

b  /  An  8-kg  ball  moving  at  83% 
of  the  speed  of  light  hits  a  1  -kg 
ball.  The  balls  appear  foreshort¬ 
ened  due  to  the  relativistic  distor¬ 
tion  of  space. 


We  can  interpret  this  as  follows.  Figure  a/1  is  one  in  which  the 
big  ball  is  moving  fairly  slowly.  This  is  very  nearly  the  way  the 
scene  would  be  seen  by  an  ant  standing  on  the  big  ball.  According 
to  an  observer  in  frame  b,  however,  both  balls  are  moving  at  nearly 
the  speed  of  light  after  the  collision.  Because  of  this,  the  balls 
appear  foreshortened,  but  the  distance  between  the  two  balls  is  also 
shortened.  To  this  observer,  it  seems  that  the  small  ball  isn’t  pulling 
away  from  the  big  ball  very  fast. 

Now  here’s  what’s  interesting  about  all  this.  The  outcome  shown 
in  figure  a/2  was  supposed  to  be  the  only  one  possible,  the  only 
one  that  satisfied  both  conservation  of  energy  and  conservation  of 
momentum.  So  how  can  the  different  result  shown  in  figure  b  be 
possible?  The  answer  is  that  relativistically,  momentum  must  not 
equal  mv.  The  old,  familiar  definition  is  only  an  approximation 
that’s  valid  at  low  speeds.  If  we  observe  the  behavior  of  the  small 
ball  in  figure  b,  it  looks  as  though  it  somehow  had  some  extra  inertia. 
It’s  as  though  a  football  player  tried  to  knock  another  player  down 
without  realizing  that  the  other  guy  had  a  three-hundred-pound  bag 
full  of  lead  shot  hidden  under  his  uniform  —  he  just  doesn’t  seem 
to  react  to  the  collision  as  much  as  he  should.  As  proved  in  section 
7.3.4,  this  extra  inertia  is  described  by  redefining  momentum  as 

p  =  m'yv. 
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c  /  Example  14. 


0.6  0.7  0.8 


v 

d/Two  early  high-precision 
tests  of  the  relativistic  equation 
p  =  rrryv  for  momentum.  Graph¬ 
ing  p/m  rather  than  p  allows  the 
data  for  electrons  and  protons  to 
be  placed  on  the  same  graph. 
Natural  units  are  used,  so  that 
the  horizontal  axis  is  the  velocity 
in  units  of  c,  and  the  vertical  axis 
is  the  unitless  quantity  p/mc. 
The  very  small  error  bars  for 
the  data  point  from  Zrelov  are 
represented  by  the  height  of  the 
black  rectangle. 


At  very  low  velocities,  7  is  close  to  1,  and  the  result  is  very  nearly 
mv,  as  demanded  by  the  correspondence  principle.  But  at  very  high 
velocities,  7  gets  very  big  —  the  small  ball  in  figure  b  has  a  7  of 
5.0,  and  therefore  has  five  times  more  inertia  than  we  would  expect 
nonrelativistically. 

This  also  explains  the  answer  to  another  paradox  often  posed 
by  beginners  at  relativity.  Suppose  you  keep  on  applying  a  steady 
force  to  an  object  that’s  already  moving  at  0.9999c.  Why  doesn’t 
it  just  keep  on  speeding  up  past  c?  The  answer  is  that  force  is  the 
rate  of  change  of  momentum.  At  0.9999c,  an  object  already  has  a  7 
of  71,  and  therefore  has  already  sucked  up  71  times  the  momentum 
you’d  expect  at  that  speed.  As  its  velocity  gets  closer  and  closer  to 
c,  its  7  approaches  infinity.  To  move  at  c,  it  would  need  an  infinite 
momentum,  which  could  only  be  caused  by  an  infinite  force. 

Push  as  hard  as  you  like  . . .  example  14 

We  don’t  have  to  depend  on  our  imaginations  to  see  what  would 
happen  if  we  kept  on  applying  a  force  to  an  object  indefinitely  and 
tried  to  accelerate  it  past  c.  A  nice  experiment  of  this  type  was 
done  by  Bertozzi  in  1964.  In  this  experiment,  electrons  were  ac¬ 
celerated  by  an  electric  field  E  through  a  distance  t\.  Applying 
Newton’s  laws  gives  Newtonian  predictions  a/v  for  the  accelera¬ 
tion  and  tN  for  the  time  required.4 

The  electrons  were  then  allowed  to  fly  down  a  pipe  for  a  further 
distance  £ 2  =  8.4  m  without  being  acted  on  by  any  force.  The 
time  of  flight  t2  for  this  second  distance  was  used  to  find  the  final 
velocity  1/  =  4/fe  to  which  they  had  actually  been  accelerated. 

Figure  c  shows  the  results.5  According  to  Newton,  an  accelera¬ 
tion  aN  acting  for  a  time  tN  should  produce  a  final  velocity  aNtN. 
The  solid  line  in  the  graph  shows  the  prediction  of  Newton’s  laws, 
which  is  that  a  constant  force  exerted  steadily  over  time  will  pro¬ 
duce  a  velocity  that  rises  linearly  and  without  limit. 

The  experimental  data,  shown  as  black  dots,  clearly  tell  a  different 
story.  The  velocity  never  goes  above  a  certain  maximum  value, 
which  we  identify  as  c.  The  dashed  line  shows  the  predictions 
of  special  relativity,  which  are  in  good  agreement  with  the  experi¬ 
mental  results. 

Figure  d  shows  experimental  data  confirming  the  relativistic 
equation  for  momentum. 


4Newton’s  second  law  gives  a/v  =  F/m  =  eE/m.  The  constant-acceleration 
equation  Ax  =  (1  /2 )af2  then  gives  fw  =  sj 2m£-\  JeE. 

5To  make  the  low-energy  portion  of  the  graph  legible,  Bertozzi’s  highest- 
energy  data  point  is  omitted. 
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7.3.2  Equivalence  of  mass  and  energy 

Now  we’re  ready  to  see  why  mass  and  energy  must  be  equivalent 
as  claimed  in  the  famous  E  =  rue2.  So  far  we’ve  only  considered 
collisions  in  which  none  of  the  kinetic  energy  is  converted  into  any 
other  form  of  energy,  such  as  heat  or  sound.  Let’s  consider  what 
happens  if  a  blob  of  putty  moving  at  velocity  v  hits  another  blob 
that  is  initially  at  rest,  sticking  to  it.  The  nonrelativistic  result  is 
that  to  obey  conservation  of  momentum  the  two  blobs  must  fly  off 
together  at  v/2.  Half  of  the  initial  kinetic  energy  has  been  converted 
to  heat.6 

Relativistically,  however,  an  interesting  thing  happens.  A  hot 
object  has  more  momentum  than  a  cold  object!  This  is  because 
the  relativistically  correct  expression  for  momentum  is  rrr/v ,  and 
the  more  rapidly  moving  atoms  in  the  hot  object  have  higher  values 
of  7.  In  our  collision,  the  final  combined  blob  must  therefore  be 
moving  a  little  more  slowly  than  the  expected  u/2,  since  otherwise 
the  final  momentum  would  have  been  a  little  greater  than  the  initial 
momentum.  To  an  observer  who  believes  in  conservation  of  momen¬ 
tum  and  knows  only  about  the  overall  motion  of  the  objects  and  not 
about  their  heat  content,  the  low  velocity  after  the  collision  would 
seem  to  be  the  result  of  a  magical  change  in  the  mass,  as  if  the  mass 
of  two  combined,  hot  blobs  of  putty  was  more  than  the  sum  of  their 
individual  masses. 

Now  we  know  that  the  masses  of  all  the  atoms  in  the  blobs  must 
be  the  same  as  they  always  were.  The  change  is  due  to  the  change  in 
7  with  heating,  not  to  a  change  in  mass.  The  heat  energy,  however, 
seems  to  be  acting  as  if  it  was  equivalent  to  some  extra  mass. 

But  this  whole  argument  was  based  on  the  fact  that  heat  is  a 
form  of  kinetic  energy  at  the  atomic  level.  Would  E  =  me2  apply  to 
other  forms  of  energy  as  well?  Suppose  a  rocket  ship  contains  some 
electrical  energy  stored  in  a  battery.  If  we  believed  that  E  =  me2 
applied  to  forms  of  kinetic  energy  but  not  to  electrical  energy,  then 
we  would  have  to  believe  that  the  pilot  of  the  rocket  could  slow 
the  ship  down  by  using  the  battery  to  run  a  heater!  This  would 
not  only  be  strange,  but  it  would  violate  the  principle  of  relativity, 
because  the  result  of  the  experiment  would  be  different  depending 
on  whether  the  ship  was  at  rest  or  not.  The  only  logical  conclusion  is 
that  all  forms  of  energy  are  equivalent  to  mass.  Running  the  heater 
then  has  no  effect  on  the  motion  of  the  ship,  because  the  total 
energy  in  the  ship  was  unchanged;  one  form  of  energy  (electrical) 
was  simply  converted  to  another  (heat). 

The  equation  E  =  me 2  tells  us  how  much  energy  is  equivalent 

6  A  double-mass  object  moving  at  half  the  speed  does  not  have  the  same 
kinetic  energy.  Kinetic  energy  depends  on  the  square  of  the  velocity,  so  cutting 
the  velocity  in  half  reduces  the  energy  by  a  factor  of  1/4,  which,  multiplied  by 
the  doubled  mass,  makes  1/2  the  original  energy. 
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to  how  much  mass:  the  conversion  factor  is  the  square  of  the  speed 
of  light,  c.  Since  c  a  big  number,  you  get  a  really  really  big  number 
when  you  multiply  it  by  itself  to  get  c2 .  This  means  that  even  a  small 
amount  of  mass  is  equivalent  to  a  very  large  amount  of  energy. 


LIGHTSALLASKEW 
ffl  THE  HEAVENS 


e  /  Example  15,  page  420. 


Men  of  Science  More  or  Less 
Agog  Over  Results  of  Eclipse 
Observations. 


EINSTEIN  THEORY  TRIUMPHS 


Stars  Not  Where  They  Seemed 
or  Were  Calculated  to  be, 
but  Nobody  Need  Worry. 


A  BOOK  FOR  12  WISE  MEN 


No  More  in  All  the  World  Could 
Comprehend  It,  Said  Einstein  When 
Hie  Daring  Publishers  Accepted  It. 


f/A  New  York  Times  head¬ 
line  from  November  10,  1919, 
describing  the  observations 
discussed  in  example  15. 


Gravity  bending  light  example  15 

Gravity  is  a  universal  attraction  between  things  that  have  mass, 
and  since  the  energy  in  a  beam  of  light  is  equivalent  to  some 
very  small  amount  of  mass,  we  expect  that  light  will  be  affected 
by  gravity,  although  the  effect  should  be  very  small.  The  first  im¬ 
portant  experimental  confirmation  of  relativity  came  in  1919  when 
stars  next  to  the  sun  during  a  solar  eclipse  were  observed  to  have 
shifted  a  little  from  their  ordinary  position.  (If  there  was  no  eclipse, 
the  glare  of  the  sun  would  prevent  the  stars  from  being  observed.) 
Starlight  had  been  deflected  by  the  sun’s  gravity.  Figure  e  is  a 
photographic  negative,  so  the  circle  that  appears  bright  is  actu¬ 
ally  the  dark  face  of  the  moon,  and  the  dark  area  is  really  the 
bright  corona  of  the  sun.  The  stars,  marked  by  lines  above  and 
below  them,  appeared  at  positions  slightly  different  than  their  nor¬ 
mal  ones. 

Black  holes  example  16 

A  star  with  sufficiently  strong  gravity  can  prevent  light  from  leav¬ 
ing.  Quite  a  few  black  holes  have  been  detected  via  their  gravita¬ 
tional  forces  on  neighboring  stars  or  clouds  of  gas  and  dust. 


420 


Chapter  7  Relativity 


You’ve  learned  about  conservation  of  mass  and  conservation  of 
energy,  but  now  we  see  that  they’re  not  even  separate  conservation 
laws.  As  a  consequence  of  the  theory  of  relativity,  mass  and  en¬ 
ergy  are  equivalent,  and  are  not  separately  conserved  —  one  can 
be  converted  into  the  other.  Imagine  that  a  magician  waves  his 
wand,  and  changes  a  bowl  of  dirt  into  a  bowl  of  lettuce.  You’d  be 
impressed,  because  you  were  expecting  that  both  dirt  and  lettuce 
would  be  conserved  quantities.  Neither  one  can  be  made  to  vanish, 
or  to  appear  out  of  thin  air.  However,  there  are  processes  that  can 
change  one  into  the  other.  A  farmer  changes  dirt  into  lettuce,  and 
a  compost  heap  changes  lettuce  into  dirt.  At  the  most  fundamen¬ 
tal  level,  lettuce  and  dirt  aren’t  really  different  things  at  all;  they’re 
just  collections  of  the  same  kinds  of  atoms  —  carbon,  hydrogen,  and 
so  on.  Because  mass  and  energy  are  like  two  different  sides  of  the 
same  coin,  we  may  speak  of  mass-energy,  a  single  conserved  quantity, 
found  by  adding  up  all  the  mass  and  energy,  with  the  appropriate 
conversion  factor:  E  +  me2. 

A  rusting  nail  example  1 7 

o  An  iron  nail  is  left  in  a  cup  of  water  until  it  turns  entirely  to  rust. 
The  energy  released  is  about  0.5  MJ.  In  theory,  would  a  suffi¬ 
ciently  precise  scale  register  a  change  in  mass?  If  so,  how  much? 

>  The  energy  will  appear  as  heat,  which  will  be  lost  to  the  envi¬ 
ronment.  The  total  mass-energy  of  the  cup,  water,  and  iron  will 
indeed  be  lessened  by  0.5  MJ.  (If  it  had  been  perfectly  insulated, 
there  would  have  been  no  change,  since  the  heat  energy  would 
have  been  trapped  in  the  cup.)  The  speed  of  light  is  c  =  3  x  108 
meters  per  second,  so  converting  to  mass  units,  we  have 

E 

m= 

c2 

_  0.5  x  106  J 

(3  x  108  m/s)2 
=  6  x  10-12  kilograms. 

The  change  in  mass  is  too  small  to  measure  with  any  practical 
technique.  This  is  because  the  square  of  the  speed  of  light  is 
such  a  large  number. 

Electron-positron  annihilation  example  18 

Natural  radioactivity  in  the  earth  produces  positrons,  which  are 
like  electrons  but  have  the  opposite  charge.  A  form  of  antimat¬ 
ter,  positrons  annihilate  with  electrons  to  produce  gamma  rays,  a 
form  of  high-frequency  light.  Such  a  process  would  have  been 
considered  impossible  before  Einstein,  because  conservation  of 
mass  and  energy  were  believed  to  be  separate  principles,  and 
this  process  eliminates  100%  of  the  original  mass.  The  amount 
of  energy  produced  by  annihilating  1  kg  of  matter  with  1  kg  of 
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g/Top:  A  PET  scanner.  Middle: 
Each  positron  annihilates  with  an 
electron,  producing  two  gamma- 
rays  that  fly  off  back-to-back. 
When  two  gamma  rays  are  ob¬ 
served  simultaneously  in  the  ring 
of  detectors,  they  are  assumed  to 
come  from  the  same  annihilation 
event,  and  the  point  at  which  they 
were  emitted  must  lie  on  the  line 
connecting  the  two  detectors. 
Bottom:  A  scan  of  a  person’s 
torso.  The  body  has  concentrated 
the  radioactive  tracer  around  the 
stomach,  indicating  an  abnormal 
medical  condition. 


antimatter  is 


E  =  me2 

=  (2  kg)  (3.0  x  108  m/s)2 
=  2  x  1 017  J, 

which  is  on  the  same  order  of  magnitude  as  a  day’s  energy  con¬ 
sumption  for  the  entire  world’s  population! 

Positron  annihilation  forms  the  basis  for  the  medical  imaging  tech¬ 
nique  called  a  PET  (positron  emission  tomography)  scan,  in  which 
a  positron-emitting  chemical  is  injected  into  the  patient  and  map¬ 
ped  by  the  emission  of  gamma  rays  from  the  parts  of  the  body 
where  it  accumulates. 

One  commonly  hears  some  misinterpretations  of  E  =  me2,  one 
being  that  the  equation  tells  us  how  much  kinetic  energy  an  object 
would  have  if  it  was  moving  at  the  speed  of  light.  This  wouldn’t 
make  much  sense,  both  because  the  equation  for  kinetic  energy  has 
1/2  in  it,  KE  =  (1/2 )mv2,  and  because  a  material  object  can’t  be 
made  to  move  at  the  speed  of  light.  However,  this  naturally  leads 
to  the  question  of  just  how  much  mass-energy  a  moving  object  has. 
We  know  that  when  the  object  is  at  rest,  it  has  no  kinetic  energy,  so 
its  mass-energy  is  simply  equal  to  the  energy-equivalent  of  its  mass, 
me2, 

£  =  me 2  when  v  =  0, 

where  the  symbol  £  (cursive  “E” )  stands  for  mass-energy.  The  point 
of  using  the  new  symbol  is  simply  to  remind  ourselves  that  we’re 
talking  about  relativity,  so  an  object  at  rest  has  £  =  me2,  not  E  =  0 
as  we’d  assume  in  nonrelativistic  physics. 

Suppose  we  start  accelerating  the  object  with  a  constant  force. 
A  constant  force  means  a  constant  rate  of  transfer  of  momentum, 
but  p  =  m'yv  approaches  infinity  as  v  approaches  c,  so  the  object 
will  only  get  closer  and  closer  to  the  speed  of  light,  but  never  reach 
it.  Now  what  about  the  work  being  done  by  the  force?  The  force 
keeps  doing  work  and  doing  work,  which  means  that  we  keep  on 
using  up  energy.  Mass-energy  is  conserved,  so  the  energy  being 
expended  must  equal  the  increase  in  the  object’s  mass-energy.  We 
can  continue  this  process  for  as  long  as  we  like,  and  the  amount  of 
mass-energy  will  increase  without  limit.  We  therefore  conclude  that 
an  object’s  mass-energy  approaches  infinity  as  its  speed  approaches 
the  speed  of  light, 

£  — >  00  when  v  — >  c. 

Now  that  we  have  some  idea  what  to  expect,  what  is  the  actual 
equation  for  the  mass-energy?  As  proved  in  section  7.3.4,  it  is 

£  =  m'ye2 . 
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self-check  D 

Verify  that  this  equation  has  the  two  properties  we  wanted.  > 

Answer,  p.  964 

KE  compared  to  me 2  at  low  speeds  example  19 

>  An  object  is  moving  at  ordinary  nonrelativistic  speeds.  Compare 
its  kinetic  energy  to  the  energy  me2  it  has  purely  because  of  its 
mass. 

o  The  speed  of  light  is  a  very  big  number,  so  me2  is  a  huge  num¬ 
ber  of  joules.  The  object  has  a  gigantic  amount  of  energy  be¬ 
cause  of  its  mass,  and  only  a  relatively  small  amount  of  additional 
kinetic  energy  because  of  its  motion. 

Another  way  of  seeing  this  is  that  at  low  speeds,  y  is  only  a  tiny 
bit  greater  than  1 ,  so  £  is  only  a  tiny  bit  greater  than  me2. 

The  correspondence  principle  for  mass-energy  example  20 

>  Show  that  the  equation  £  =  myc 2  obeys  the  correspondence 
principle. 

>  As  we  accelerate  an  object  from  rest,  its  mass-energy  becomes 
greater  than  its  resting  value.  Classically,  we  interpret  this  excess 
mass-energy  as  the  object’s  kinetic  energy, 

KE  =  £(v)-£(v  =  0) 

=  myc2  -  me2 
=  m(y  -  1  )c2. 

1  /2 

Expressing  y  as  (1  -  i/2/c2)-  7  and  making  use  of  the  approx¬ 
imation  (1  +  e)p  «  1  +  pe  for  small  e,  we  have  y  «  1  +  v2/2 c2, 
so 

KE*m(1+^-1)c2 

1  2 

=  2  mv  ’ 

which  is  the  classical  expression.  As  demanded  by  the  corre¬ 
spondence  principle,  relativity  agrees  with  classical  physics  at 
speeds  that  are  small  compared  to  the  speed  of  light. 

7.3.3  *  The  energy-momentum  four-vector 

Starting  from  £  =  my  and  p  =  myv,  a  little  algebra  allows  one 
to  prove  the  identity 

2  c'2  2 

m  =  £  —  p  . 

We  can  define  an  energy-momentum  four-vector, 

P  =  (£,Px,Py,Pz), 

and  the  relation  m2  =  £2  —  p2  then  arises  from  the  inner  product 
p-p.  Since  £  and  p  are  separately  conserved,  the  energy- momentum 
four- vector  is  also  conserved. 
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Results  from  Meyer  et  al,  1963 

v  Y 

0.9870  1.0002(5) 

0.9881  1.0012(5) 

0.9900  0.9998(5) 

which  according  to  special  relativity  should  equal  1.  Their  results, 
tabulated  in  the  sidebar,  show  excellent  agreement  with  theory. 

Energy  and  momentum  of  light  example  21 

Light  has  m  =  0  and  y  =  oo,  so  if  we  try  to  apply  £  =  my  and 
p  =  rrryv  to  light,  or  to  any  massless  particle,  we  get  the  indeter¬ 
minate  form  0-oo,  which  can’t  be  evaluated  without  a  delicate  and 
laborious  evaluation  of  limits  as  in  problem  11  on  p.  446. 

Applying  m2  =  £2  -  p2  yields  the  same  result,  £  =  \p\,  much  more 
easily.  This  example  demonstrates  that  although  we  encountered 
the  relations  £  =  my  and  p  =  myv  first,  the  identity  m2  =  £2  -  p2 
is  actually  more  fundamental. 

Figure  q  on  p.  711  shows  an  experiment  that  verified  £  =  \p\ 
empirically. 

For  the  reasons  given  in  example  21,  we  take  m 2  =  £2  —  p2 
to  be  the  definition  of  mass  in  relativity.  One  thing  to  be  careful 
about  is  that  this  definition  is  not  additive.  Suppose  that  we  lump 
two  systems  together  and  call  them  one  big  system,  adding  their 
mass-energies  and  momenta.  When  we  do  this,  the  mass  of  the 
combination  is  not  the  same  as  the  sum  of  the  masses.  For  example, 
suppose  we  have  two  rays  of  light  moving  in  opposite  directions, 
with  energy- momentum  vectors  (£,  £,  0, 0)  and  (£,  —£,  0,  0).  Adding 
these  gives  (2£,0, 0),  which  implies  a  mass  equal  to  2£.  In  fact,  in 
the  early  universe,  where  the  density  of  light  was  high,  the  universe’s 
ambient  gravitational  fields  were  mainly  those  caused  by  the  light 
it  contained. 

Mass-energy,  not  energy,  goes  in  the  energy-momentum  four- 
vector  example  22 

When  we  say  that  something  is  a  four-vector,  we  mean  that  it 
behaves  properly  under  a  Lorentz  transformation:  we  can  draw 
such  a  four-vector  on  graph  paper,  and  then  when  we  change 
frames  of  reference,  we  should  be  able  to  measure  the  vector  in 
the  new  frame  of  reference  by  using  the  new  version  of  the  graph- 
paper  grid  derived  from  the  old  one  by  a  Lorentz  transformation. 

If  we  had  used  the  energy  E  rather  than  the  mass-energy  £  to 
construct  the  energy-momentum  four-vector,  we  wouldn’t  have 
gotten  a  valid  four-vector.  An  easy  way  to  see  this  is  to  consider 
the  case  where  a  noninteracting  object  is  at  rest  in  some  frame 
of  reference.  Its  momentum  and  kinetic  energy  are  both  zero.  If 
we’d  defined  p  =  {E,  px,  py,  pz)  rather  than  p  =  (£,  px,  py,  pz),  we 


A  high-precision  test  of  this  fundamental  relativistic  relationship 
was  carried  out  by  Meyer  et  al.  in  1963  by  studying  the  motion  of 
electrons  in  static  electric  and  magnetic  fields.  They  define  the 
quantity 

2  £2 
Y2=  2, 

m-  +  p- 
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would  have  had  p  =  0  in  this  frame.  But  when  we  draw  a  zero  vec¬ 
tor,  we  get  a  point,  and  a  point  remains  a  point  regardless  of  how 
we  distort  the  graph  paper  we  use  to  measure  it.  That  wouldn’t 
have  made  sense,  because  in  other  frames  of  reference,  we  have 

E¥  o. 

Metric  units  example  23 

The  relation  m2  =  £2  -  p 2  is  only  valid  in  relativistic  units.  If 
we  tried  to  apply  it  without  modification  to  numbers  expressed  in 
metric  units,  we  would  have 


„2  m4  o  m2 


kg'  =kg  -Tr-kg  --p-, 


which  would  be  nonsense  because  the  three  terms  all  have  dif¬ 
ferent  units.  As  usual,  we  need  to  insert  factors  of  c  to  make  a 
metric  version,  and  these  factors  of  c  are  determined  by  the  need 
to  fix  the  broken  units: 


2  4 

me 


£ 2  -  p2c2 


Pair  production  requires  matter  example  24 

Example  18  on  p.  421  discussed  the  annihilation  of  an  electron 
and  a  positron  into  two  gamma  rays,  which  is  an  example  of  turn¬ 
ing  matter  into  pure  energy.  An  opposite  example  is  pair  produc¬ 
tion,  a  process  in  which  a  gamma  ray  disappears,  and  its  energy 
goes  into  creating  an  electron  and  a  positron. 

Pair  production  cannot  happen  in  a  vacuum.  For  example,  gamma 
rays  from  distant  black  holes  can  travel  through  empty  space  for 
thousands  of  years  before  being  detected  on  earth,  and  they  don’t 
turn  into  electron-positron  pairs  before  they  can  get  here.  Pair 
production  can  only  happen  in  the  presence  of  matter.  When 
lead  is  used  as  shielding  against  gamma  rays,  one  of  the  ways 
the  gamma  rays  can  be  stopped  in  the  lead  is  by  undergoing  pair 
production. 

To  see  why  pair  production  is  forbidden  in  a  vacuum,  consider  the 
process  in  the  frame  of  reference  in  which  the  electron-positron 
pair  has  zero  total  momentum.  In  this  frame,  the  gamma  ray 
would  have  to  have  had  zero  momentum,  but  a  gamma  ray  with 
zero  momentum  must  have  zero  energy  as  well  (example  21). 
This  means  that  conservation  of  /bur-momentum  has  been  vio¬ 
lated:  the  timelike  component  of  the  four-momentum  is  the  mass- 
energy,  and  it  has  increased  from  0  in  the  initial  state  to  at  least 
2 me2  in  the  final  state. 
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7.3.4  *  Proofs 


This  optional  section  proves  some  results  claimed  earlier. 
Ultrarelativistic  motion 

We  start  by  considering  the  case  of  a  particle,  described  as  “ul¬ 
trarelativistic,”  that  travels  at  very  close  to  the  speed  of  light.  A 
good  way  of  thinking  about  such  a  particle  is  that  it’s  one  with  a 
very  small  mass.  For  example,  the  subatomic  particle  called  the 
neutrino  has  a  very  small  mass,  thousands  of  times  smaller  than 
that  of  the  electron.  Neutrinos  are  emitted  in  radioactive  decay, 
and  because  the  neutrino’s  mass  is  so  small,  the  amount  of  energy 
available  in  these  decays  is  always  enough  to  accelerate  it  to  very 
close  to  the  speed  of  light.  Nobody  has  ever  succeeded  in  observing 
a  neutrino  that  was  not  ultrarelativistic.  When  a  particle’s  mass  is 
very  small,  the  mass  becomes  difficult  to  measure.  For  almost  70 
years  after  the  neutrino  was  discovered,  its  mass  was  thought  to  be 
zero.  Similarly,  we  currently  believe  that  a  ray  of  light  has  no  mass, 
but  it  is  always  possible  that  its  mass  will  be  found  to  be  nonzero 
at  some  point  in  the  future.  A  ray  of  light  can  be  modeled  as  an 
ultrarelativistic  particle. 

Let’s  compare  ultrarelativistic  particles  with  train  cars.  A  single 
car  with  kinetic  energy  E  has  different  properties  than  a  train  of  two 
cars  each  with  kinetic  energy  E/2.  The  single  car  has  half  the  mass 
and  a  speed  that  is  greater  by  a  factor  of  \[2.  But  the  same  is  not 
true  for  ultrarelativistic  particles.  Since  an  idealized  ultrarelativistic 
particle  has  a  mass  too  small  to  be  detectable  in  any  experiment, 
we  can’t  detect  the  difference  between  m  and  2 m.  Furthermore, 
ultrarelativistic  particles  move  at  close  to  c,  so  there  is  no  observable 
difference  in  speed.  Thus  we  expect  that  a  single  ultrarelativistic 
particle  with  energy  E  compared  with  two  such  particles,  each  with 
energy  E/2,  should  have  all  the  same  properties  as  measured  by  a 
mechanical  detector. 

An  idealized  zero-mass  particle  also  has  no  frame  in  which  it 
can  be  at  rest.  It  always  travels  at  c,  and  no  matter  how  fast  we 
chase  after  it,  we  can  never  catch  up.  We  can,  however,  observe 
it  in  different  frames  of  reference,  and  we  will  find  that  its  energy 
is  different.  For  example,  distant  galaxies  are  receding  from  us  at 
substantial  fractions  of  c,  and  when  we  observe  them  through  a 
telescope,  they  appear  very  dim  not  just  because  they  are  very  far 
away  but  also  because  their  light  has  less  energy  in  our  frame  than 
in  a  frame  at  rest  relative  to  the  source.  This  effect  must  be  such 
that  changing  frames  of  reference  according  to  a  specific  Lorentz 
transformation  always  changes  the  energy  of  the  particle  by  a  fixed 
factor,  regardless  of  the  particle’s  original  energy;  for  if  not,  then 
the  effect  of  a  Lorentz  transformation  on  a  single  particle  of  energy 
E  would  be  different  from  its  effect  on  two  particles  of  energy  E/2. 
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How  does  this  energy-shift  factor  depend  on  the  velocity  v  of  the 
Lorentz  transformation?  Rather  than  v,  it  becomes  more  convenient 
to  express  things  in  terms  of  the  Doppler  shift  factor  D,  which  mul¬ 
tiplies  when  we  change  frames  of  reference.  Let’s  write  f(D)  for  the 
energy-shift  factor  that  results  from  a  given  Lorentz  transformation. 
Since  a  Lorentz  transformation  D\  followed  by  a  second  transforma¬ 
tion  D2  is  equivalent  to  a  single  transformation  by  D\D-2,  we  must 
have  /(-D1D2)  =  f  (D\) f  (D2) ■  This  tightly  constrains  the  form  of 
the  function  /;  it  must  be  something  like  f(D )  =  sn,  where  n  is  a 
constant.  We  postpone  until  p.  428  the  proof  that  n  =  1,  which  is 
also  in  agreement  with  experiments  with  rays  of  light. 

Our  final  result  is  that  the  energy  of  an  ultrarelativistic  particle 
is  simply  proportional  to  its  Doppler  shift  factor  D.  Even  in  the 
case  where  the  particle  is  truly  massless,  so  that  D  doesn’t  have 
any  finite  value,  we  can  still  find  how  the  energy  differs  according  to 
different  observers  by  finding  the  D  of  the  Lorentz  transformation 
between  the  two  observers’  frames  of  reference. 


Energy 

The  following  argument  is  due  to  Einstein.  Suppose  that  a  ma¬ 
terial  object  O  of  mass  m,  initially  at  rest  in  a  certain  frame  A, 
emits  two  rays  of  light,  each  with  energy  E/2.  By  conservation  of 
energy,  the  object  must  have  lost  an  amount  of  energy  equal  to  E. 
By  symmetry,  O  remains  at  rest. 

We  now  switch  to  a  new  frame  of  reference  moving  at  a  certain 
velocity  v  in  the  z  direction  relative  to  the  original  frame.  We  assume 
that  O’s  energy  is  different  in  this  frame,  but  that  the  change  in  its 
energy  amounts  to  multiplication  by  some  unitless  factor  x,  which 
depends  only  on  v,  since  there  is  nothing  else  it  could  depend  on  that 
could  allow  us  to  form  a  unitless  quantity.  In  this  frame  the  light 
rays  have  energies  ED(v )  and  ED(—v).  If  conservation  of  energy  is 
to  hold  in  the  new  frame  as  it  did  in  the  old,  we  must  have  2 xE  = 
ED(v )  +  ED(—v).  After  some  algebra,  we  find  x  =  l/Vl  —  v2, 
which  we  recognize  as  7.  This  proves  that  E  =  my  for  a  material 
object. 

Momentum 

We’ve  seen  that  ultrarelativistic  particles  are  “generic,”  in  the 
sense  that  they  have  no  individual  mechanical  properties  other  than 
an  energy  and  a  direction  of  motion.  Therefore  the  relationship 
between  energy  and  momentum  must  be  linear  for  ultrarelativistic 
particles.  Indeed,  experiments  verify  that  light  has  momentum,  and 
doubling  the  energy  of  a  ray  of  light  doubles  its  momentum  rather 
than  quadrupling  it.  On  a  graph  of  p  versus  E.  massless  particles, 
which  have  E  0 c  |p|,  lie  on  two  diagonal  lines  that  connect  at  the 
origin.  If  we  like,  we  can  pick  units  such  that  the  slopes  of  these 
lines  are  plus  and  minus  one.  Material  particles  he  to  the  right  of 


h  /  In  the  p-E  plane,  mass¬ 
less  particles  lie  on  the  two 
diagonals,  while  particles  with 
mass  lie  to  the  right. 
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these  lines.  For  example,  a  car  sitting  in  a  parking  lot  has  p  =  0 
and  E  =  me2. 

Now  what  happens  to  such  a  graph  when  we  change  to  a  dif¬ 
ferent  frame  or  reference  that  is  in  motion  relative  to  the  original 
frame?  A  massless  particle  still  has  to  act  like  a  massless  particle, 
so  the  diagonals  are  simply  stretched  or  contracted  along  their  own 
lengths.  In  fact  the  transformation  must  be  linear  (p.  391),  be¬ 
cause  conservation  of  energy  and  momentum  involve  addition,  and 
we  need  these  laws  to  be  valid  in  all  frames  of  reference.  By  the 
same  reasoning  as  in  figure  j  on  p.  393,  the  transformation  must  be 
area-preserving.  We  then  have  the  same  three  cases  to  consider  as 
in  figure  g  on  p.  392.  Case  I  is  ruled  out  because  it  would  imply 
that  particles  keep  the  same  energy  when  we  change  frames.  (This 
is  what  would  happen  if  c  were  infinite,  so  that  the  mass-equivalent 
E/c2  of  a  given  energy  was  zero,  and  therefore  E  would  be  inter¬ 
preted  purely  as  the  mass.)  Case  II  can’t  be  right  because  it  doesn’t 
preserve  the  E  =  \p\  diagonals.  We  are  left  with  case  III,  which  es¬ 
tablishes  the  fact  that  the  p-E  plane  transforms  according  to  exactly 
the  same  kind  of  Lorentz  transformation  as  the  x-t  plane.  That  is, 
( Ej  ,  px ,  Py ,  P z )  is  a  four- vector. 

The  only  remaining  issue  to  settle  is  whether  the  choice  of  units 
that  gives  invariant  45-degree  diagonals  in  the  x-t  plane  is  the  same 
as  the  choice  of  units  that  gives  such  diagonals  in  the  p-E  plane. 
That  is,  we  need  to  establish  that  the  c  that  applies  to  x  and  t  is 
equal  to  the  d  needed  for  p  and  E,  i.e. ,  that  the  velocity  scales  of  the 
two  graphs  are  matched  up.  This  is  true  because  in  the  Newtonian 
limit,  the  total  mass-energy  E  is  essentially  just  the  particle’s  mass, 
and  then  p/E  ~  p/m  ~  v.  This  establishes  that  the  velocity  scales 
are  matched  at  small  velocities,  which  implies  that  they  coincide  for 
all  velocities,  since  a  large  velocity,  even  one  approaching  c,  can  be 
built  up  from  many  small  increments.  (This  also  establishes  that 
the  exponent  n  defined  on  p.  427  equals  1  as  claimed.) 

Since  m 2  =  E2  —  p2,  it  follows  that  for  a  material  particle,  p  = 
m.'yv. 
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7.4  ★  General  Relativity 

What  you’ve  learned  so  far  about  relativity  is  known  as  the 
special  theory  of  relativity,  which  is  compatible  with  three  of  the  four 
known  forces  of  nature:  electromagnetism,  the  strong  nuclear  force, 
and  the  weak  nuclear  force.  Gravity,  however,  can’t  be  shoehorned 
into  the  special  theory.  In  order  to  make  gravity  work,  Einstein  had 
to  generalize  relativity.  The  resulting  theory  is  known  as  the  general 
theory  of  relativity.7 

7.4.1  Our  universe  isn’t  Euclidean 

Euclid  proved  thousands  of  years  ago  that  the  angles  in  a  triangle 
add  up  to  180°.  But  what  does  it  really  mean  to  “prove”  this? 
Euclid  proved  it  based  on  certain  assumptions  (his  five  postulates), 
listed  in  the  margin  of  this  page.  But  how  do  we  know  that  the 
postulates  are  true? 

Only  by  observation  can  we  tell  whether  any  of  Euclid’s  state¬ 
ments  are  correct  characterizations  of  how  space  actually  behaves 
in  our  universe.  If  we  draw  a  triangle  on  paper  with  a  ruler  and 
measure  its  angles  with  a  protractor,  we  will  quickly  verify  to  pretty 
good  precision  that  the  sum  is  close  to  180°.  But  of  course  we  al¬ 
ready  knew  that  space  was  at  least  approximately  Euclidean.  If 
there  had  been  any  gross  error  in  Euclidean  geometry,  it  would  have 
been  detected  in  Euclid’s  own  lifetime.  The  correspondence  princi¬ 
ple  tells  us  that  if  there  is  going  to  be  any  deviation  from  Euclidean 
geometry,  it  must  be  small  under  ordinary  conditions. 

To  improve  the  precision  of  the  experiment,  we  need  to  make 
sure  that  our  ruler  is  very  straight.  One  way  to  check  would  be  to 
sight  along  it  by  eye,  which  amounts  to  comparing  its  straightness 
to  that  of  a  ray  of  light.  For  that  matter,  we  might  as  well  throw 
the  physical  ruler  in  the  trash  and  construct  our  triangle  out  of 
three  laser  beams.  To  avoid  effects  from  the  air  we  should  do  the 
experiment  in  outer  space.  Doing  it  in  space  also  has  the  advantage 
of  allowing  us  to  make  the  triangle  very  large;  as  shown  in  figure 
a,  the  discrepancy  from  180°  is  expected  to  be  proportional  to  the 
area  of  the  triangle. 

But  we  already  know  that  light  rays  are  bent  by  gravity.  We 
expect  it  based  on  E  =  me2,  which  tells  us  that  the  energy  of  a 
light  ray  is  equivalent  to  a  certain  amount  of  mass,  and  furthermore 
it  has  been  verified  experimentally  by  the  deflection  of  starlight  by 
the  sun  (example  15,  p.  420).  We  therefore  know  that  our  universe 
is  noneuclidean,  and  we  gain  the  further  insight  that  the  level  of 


7Einstein  originally  described  the  distinction  between  the  two  theories  by 
saying  that  the  special  theory  applied  to  nonaccelerating  frames  of  reference, 
while  the  general  one  allowed  any  frame  at  all.  The  modern  consensus  is  that 
Einstein  was  misinterpreting  his  own  theory,  and  that  special  relativity  actually 
handles  accelerating  frames  just  fine. 


Postulates  of  Euclidean  geome¬ 
try: 

1.  Two  points  determine  a  line. 

2.  Line  segments  can  be  ex¬ 
tended. 

3.  A  unique  circle  can  be  con¬ 
structed  given  any  point  as  its 
center  and  any  line  segment  as 
its  radius. 

4.  All  right  angles  are  equal  to 
one  another. 

5.  Given  a  line  and  a  point  not 
on  the  line,  no  more  than  one 
line  can  be  drawn  through  the 
point  and  parallel  to  the  given 
line. 


a  /  Noneuclidean  effects,  such  as 
the  discrepancy  from  180°  in  the 
sum  of  the  angles  of  a  triangle, 
are  expected  to  be  proportional 
to  area.  Here,  a  noneuclidean 
equilateral  triangle  is  cut  up  into 
four  smaller  equilateral  triangles, 
each  with  1/4  the  area.  As  proved 
in  problem  22,  the  discrepancy 
is  quadrupled  when  the  area  is 
quadrupled. 
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b/An  Einstein’s  ring.  The 
distant  object  is  a  quasar, 
MG1 131 +0456,  and  the  one 
in  the  middle  is  an  unknown 
object,  possibly  a  supermassive 
black  hole.  The  intermediate 
object’s  gravity  focuses  the  rays 
of  light  from  the  distant  one. 
Because  the  entire  arrangement 
lacks  perfect  axial  symmetry,  the 
ring  is  nonuniform;  most  of  its 
brightness  is  concentrated  in  two 
lumps  on  opposite  sides. 


deviation  from  Euclidean  behavior  depends  on  gravity. 

Since  the  noneuclidean  effects  are  bigger  when  the  system  being 
studied  is  larger,  we  expect  them  to  be  especially  important  in  the 
study  of  cosmology,  where  the  distance  scales  are  very  large. 

Einstein’s  ring  example  25 

An  Einstein’s  ring,  figure  b,  is  formed  when  there  is  a  chance 
alignment  of  a  distant  source  with  a  closer  gravitating  body.  This 
type  of  gravitational  lensing  is  direct  evidence  for  the  noneuclidean 
nature  of  space.  The  two  light  rays  are  lines,  and  they  violate  Eu¬ 
clid’s  first  postulate,  that  two  points  determine  a  line. 

One  could  protest  that  effects  like  these  are  just  an  imperfection 
of  the  light  rays  as  physical  models  of  straight  lines.  Maybe  the 
noneuclidean  effects  would  go  away  if  we  used  something  better  and 
straighter  than  a  light  ray.  But  we  don’t  know  of  anything  straighter 
than  a  light  ray.  Furthermore,  we  observe  that  all  measuring  devices, 
not  just  optical  ones,  report  the  same  noneuclidean  behavior. 

Curvature 

An  example  of  such  a  non-optical  measurement  is  the  Gravity 
Probe  B  satellite,  figure  d,  which  was  launched  into  a  polar  orbit 
in  2004  and  operated  until  2010.  The  probe  carried  four  gyroscopes 
made  of  quartz,  which  were  the  most  perfect  spheres  ever  manu¬ 
factured,  varying  from  sphericity  by  no  more  than  about  40  atoms. 
Each  gyroscope  floated  weightlessly  in  a  vacuum,  so  that  its  rota¬ 
tion  was  perfectly  steady.  After  5000  orbits,  the  gyroscopes  had 
reoriented  themselves  by  about  2  x  10_3°  relative  to  the  distant 
stars.  This  effect  cannot  be  explained  by  Newtonian  physics,  since 
no  torques  acted  on  them.  It  was,  however,  exactly  as  predicted  by 
Einstein’s  theory  of  general  relativity.  It  becomes  easier  to  see  why 
such  an  effect  would  be  expected  due  to  the  noneuclidean  nature  of 
space  if  we  characterize  euclidean  geometry  as  the  geometry  of  a  flat 
plane  as  opposed  to  a  curved  one.  On  a  curved  surface  like  a  sphere, 
figure  c,  Euclid’s  fifth  postulate  fails,  and  it’s  not  hard  to  see  that 
we  can  get  triangles  for  which  the  sum  of  the  angles  is  not  180°. 
By  transporting  a  gyroscope  all  the  way  around  the  edges  of  such  a 
triangle  and  back  to  its  starting  point,  we  change  its  orientation. 


The  triangle  in  figure  c  has  angles  that  add  up  to  more  than 
180°.  This  type  of  curvature  is  referred  to  as  positive.  It  is  also 
possible  to  have  negative  curvature,  as  in  figure  e. 

In  general  relativity,  curvature  isn’t  just  something  caused  by 
gravity.  Gravity  is  curvature,  and  the  curvature  involves  both  space 
and  time,  as  may  become  clearer  once  you  get  to  figure  k.  Thus  the 
distinction  between  special  and  general  relativity  is  that  general  rela¬ 
tivity  handles  curved  spacetime,  while  special  relativity  is  restricted 
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c  /  Left:  A  90-90-90  triangle.  Its  angles  add  up  to  more  than  180°. 
Middle :  The  triangle  “pops”  off  the  page  visually.  We  intuitively  want 
to  visualize  it  as  lying  on  a  curved  surface  such  as  the  earth’s.  Right : 
A  gyroscope  carried  smoothly  around  its  perimeter  ends  up  having 
changed  its  orientation  when  it  gets  back  to  its  starting  point. 


to  the  case  where  spacetime  is  flat. 

Curvature  doesn ’t  require  higher  dimensions 

Although  we  often  visualize  curvature  by  imagining  embedding 
a  two-dimensional  surface  in  a  three-dimensional  space,  that’s  just 
an  aid  in  visualization.  There  is  no  evidence  for  any  additional 
dimensions,  nor  is  it  necessary  to  hypothesize  them  in  order  to  let 
spacetime  be  curved  as  described  in  general  relativity. 


f/Only  measurements  from  within  the  plane  define  whether  the 
plane  is  curved.  It  could  look  curved  when  drawn  embedded  in  three 
dimensions,  but  nevertheless  still  be  intrinsically  flat. 


Put  yourself  in  the  shoes  of  a  two-dimensional  being  living  in  a 
two-dimensional  space.  Euclid’s  postulates  all  refer  to  constructions 
that  can  be  performed  using  a  compass  and  an  unmarked  straight¬ 
edge.  If  this  being  can  physically  verify  them  all  as  descriptions  of 
the  space  she  inhabits,  then  she  knows  that  her  space  is  Euclidean, 
and  that  propositions  such  as  the  Pythagorean  theorem  are  physi¬ 
cally  valid  in  her  universe.  But  the  diagram  in  f/1  illustrating  the 
proof  of  the  Pythagorean  theorem  in  Euclid’s  Elements  (proposi¬ 
tion  1.47)  is  equally  valid  if  the  page  is  rolled  onto  a  cylinder,  2,  or 
formed  into  a  wavy  corrugated  shape,  3.  These  types  of  curvature, 
which  can  be  achieved  without  tearing  or  crumpling  the  surface, 
are  not  real  to  her.  They  are  simply  side-effects  of  visualizing  her 


d  /  Gravity  Probe  B  was  in  a 
polar  orbit  around  the  earth.  As 
in  the  right  panel  of  figure  c, 
the  orientation  of  the  gyroscope 
changes  when  it  is  carried  around 
a  curve  and  back  to  its  starting 
point.  Because  the  effect  was 
small,  it  was  necessary  to  let  it 
accumulate  over  the  course  of 
5000  orbits  in  order  to  make  it 
detectable. 


e/A  triangle  in  a  space  with 
negative  curvature  has  angles 
that  add  to  less  than  180°. 
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two-dimensional  universe  as  if  it  were  embedded  in  a  hypothetical 
third  dimension  —  which  doesn’t  exist  in  any  sense  that  is  empir¬ 
ically  verifiable  to  her.  Of  the  curved  surfaces  in  figure  f,  only  the 
sphere,  4,  has  curvature  that  she  can  measure;  the  diagram  can’t  be 
plastered  onto  the  sphere  without  folding  or  cutting  and  pasting. 

So  the  observation  of  curvature  doesn’t  imply  the  existence  of  ex¬ 
tra  dimensions,  nor  does  embedding  a  space  in  a  higher-dimensional 
one  so  that  it  looks  curvy  always  mean  that  there  will  be  any  cur¬ 
vature  detectable  from  within  the  lower-dimensional  space. 

7.4.2  The  equivalence  principle 

Universality  of  free-fall 

Although  light  rays  and  gyroscopes  seem  to  agree  that  space  is 
curved  in  a  gravitational  field,  it’s  always  conceivable  that  we  could 
find  something  else  that  would  disagree.  For  example,  suppose  that 
there  is  a  new  and  improved  ray  called  the  StraightRay™ .  The 
StraightRay  is  like  a  light  ray,  but  when  we  construct  a  triangle  out 
of  StraightRays,  we  always  get  the  Euclidean  result  for  the  sum  of 
the  angles.  We  would  then  have  to  throw  away  general  relativity’s 
whole  idea  of  describing  gravity  in  terms  of  curvature.  One  good  way 
of  making  a  StraightRay  would  be  if  we  had  a  supply  of  some  kind 
of  exotic  matter  —  call  it  FloatyStuff™  —  that  had  the  ordinary 
amount  of  inertia,  but  was  completely  unaffected  by  gravity.  We 
could  then  shoot  a  stream  of  FloatyStuff  particles  out  of  a  nozzle  at 
nearly  the  speed  of  light  and  make  a  StraightRay. 

Normally  when  we  release  a  material  object  in  a  gravitational 
field,  it  experiences  a  force  mg,  and  then  by  Newton’s  second  law 
its  acceleration  is  a  =  F/m  =  mg/m  =  g.  The  ml s  cancel,  which  is 
the  reason  that  everything  falls  with  the  same  acceleration  (in  the 
absence  of  other  forces  such  as  air  resistance).  The  universality  of 
this  behavior  is  what  allows  us  to  interpret  the  gravity  geometrically 
in  general  relativity.  For  example,  the  Gravity  Probe  B  gyroscopes 
were  made  out  of  quartz,  but  if  they  had  been  made  out  of  some¬ 
thing  else,  it  wouldn’t  have  mattered.  But  if  we  had  access  to  some 
FloatyStuff,  the  geometrical  picture  of  gravity  would  fail,  because 
the  “m”  that  described  its  susceptibility  to  gravity  would  be  a  dif¬ 
ferent  “m”  than  the  one  describing  its  inertia. 

The  question  of  the  existence  or  nonexistence  of  such  forms  of 
matter  turns  out  to  be  related  to  the  question  of  what  kinds  of 
motion  are  relative.  Let’s  say  that  alien  gangsters  land  in  a  flying 
saucer,  kidnap  you  out  of  your  back  yard,  konk  you  on  the  head, 
and  take  you  away.  When  you  regain  consciousness,  you’re  locked 
up  in  a  sealed  cabin  in  their  spaceship.  You  pull  your  keychain  out 
of  your  pocket  and  release  it,  and  you  observe  that  it  accelerates 
toward  the  floor  with  an  acceleration  that  seems  quite  a  bit  slower 
than  what  you’re  used  to  on  earth,  perhaps  a  third  of  a  gee.  There 
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are  two  possible  explanations  for  this.  One  is  that  the  aliens  have 
taken  you  to  some  other  planet,  maybe  Mars,  where  the  strength  of 
gravity  is  a  third  of  what  we  have  on  earth.  The  other  is  that  your 
keychain  didn’t  really  accelerate  at  all:  you’re  still  inside  the  flying 
saucer,  which  is  accelerating  at  a  third  of  a  gee,  so  that  it  was  really 
the  deck  that  accelerated  up  and  hit  the  keys. 

There  is  absolutely  no  way  to  tell  which  of  these  two  scenarios  is 
actually  the  case  —  unless  you  happen  to  have  a  chunk  of  FloatyStuff 
in  your  other  pocket.  If  you  release  the  FloatyStuff  and  it  hovers 
above  the  deck,  then  you’re  on  another  planet  and  experiencing 
genuine  gravity;  your  keychain  responded  to  the  gravity,  but  the 
FloatyStuff  didn’t.  But  if  you  release  the  FloatyStuff  and  see  it  hit 
the  deck,  then  the  flying  saucer  is  accelerating  through  outer  space. 

The  nonexistence  of  FloatyStuff  in  our  universe  is  called  the 
equivalence  principle.  If  the  equivalence  principle  holds,  then  an  ac¬ 
celeration  (such  as  the  acceleration  of  the  flying  saucer)  is  always 
equivalent  to  a  gravitational  field,  and  no  observation  can  ever  tell 
the  difference  without  reference  to  something  external.  (And  sup¬ 
pose  you  did  have  some  external  reference  point  —  how  would  you 
know  whether  it  was  accelerating?) 

The  artificial  horizon  example  26 

The  pilot  of  an  airplane  cannot  always  easily  tell  which  way  is  up. 
The  horizon  may  not  be  level  simply  because  the  ground  has  an 
actual  slope,  and  in  any  case  the  horizon  may  not  be  visible  if  the 
weather  is  foggy.  One  might  imagine  that  the  problem  could  be 
solved  simply  by  hanging  a  pendulum  and  observing  which  way 
it  pointed,  but  by  the  equivalence  principle  the  pendulum  cannot 
tell  the  difference  between  a  gravitational  field  and  an  acceler¬ 
ation  of  the  aircraft  relative  to  the  ground  —  nor  can  any  other 
accelerometer,  such  as  the  pilot’s  inner  ear.  For  example,  when 
the  plane  is  turning  to  the  right,  accelerometers  will  be  tricked  into 
believing  that  “down”  is  down  and  to  the  left.  To  get  around  this 
problem,  airplanes  use  a  device  called  an  artificial  horizon,  which 
is  essentially  a  gyroscope.  The  gyroscope  has  to  be  initialized 
when  the  plane  is  known  to  be  oriented  in  a  horizontal  plane.  No 
gyroscope  is  perfect,  so  over  time  it  will  drift.  For  this  reason  the 
instrument  also  contains  an  accelerometer,  and  the  gyroscope  is 
always  forced  into  agreement  with  the  accelerometer’s  average 
output  over  the  preceding  several  minutes.  If  the  plane  is  flown  in 
circles  for  several  minutes,  the  artificial  horizon  will  be  fooled  into 
indicating  that  the  wrong  direction  is  vertical. 

Gravitational  Doppler  shifts  and  time  dilation 

An  interesting  application  of  the  equivalence  principle  is  the  ex¬ 
planation  of  gravitational  time  dilation.  As  described  on  p.  388, 
experiments  show  that  a  clock  at  the  top  of  a  mountain  runs  faster 


g  /  An  artificial  horizon. 
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h/1.  A  ray  of  light  is  emit¬ 
ted  upward  from  the  floor  of  the 
elevator.  The  elevator  acceler¬ 
ates  upward.  2.  By  the  time  the 
light  is  detected  at  the  ceiling,  the 
elevator  has  changed  its  velocity, 
so  the  light  is  detected  with  a 
Doppler  shift. 


i  /  Pound  and  Rebka  at  the 
top  and  bottom  of  the  tower. 


than  one  down  at  its  foot. 

To  calculate  this  effect,  we  make  use  of  the  fact  that  the  grav¬ 
itational  field  in  the  area  around  the  mountain  is  equivalent  to  an 
acceleration.  Suppose  we’re  in  an  elevator  accelerating  upward  with 
acceleration  a,  and  we  shoot  a  ray  of  light  from  the  floor  up  toward 
the  ceiling,  at  height  h.  The  time  At  it  takes  the  light  ray  to  get 
to  the  ceiling  is  about  h/c,  and  by  the  time  the  light  ray  reaches 
the  ceiling,  the  elevator  has  sped  up  by  v  =  aAt  =  ah/c ,  so  we’ll 
see  a  red-shift  in  the  ray’s  frequency.  Since  v  is  small  compared  to 
c,  we  don’t  need  to  use  the  fancy  Doppler  shift  equation  from  sub¬ 
section  7.2.8;  we  can  just  approximate  the  Doppler  shift  factor  as 
1  —  v/cm  1  —  ah/c2.  By  the  equivalence  principle,  we  should  expect 
that  if  a  ray  of  light  starts  out  low  down  and  then  rises  up  through 
a  gravitational  held  g,  its  frequency  will  be  Doppler  shifted  by  a  fac¬ 
tor  of  1  —  gh/c2 .  This  effect  was  observed  in  a  famous  experiment 
carried  out  by  Pound  and  Rebka  in  1959.  Gamma-rays  were  emit¬ 
ted  at  the  bottom  of  a  22.5-meter  tower  at  Harvard  and  detected  at 
the  top  with  the  Doppler  shift  predicted  by  general  relativity.  (See 
problem  25.) 

In  the  mountain-valley  experiment,  the  frequency  of  the  clock 
in  the  valley  therefore  appears  to  be  running  too  slowly  by  a  factor 
of  1  —  gh/c 2  when  it  is  compared  via  radio  with  the  clock  at  the 
top  of  the  mountain.  We  conclude  that  time  runs  more  slowly  when 
one  is  lower  down  in  a  gravitational  held,  and  the  slow-down  factor 
between  two  points  is  given  by  1  —  gh/c2,  where  h  is  the  difference 
in  height. 

We  have  built  up  a  picture  of  light  rays  interacting  with  grav¬ 
ity.  To  confirm  that  this  make  sense,  recall  that  we  have  already 
observed  in  subsection  7.3.3  and  in  problem  11  on  p.  446  that  light 
has  momentum.  The  equivalence  principle  says  that  whatever  has 
inertia  must  also  participate  in  gravitational  interactions.  Therefore 
light  waves  must  have  weight,  and  must  lose  energy  when  they  rise 
through  a  gravitational  field. 

Local  flatness 

The  noneuclidean  nature  of  spacetime  produces  effects  that  grow 
in  proportion  to  the  area  of  the  region  being  considered.  Interpret¬ 
ing  such  effects  as  evidence  of  curvature,  we  see  that  this  connects 
naturally  to  the  idea  that  curvature  is  undetectable  from  close  up. 
For  example,  the  curvature  of  the  earth’s  surface  is  not  normally 
noticeable  to  us  in  everyday  life.  Locally,  the  earth’s  surface  is  flat, 
and  the  same  is  true  for  spacetime. 

Local  flatness  turns  out  to  be  another  way  of  stating  the  equiv¬ 
alence  principle.  In  a  variation  on  the  alien-abduction  story,  sup¬ 
pose  that  you  regain  consciousness  aboard  the  flying  saucer  and 
find  yourself  weightless.  If  the  equivalence  principle  holds,  then 


434 


Chapter  7  Relativity 


you  have  no  way  of  determining  from  local  observations,  inside  the 
saucer,  whether  you  are  actually  weightless  in  deep  space,  or  simply 
free-falling  in  apparent  weightlessness,  like  the  astronauts  aboard 
the  International  Space  Station.  That  means  that  locally,  we  can 
always  adopt  a  free-falling  frame  of  reference  in  which  there  is  no 
gravitational  field  at  all.  If  there  is  no  gravity,  then  special  relativ¬ 
ity  is  valid,  and  we  can  treat  our  local  region  of  spacetime  as  being 
approximately  flat. 

In  figure  k,  an  apple  falls  out  of  a  tree.  Its  path  is  a  “straight” 
line  in  spacetime,  in  the  same  sense  that  the  equator  is  a  “straight” 
line  on  the  earth’s  surface. 

Inertial  frames 

In  Newtonian  mechanics,  we  have  a  distinction  between  inertial 
and  noninertial  frames  of  reference.  An  inertial  frame  according  to 
Newton  is  one  that  has  a  constant  velocity  vector  relative  to  the 
stars.  But  what  if  the  stars  themselves  are  accelerating  due  to  a 
gravitational  force  from  the  rest  of  the  galaxy?  We  could  then  take 
the  galaxy’s  center  of  mass  as  defining  an  inertial  frame,  but  what 
if  something  else  is  acting  on  the  galaxy? 


j  /  The  earth  is  flat  —  locally. 


k/ Spacetime  is  locally  flat. 


I /Wouldn’t  it  be  nice  if  we  could  define  the  meaning  of  a  Newto¬ 
nian  inertial  frame  of  reference?  Newton  makes  it  sound  easy:  to  define 
an  inertial  frame,  just  find  some  object  that  is  not  accelerating  because 
it  is  not  being  acted  on  by  any  external  forces.  But  what  object  would 
we  use?  The  earth?  The  “fixed  stars?”  Our  galaxy?  Our  supercluster  of 
galaxies?  All  of  these  are  accelerating  —  relative  to  something. 


If  we  had  some  FloatyStuff,  we  could  resolve  the  whole  question. 
FloatyStuff  isn’t  affected  by  gravity,  so  if  we  release  a  sample  of  it 
in  mid-air,  it  will  continue  on  a  trajectory  that  defines  a  perfect 
Newtonian  inertial  frame.  (We’d  better  have  it  on  a  tether,  because 
otherwise  the  earth’s  rotation  will  carry  the  earth  out  from  under 
it.)  But  if  the  equivalence  principle  holds,  then  Newton’s  definition 
of  an  inertial  frame  is  fundamentally  flawed. 

There  is  a  different  definition  of  an  inertial  frame  that  works 
better  in  relativity.  A  Newtonian  inertial  frame  was  defined  by  an 
object  that  isn’t  subject  to  any  forces,  gravitational  or  otherwise. 
In  general  relativity,  we  instead  define  an  inertial  frame  using  an 
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m  /  Matter  is  lifted  out  of  a 
Newtonian  black  hole  with  a 
bucket.  The  dashed  line  rep¬ 
resents  the  point  at  which  the 
escape  velocity  equals  the  speed 
of  light.  For  a  real,  relativistic 
black  hole,  this  is  impossible. 


object  that  that  isn’t  influenced  by  anything  other  than  gravity.  By 
this  definition,  a  free-falling  rock  defines  an  inertial  frame,  but  this 
book  sitting  on  your  desk  does  not. 

7.4.3  Black  holes 

The  observations  described  so  far  showed  only  small  effects  from 
curvature.  To  get  a  big  effect,  we  should  look  at  regions  of  space  in 
which  there  are  strong  gravitational  fields.  The  prime  example  is  a 
black  hole.  The  best  studied  examples  are  two  objects  in  our  own 
galaxy:  Cygnus  X-l,  which  is  believed  to  be  a  black  hole  with  about 
ten  times  the  mass  of  our  sun,  and  Sagittarius  A*,  an  object  near 
the  center  of  our  galaxy  with  about  four  million  solar  masses. 

Although  a  black  hole  is  a  relativistic  object,  we  can  gain  some 
insight  into  how  it  works  by  applying  Newtonian  physics.  A  spher¬ 
ical  body  of  mass  M  has  an  escape  velocity  v  =  \j2GM/r,  which 
is  the  minimum  velocity  that  we  would  need  to  give  to  a  projectile 
shot  from  a  distance  r  so  that  it  would  never  fall  back  down.  If  r 
is  small  enough,  the  escape  velocity  will  be  greater  than  c,  so  that 
even  a  ray  of  light  can  never  escape. 

We  can  now  make  an  educated  guess  as  to  what  this  means 
without  having  to  study  all  the  mathematics  of  general  relativity. 
In  relativity,  c  isn’t  really  the  speed  of  light,  it’s  really  to  be  thought 
of  as  a  restriction  on  how  fast  cause  and  effect  can  propagate  through 
space.  This  suggests  the  correct  interpretation,  which  is  that  for  an 
object  compact  enough  to  be  a  black  hole,  there  is  no  way  for  an 
event  at  a  distance  closer  than  r  to  have  an  effect  on  an  event  far 
away.  There  is  an  invisible,  spherical  boundary  with  radius  r,  called 
the  event  horizon,  and  the  region  within  that  boundary  is  cut  off 
from  the  rest  of  the  universe  in  terms  of  cause  and  effect.  If  you 
wanted  to  explore  that  region,  you  could  drop  into  it  while  wearing 
a  space-suit  —  but  it  would  be  a  one-way  trip,  because  you  could 
never  get  back  out  to  report  on  what  you  had  seen. 

In  the  Newtonian  description  of  a  black  hole,  matter  could  be 
lifted  out  of  a  black  hole,  nr.  Would  this  be  possible  with  a  real- 
world  black  hole,  which  is  relativistic  rather  than  Newtonian?  No, 
because  the  bucket  is  causally  separated  from  the  outside  universe. 
No  rope  would  be  strong  enough  for  this  job  (problem  12,  p.  446). 


One  misleading  aspect  of  the  Newtonian  analysis  is  that  it  en¬ 
courages  us  to  imagine  that  a  light  ray  trying  to  escape  from  a  black 
hole  will  slow  down,  stop,  and  then  fall  back  in.  This  can’t  be  right, 
because  we  know  that  any  observer  who  sees  a  light  ray  flying  by 
always  measures  its  speed  to  be  c.  This  was  true  in  special  relativity, 
and  by  the  equivalence  principle  we  can  be  assured  that  the  same  is 
true  locally  in  general  relativity.  Figure  n  shows  what  would  really 
happen. 
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Although  the  light  rays  in  figure  n  don’t  speed  up  or  slow  down, 
they  do  experience  gravitational  Doppler  shifts.  If  a  light  ray  is 
emitted  from  just  above  the  event  horizon,  then  it  will  escape  to  an 
infinite  distance,  but  it  will  suffer  an  extreme  Doppler  shift  toward 
low  frequencies.  A  distant  observer  also  has  the  option  of  inter¬ 
preting  this  as  a  gravitational  time  dilation  that  greatly  lowers  the 
frequency  of  the  oscillating  electric  charges  that  produced  the  ray. 
If  the  point  of  emission  is  made  closer  and  closer  to  the  horizon, 
the  frequency  and  energy  measured  by  a  distant  observer  approach 
zero,  making  the  ray  impossible  to  observe. 

Information  paradox 

Black  holes  have  some  disturbing  implications  for  the  kind  of 
universe  that  in  the  Age  of  the  Enlightenment  was  imagined  to  have 
been  set  in  motion  initially  and  then  left  to  run  forever  like  clock¬ 
work. 

Newton’s  laws  have  built  into  them  the  implicit  assumption  that 
omniscience  is  possible,  at  least  in  principle.  For  example,  Newton’s 
definition  of  an  inertial  frame  of  reference  leads  to  an  infinite  regress, 
as  described  on  p.  435.  For  Newton  this  isn’t  a  problem,  because 
in  principle  an  omnisicient  observer  can  know  the  location  of  ev¬ 
ery  mass  in  the  universe.  In  this  conception  of  the  cosmos,  there 
are  no  theoretical  limits  on  human  knowledge,  only  practical  ones; 
if  we  could  gather  sufficiently  precise  data  about  the  state  of  the 
universe  at  one  time,  and  if  we  could  carry  out  all  the  calculations 
to  extrapolate  into  the  future,  then  we  could  know  everything  that 
would  ever  happen.  (See  the  famous  quote  by  Laplace  on  p.  16.) 

But  the  existence  of  event  horizons  surrounding  black  holes  makes 


n  /  The  equivalence  principle  tells 
us  that  spacetime  locally  has  the 
same  structure  as  in  special  rel¬ 
ativity,  so  we  can  draw  the  famil¬ 
iar  parallelogram  of  x  -  f  coordi¬ 
nates  at  each  point  near  the  black 
hole.  Superimposed  on  each  lit¬ 
tle  grid  is  a  pair  of  lines  repre¬ 
senting  motion  at  the  speed  of 
light  in  both  directions,  inward  and 
outward.  Because  spacetime  is 
curved,  these  lines  do  not  ap¬ 
pear  to  be  at  45-degree  angles, 
but  to  an  observer  in  that  region, 
they  would  appear  to  be.  When 
light  rays  are  emitted  inward  and 
outward  from  a  point  outside  the 
event  horizon,  one  escapes  and 
one  plunges  into  the  black  hole. 
On  this  diagram,  they  look  like 
they  are  decelerating  and  accel¬ 
erating,  but  local  observers  com¬ 
paring  them  to  their  own  coordi¬ 
nate  grids  would  always  see  them 
as  moving  at  exactly  c.  When 
rays  are  emitted  from  a  point  in¬ 
side  the  event  horizon,  neither  es¬ 
capes;  the  distortion  is  so  severe 
that  “outward”  is  really  inward. 
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o  /  In  Newtonian  contexts, 
physicists  and  astronomers  had 
a  correct  intuition  that  it’s  hard 
for  things  to  collapse  gravita¬ 
tionally.  This  star  cluster  has 
been  around  for  about  15  billion 
years,  but  it  hasn’t  collapsed  into 
a  black  hole.  If  any  individual 
star  happens  to  head  toward  the 
center,  conservation  of  angular 
momentum  tends  to  cause  it  to 
swing  past  and  fly  back  out.  The 
Penrose  singularity  theorem  tells 
us  that  this  Newtonian  intuition  is 
wrong  when  applied  to  an  object 
that  has  collapsed  past  a  certain 
point. 


it  impossible  for  any  observer  to  be  omniscient;  only  an  observer  in¬ 
side  a  particular  horizon  can  see  what’s  going  on  inside  that  horizon. 

Furthermore,  a  black  hole  has  at  its  center  an  infinitely  dense 
point,  called  a  singularity,  containing  all  its  mass,  and  this  implies 
that  information  can  be  destroyed  and  made  inaccessible  to  any 
observer  at  all.  For  example,  suppose  that  astronaut  Alice  goes  on 
a  suicide  mission  to  explore  a  black  hole,  free-falling  in  through  the 
event  horizon.  She  has  a  certain  amount  of  time  to  collect  data  and 
satisfy  her  intellectual  curiosity,  but  then  she  impacts  the  singularity 
and  is  compacted  into  a  mathematical  point.  Now  astronaut  Betty 
decides  that  she  will  never  be  satisfied  unless  the  secrets  revealed 
to  Alice  are  known  to  her  as  well  —  and  besides,  she  was  Alice’s 
best  friend,  and  she  wants  to  know  whether  Alice  had  any  last  words. 
Betty  can  jump  through  the  horizon,  but  she  can  never  know  Alice’s 
last  words,  nor  can  any  other  observer  who  jumps  in  after  Alice  does. 

This  destruction  of  information  is  known  as  the  black  hole  infor¬ 
mation  paradox,  and  it’s  referred  to  as  a  paradox  because  quantum 
physics  (ch.  13)  has  built  into  its  DNA  the  requirement  that  infor¬ 
mation  is  never  lost  in  this  sense. 

Formation 

Around  1960,  as  black  holes  and  their  strange  properties  began 
to  be  better  understood  and  more  widely  discussed,  many  physi¬ 
cists  who  found  these  issues  distressing  comforted  themselves  with 
the  belief  that  black  holes  would  never  really  form  from  realistic 
initial  conditions,  such  as  the  collapse  of  a  massive  star.  Their  skep¬ 
ticism  was  not  entirely  unreasonable,  since  it  is  usually  very  hard 
in  astronomy  to  hit  a  gravitating  target,  the  reason  being  that  con¬ 
servation  of  angular  momentum  tends  to  make  the  projectile  swing 
past.  (See  problem  13  on  p.  289  for  a  quantitative  analysis.)  For 
example,  if  we  wanted  to  drop  a  space  probe  into  the  sun,  we  would 
have  to  extremely  precisely  stop  its  sideways  orbital  motion  so  that 
it  would  drop  almost  exactly  straight  in.  Once  a  star  started  to  col¬ 
lapse,  the  theory  went,  and  became  relatively  compact,  it  would  be 
such  a  small  target  that  further  infalling  material  would  be  unlikely 
to  hit  it,  and  the  process  of  collapse  would  halt.  According  to  this 
point  of  view,  theorists  who  had  calculated  the  collapse  of  a  star 
into  a  black  hole  had  been  oversimplifying  by  assuming  a  star  that 
was  initially  perfectly  spherical  and  nonrotating.  Remove  the  un¬ 
realistically  perfect  symmetry  of  the  initial  conditions,  and  a  black 
hole  would  never  actually  form. 

But  Roger  Penrose  proved  in  1964  that  this  was  wrong.  In  fact, 
once  an  object  collapses  to  a  certain  density,  the  Penrose  singularity 
theorem  guarantees  mathematically  that  it  will  collapse  further  until 
a  singularity  is  formed,  and  this  singularity  is  surrounded  by  an 
event  horizon.  Since  the  brightness  of  an  object  like  Sagittarius  A* 
is  far  too  low  to  be  explained  unless  it  has  an  event  horizon  (the 
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interstellar  gas  flowing  into  it  would  glow  due  to  frictional  heating), 
we  can  be  certain  that  there  really  is  a  singularity  at  its  core. 

7.4.4  Cosmology 

The  Big  Bang 

Subsection  6.1.5  presented  the  evidence,  discovered  by  Hubble, 
that  the  universe  is  expanding  in  the  aftermath  of  the  Big  Bang: 
when  we  observe  the  light  from  distant  galaxies,  it  is  always  Doppler- 
shifted  toward  the  red  end  of  the  spectrum,  indicating  that  no  mat¬ 
ter  what  direction  we  look  in  the  sky,  everything  is  rushing  away 
from  us.  This  seems  to  go  against  the  modern  attitude,  originated 
by  Copernicus,  that  we  and  our  planet  do  not  occupy  a  special  place 
in  the  universe.  Why  is  everything  rushing  away  from  our  planet  in 
particular?  But  general  relativity  shows  that  this  anti-Copernican 
conclusion  is  wrong.  General  relativity  describes  space  not  as  a 
rigidly  defined  background  but  as  something  that  can  curve  and 
stretch,  like  a  sheet  of  rubber.  We  imagine  all  the  galaxies  as  exist¬ 
ing  on  the  surface  of  such  a  sheet,  which  then  expands  uniformly. 
The  space  between  the  galaxies  (but  not  the  galaxies  themselves) 
grows  at  a  steady  rate,  so  that  any  observer,  inhabiting  any  galaxy, 
will  see  every  other  galaxy  as  receding.  There  is  therefore  no  privi¬ 
leged  or  special  location  in  the  universe. 

We  might  think  that  there  would  be  another  kind  of  special 
place,  which  would  be  the  one  at  which  the  Big  Bang  happened. 
Maybe  someone  has  put  a  brass  plaque  there?  But  general  rela¬ 
tivity  doesn’t  describe  the  Big  Bang  as  an  explosion  that  suddenly 
occurred  in  a  preexisting  background  of  time  and  space.  According 
to  general  relativity,  space  itself  came  into  existence  at  the  Big  Bang, 
and  the  hot,  dense  matter  of  the  early  universe  was  uniformly  dis¬ 
tributed  everywhere.  The  Big  Bang  happened  everywhere  at  once. 

Observations  show  that  the  universe  is  very  uniform  on  large 
scales,  and  for  ease  of  calculation,  the  first  physical  models  of  the 
expanding  universe  were  constructed  with  perfect  uniformity.  In 
these  models,  the  Big  Bang  was  a  singularity.  This  singularity  can’t 
even  be  included  as  an  event  in  spacetime,  so  that  time  itself  only  ex¬ 
ists  after  the  Big  Bang.  A  Big  Bang  singularity  also  creates  an  even 
more  acute  version  of  the  black  hole  information  paradox.  Whereas 
matter  and  information  disappear  into  a  black  hole  singularity,  stuff 
pops  out  of  a  Big  Bang  singularity,  and  there  is  no  physical  principle 
that  could  predict  what  it  would  be. 

As  with  black  holes,  there  was  considerable  skepticism  about 
whether  the  existence  of  an  initial  singularity  in  these  models  was 
an  arifact  of  the  unrealistically  perfect  uniformity  assumed  in  the 
models.  Perhaps  in  the  real  universe,  extrapolation  of  all  the  paths 
of  the  galaxies  backward  in  time  would  show  them  missing  each  other 
by  millions  of  light-years.  But  in  1972  Stephen  Hawking  proved 
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p/An  expanding  universe 
with  positive  spatial  curvature 
can  be  imagined  as  a  balloon 
being  blown  up.  Every  galaxy’s 
distance  from  every  other  galaxy 
increases,  but  no  galaxy  is  the 
center  of  the  expansion. 


a  variant  on  the  Penrose  singularity  theorem  that  applied  to  Big 
Bang  singularities.  By  the  Hawking  singularity  theorem,  the  level  of 
uniformity  we  see  in  the  present-day  universe  is  more  than  sufficient 
to  prove  that  a  Big  Bang  singularity  must  have  existed. 

The  cosmic  censorship  hypothesis 

It  might  not  be  too  much  of  a  philosophical  jolt  to  imagine  that 
information  was  spontaneously  created  in  the  Big  Bang.  Setting 
up  the  initial  conditions  of  the  entire  universe  is  traditionally  the 
prerogative  of  God,  not  the  laws  of  physics.  But  there  is  nothing 
fundamental  in  general  relativity  that  forbids  the  existence  of  other 
singularities  that  act  like  the  Big  Bang,  being  information  produc¬ 
ers  rather  than  information  consumers.  As  John  Earman  of  the 
University  of  Pittsburgh  puts  it,  anything  could  pop  out  of  such 
a  singularity,  including  green  slime  or  your  lost  socks.  This  would 
eliminate  any  hope  of  finding  a  universal  set  of  laws  of  physics  that 
would  be  able  to  make  a  prediction  given  any  initial  situation. 

That  would  be  such  a  devastating  defeat  for  the  enterprise  of 
physics  that  in  1969  Penrose  proposed  an  alternative,  humorously 
named  the  “cosmic  censorship  hypothesis,”  which  states  that  ev¬ 
ery  singularity  in  our  universe,  other  than  the  Big  Bang,  is  hidden 
behind  an  event  horizon.  Therefore  if  green  slime  spontaneously 
pops  out  of  one,  there  is  limited  impact  on  the  predictive  ability 
of  physics,  since  the  slime  can  never  have  any  causal  effect  on  the 
outside  world.  A  singularity  that  is  not  modestly  cloaked  behind  an 
event  horizon  is  referred  to  as  a  naked  singularity.  Nobody  has  yet 
been  able  to  prove  the  cosmic  censorship  hypothesis. 

The  advent  of  high-precision  cosmology 

We  expect  that  if  there  is  matter  in  the  universe,  it  should  have 
gravitational  fields,  and  in  the  rubber-sheet  analogy  this  should  be 
represented  as  a  curvature  of  the  sheet.  Instead  of  a  flat  sheet,  we 
can  have  a  spherical  balloon,  so  that  cosmological  expansion  is  like 
inflating  it  with  more  and  more  air.  It  is  also  possible  to  have  nega¬ 
tive  curvature,  as  in  figure  e  on  p.  431.  All  three  of  these  are  valid, 
possible  cosmologies  according  to  relativity.  The  positive-curvature 
type  happens  if  the  average  density  of  matter  in  the  universe  is  above 
a  certain  critical  level,  the  negative-curvature  one  if  the  density  is 
below  that  value. 

To  find  out  which  type  of  universe  we  inhabit,  we  could  try 
to  take  a  survey  of  the  matter  in  the  universe  and  determine  its 
average  density.  Historically,  it  has  been  very  difficult  to  do  this, 
even  to  within  an  order  of  magnitude.  Most  of  the  matter  in  the 
universe  probably  doesn’t  emit  light,  making  it  difficult  to  detect. 
Astronomical  distance  scales  are  also  very  poorly  calibrated  against 
absolute  units  such  as  the  SI. 

Instead,  we  measure  the  universe’s  curvature,  and  infer  the  den- 
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sity  of  matter  from  that.  It  turns  out  that  we  can  do  this  by  ob¬ 
serving  the  cosmic  microwave  background  (CMB)  radiation,  which 
is  the  light  left  over  from  the  brightly  glowing  early  universe,  which 
was  dense  and  hot.  As  the  universe  has  expanded,  light  waves  that 
were  in  flight  have  expanded  their  wavelengths  along  with  it.  This 
afterglow  of  the  big  bang  was  originally  visible  light,  but  after  bil¬ 
lions  of  years  of  expansion  it  has  shifted  into  the  microwave  radio 
part  of  the  electromagnetic  spectrum.  The  CMB  is  not  perfectly 
uniform,  and  this  turns  out  to  give  us  a  way  to  measure  the  uni¬ 
verse’s  curvature.  Since  the  CMB  was  emitted  when  the  universe 
was  only  about  400,000  years  old,  any  vibrations  or  disturbances  in 
the  hot  hydrogen  and  helium  gas  that  filled  space  in  that  era  would 
only  have  had  time  to  travel  a  certain  distance,  limited  by  the  speed 
of  sound.  We  therefore  expect  that  no  feature  in  the  CMB  should 
be  bigger  than  a  certain  known  size.  In  a  universe  with  negative 
spatial  curvature,  the  sum  of  the  interior  angles  of  a  triangle  is  less 
than  the  Euclidean  value  of  180  degrees.  Therefore  if  we  observe 
a  variation  in  the  CMB  over  some  angle,  the  distance  between  two 
points  on  the  sky  is  actually  greater  than  would  have  been  inferred 
from  Euclidean  geometry.  The  opposite  happens  if  the  curvature  is 
positive. 

This  observation  was  done  by  the  1989-1993  COBE  probe,  and 
its  2001-2009  successor,  the  Wilkinson  Microwave  Anisotropy  Probe. 
The  result  is  that  the  angular  sizes  are  almost  exactly  equal  to  what 
they  should  be  according  to  Euclidean  geometry.  We  therefore  infer 
that  the  universe  is  very  close  to  having  zero  average  spatial  cur¬ 
vature  on  the  cosmological  scale,  and  this  tells  us  that  its  average 
density  must  be  within  about  0.5%  of  the  critical  value.  The  years 
since  COBE  and  WMAP  mark  the  advent  of  an  era  in  which  cos¬ 
mology  has  gone  from  being  a  field  of  estimates  and  rough  guesses 
to  a  high-precision  science. 

If  one  is  inclined  to  be  skeptical  about  the  seemingly  precise  an¬ 
swers  to  the  mysteries  of  the  cosmos,  there  are  consistency  checks 
that  can  be  carried  out.  In  the  bad  old  days  of  low-precision  cos¬ 
mology,  estimates  of  the  age  of  the  universe  ranged  from  10  billion 
to  20  billion  years,  and  the  low  end  was  inconsistent  with  the  age 
of  the  oldest  star  clusters.  This  was  believed  to  be  a  problem  either 
for  observational  cosmology  or  for  the  astrophysical  models  used  to 
estimate  the  ages  of  the  clusters:  “You  can’t  be  older  than  your 
ma.”  Current  data  have  shown  that  the  low  estimates  of  the  age 
were  incorrect,  so  consistency  is  restored.  (The  best  figure  for  the 
age  of  the  universe  is  currently  13.8  ±  0.1  billion  years.) 

Dark  energy  and  dark  matter 

Not  everything  works  out  so  smoothly,  however.  One  surpriseis 
that  the  universe’s  expansion  is  not  currently  slowing  down,  as  had 
been  expected  due  to  the  gravitational  attraction  of  all  the  matter 


q/The  angular  scale  of  fluc¬ 
tuations  in  the  cosmic  microwave 
background  can  be  used  to  infer 
the  curvature  of  the  universe. 
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in  it.  Instead,  it  is  currently  speeding  up.  This  is  attributed  to  a 
variable  in  Einstein’s  equations,  long  assumed  to  be  zero,  which  rep¬ 
resents  a  universal  gravitational  repulsion  of  space  itself,  occurring 
even  when  there  is  no  matter  present.  The  current  name  for  this 
is  “dark  energy,”  although  the  fancy  name  is  just  a  label  for  our 
ignorance  about  what  causes  it. 

Another  surprise  comes  from  attempts  to  model  the  formation 
of  the  elements  during  the  era  shortly  after  the  Big  Bang,  before  the 
formation  of  the  first  stars.  The  observed  relative  abundances  of  hy¬ 
drogen,  helium,  and  deuterium  (2H)  cannot  be  reconciled  with  the 
density  of  low-velocity  matter  inferred  from  the  observational  data. 
If  the  inferred  mass  density  were  entirely  due  to  normal  matter  (i.e. , 
matter  whose  mass  consisted  mostly  of  protons  and  neutrons),  then 
nuclear  reactions  in  the  dense  early  universe  should  have  proceeded 
relatively  efficiently,  leading  to  a  much  higher  ratio  of  helium  to 
hydrogen,  and  a  much  lower  abundance  of  deuterium.  The  conclu¬ 
sion  is  that  most  of  the  matter  in  the  universe  must  be  made  of  an 
unknown  type  of  exotic  matter,  known  as  “dark  matter.”  We  are 
in  the  ironic  position  of  knowing  that  precisely  96%  of  the  universe 
is  something  other  than  atoms,  but  knowing  nothing  about  what 
that  something  is.  As  of  2013,  there  have  been  several  experiments 
that  have  been  carried  out  to  attempt  the  direct  detection  of  dark 
matter  particles.  These  are  carried  out  at  the  bottom  of  mineshafts 
to  eliminate  background  radiation.  Early  claims  of  success  appear 
to  have  been  statistical  flukes,  and  the  most  sensitive  experiments 
have  not  detected  anything.8 


sarxiv . org/abs/ 1310 . 8214 
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Problems 


The  symbols  U,  - ,  etc.  are  explained  on  page  449. 

1  The  figure  illustrates  a  Lorentz  transformation  using  the 

conventions  employed  in  section  7.2.  For  simplicity,  the  transfor¬ 
mation  chosen  is  one  that  lengthens  one  diagonal  by  a  factor  of  2. 
Since  Lorentz  transformations  preserve  area,  the  other  diagonal  is 
shortened  by  a  factor  of  2.  Let  the  original  frame  of  reference,  de¬ 
picted  with  the  square,  be  A,  and  the  new  one  B.  (a)  By  measuring 
with  a  ruler  on  the  figure,  show  that  the  velocity  of  frame  B  rela¬ 
tive  to  frame  A  is  0.6c.  (b)  Print  out  a  copy  of  the  page.  With  a 
ruler,  draw  a  third  parallelogram  that  represents  a  second  succes¬ 
sive  Lorentz  transformation,  one  that  lengthens  the  long  diagonal 
by  another  factor  of  2.  Call  this  third  frame  C.  Use  measurements 
with  a  ruler  to  determine  frame  C’s  velocity  relative  to  frame  A. 
Does  it  equal  double  the  velocity  found  in  part  a?  Explain  why  it 
should  be  expected  to  turn  out  the  way  it  does.  V  ^ 
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Problem  7. 


2  Astronauts  in  three  different  spaceships  are  communicating 
with  each  other.  Those  aboard  ships  A  and  B  agree  on  the  rate  at 
which  time  is  passing,  but  they  disagree  with  the  ones  on  ship  C. 

(a)  Alice  is  aboard  ship  A.  How  does  she  describe  the  motion  of  her 
own  ship,  in  its  frame  of  reference? 

(b)  Describe  the  motion  of  the  other  two  ships  according  to  Alice. 

(c)  Give  the  description  according  to  Betty,  whose  frame  of  reference 
is  ship  B. 

(d)  Do  the  same  for  Cathy,  aboard  ship  C. 

3  What  happens  in  the  equation  for  7  when  you  put  in  a  negative 

number  for  vl  Explain  what  this  means  physically,  and  why  it  makes 
sense.  ■ 

4  The  Voyager  1  space  probe,  launched  in  1977,  is  moving  faster 
relative  to  the  earth  than  any  other  human-made  object,  at  17,000 
meters  per  second. 

(a)  Calculate  the  probe’s  7. 

(b)  Over  the  course  of  one  year  on  earth,  slightly  less  than  one  year 

passes  on  the  probe.  How  much  less?  (There  are  31  million  seconds 
in  a  year.)  V  ^ 

5  In  example  2  on  page  395,  I  remarked  that  accelerating  a 

macroscopic  (i.e.,  not  microscopic)  object  to  close  to  the  speed  of 
light  would  require  an  unreasonable  amount  of  energy.  Suppose  that 
the  starship  Enterprise  from  Star  Trek  has  a  mass  of  8.0  x  107  kg, 
about  the  same  as  the  Queen  Elizabeth  2.  Compute  the  kinetic 
energy  it  would  have  to  have  if  it  was  moving  at  half  the  speed  of 
light.  Compare  with  the  total  energy  content  of  the  world’s  nuclear 
arsenals,  which  is  about  1021  J.  V  1 

6  The  earth  is  orbiting  the  sun,  and  therefore  is  contracted 

relativistically  in  the  direction  of  its  motion.  Compute  the  amount 
by  which  its  diameter  shrinks  in  this  direction.  v 

■ 

7  In  this  homework  problem,  you’ll  fill  in  the  steps  of  the  algebra 
required  in  order  to  find  the  equation  for  7  on  page  393.  To  keep  the 
algebra  simple,  let  the  time  t  in  figure  k  equal  1,  as  suggested  in  the 
figure  accompanying  this  homework  problem.  The  original  square 
then  has  an  area  of  1,  and  the  transformed  parallelogram  must  also 
have  an  area  of  1.  (a)  Prove  that  point  P  is  at  x  =  vy,  so  that  its 
(t,  x)  coordinates  are  (7,  vy).  (b)  Find  the  ( t ,  x)  coordinates  of  point 
Q.  (c)  Find  the  length  of  the  short  diagonal  connecting  P  and  Q. 
(d)  Average  the  coordinates  of  P  and  Q  to  find  the  coordinates  of 
the  midpoint  C  of  the  parallelogram,  and  then  find  distance  OC.  (e) 
Find  the  area  of  the  parallelogram  by  computing  twice  the  area  of 
triangle  PQO.  [Hint:  You  can  take  PQ  to  be  the  base  of  the  triangle.] 
(f)  Set  this  area  equal  to  1  and  solve  for  7  to  prove  7  =  l/Vl  —  v2. 

V  ^ 
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8  (a)  A  free  neutron  (as  opposed  to  a  neutron  bound  into  an 

atomic  nucleus)  is  unstable,  and  undergoes  beta  decay  (which  you 
may  want  to  review).  The  masses  of  the  particles  involved  are  as 
follows: 


neutron 

1.67495  x  10-  27  kg 

proton 

1.67265  x  10-  27  kg 

electron 

0.00091  x  10~27  kg 

antineutrino 

<  10~35  kg 

Find  the  energy  released  in  the  decay  of  a  free  neutron.  V 

(b)  Neutrons  and  protons  make  up  essentially  all  of  the  mass  of  the 
ordinary  matter  around  us.  We  observe  that  the  universe  around  us 
has  no  free  neutrons,  but  lots  of  free  protons  (the  nuclei  of  hydrogen, 
which  is  the  element  that  90%  of  the  universe  is  made  of).  We  find 
neutrons  only  inside  nuclei  along  with  other  neutrons  and  protons, 
not  on  their  own. 

If  there  are  processes  that  can  convert  neutrons  into  protons, 
we  might  imagine  that  there  could  also  be  proton-to-neutron  con¬ 
versions,  and  indeed  such  a  process  does  occur  sometimes  in  nuclei 
that  contain  both  neutrons  and  protons:  a  proton  can  decay  into  a 
neutron,  a  positron,  and  a  neutrino.  A  positron  is  a  particle  with 
the  same  properties  as  an  electron,  except  that  its  electrical  charge 
is  positive  (see  chapter  7).  A  neutrino,  like  an  antineutrino,  has 
negligible  mass. 

Although  such  a  process  can  occur  within  a  nucleus,  explain  why 
it  cannot  happen  to  a  free  proton.  (If  it  could,  hydrogen  would  be 
radioactive,  and  you  wouldn’t  exist!) 


9  (a)  Find  a  relativistic  equation  for  the  velocity  of  an  object 

in  terms  of  its  mass  and  momentum  (eliminating  7).  V 

(b)  Show  that  your  result  is  approximately  the  same  as  the  classical 
value,  p/m,  at  low  velocities. 

(c)  Show  that  very  large  momenta  result  in  speeds  close  to  the  speed 

of  light.  ^ 


10  (a)  Show  that  for  v  =  (3/5 )c,  7  comes  out  to  be  a  simple 

fraction. 

(b)  Find  another  value  of  v  for  which  7  is  a  simple  fraction. 
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11  An  object  moving  at  a  speed  very  close  to  the  speed  of  light 
is  referred  to  as  ultrarelativistic.  Ordinarily  (luckily)  the  only  ul- 
trarelativistic  objects  in  our  universe  are  subatomic  particles,  such 
as  cosmic  rays  or  particles  that  have  been  accelerated  in  a  particle 
accelerator. 

(a)  What  kind  of  number  is  7  for  an  ultrarelativistic  particle? 

(b)  Repeat  example  19  on  page  423,  but  instead  of  very  low,  non- 
relativistic  speeds,  consider  ultrarelativistic  speeds. 

(c)  Find  an  equation  for  the  ratio  £/p.  The  speed  may  be  relativis¬ 
tic,  but  don’t  assume  that  it’s  ultrarelativistic.  V 

(d)  Simplify  your  answer  to  part  c  for  the  case  where  the  speed  is 

ultrarelativistic.  V 

(e)  We  can  think  of  a  beam  of  light  as  an  ultrarelativistic  object  — 

it  certainly  moves  at  a  speed  that’s  sufficiently  close  to  the  speed 
of  light!  Suppose  you  turn  on  a  one-watt  flashlight,  leave  it  on  for 
one  second,  and  then  turn  it  off.  Compute  the  momentum  of  the 
recoiling  flashlight,  in  units  of  kg-m/s.  V 

(f)  Discuss  how  part  e  relates  to  the  correspondence  principle. 

12  As  discussed  in  chapter  6,  the  speed  at  which  a  disturbance 
travels  along  a  string  under  tension  is  given  by  v  =  y/Tjp,  where  p, 
is  the  mass  per  unit  length,  and  T  is  the  tension. 

(a)  Suppose  a  string  has  a  density  p,  and  a  cross-sectional  area  A. 

Find  an  expression  for  the  maximum  tension  that  could  possibly 
exist  in  the  string  without  producing  v  >  c,  which  is  impossible 
according  to  relativity.  Express  your  answer  in  terms  of  p,  A ,  and 
c.  The  interpretation  is  that  relativity  puts  a  limit  on  how  strong 
any  material  can  be.  V 

(b)  Every  substance  has  a  tensile  strength,  defined  as  the  force 

per  unit  area  required  to  break  it  by  pulling  it  apart.  The  ten¬ 
sile  strength  is  measured  in  units  of  N/ m2 ,  which  is  the  same  as  the 
pascal  (Pa),  the  inks  unit  of  pressure.  Make  a  numerical  estimate 
of  the  maximum  tensile  strength  allowed  by  relativity  in  the  case 
where  the  rope  is  made  out  of  ordinary  matter,  with  a  density  on 
the  same  order  of  magnitude  as  that  of  water.  (For  comparison, 
kevlar  has  a  tensile  strength  of  about  4  x  109  Pa,  and  there  is  spec¬ 
ulation  that  fibers  made  from  carbon  nanotubes  could  have  values 
as  high  as  6  x  1010  Pa.)  V 

(c)  A  black  hole  is  a  star  that  has  collapsed  and  become  very  dense, 
so  that  its  gravity  is  too  strong  for  anything  ever  to  escape  from  it. 
For  instance,  the  escape  velocity  from  a  black  hole  is  greater  than 
c,  so  a  projectile  can’t  be  shot  out  of  it.  Many  people,  when  they 
hear  this  description  of  a  black  hole  in  terms  of  an  escape  velocity 
greater  than  c,  wonder  why  it  still  wouldn’t  be  possible  to  extract 
an  object  from  a  black  hole  by  other  means  than  launching  it  out 
as  a  projectile.  For  example,  suppose  we  lower  an  astronaut  into  a 
black  hole  on  a  rope,  and  then  pull  him  back  out  again.  Why  might 
this  not  work? 
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13  (a)  A  charged  particle  is  surrounded  by  a  uniform  electric 
field.  Starting  from  rest,  it  is  accelerated  by  the  field  to  speed  v  after 
traveling  a  distance  d.  Now  it  is  allowed  to  continue  for  a  further 
distance  3d,  for  a  total  displacement  from  the  start  of  4 d.  What 
speed  will  it  reach,  assuming  classical  physics? 

(b)  Find  the  relativistic  result  for  the  case  of  v  =  c/2. 

14  Problem  14  has  been  deleted.  - 

15  Expand  the  equation  K  =  m( 7  —  1)  in  a  Taylor  series,  and 

find  the  first  two  nonvanishing  terms.  Show  that  the  first  term  is 
the  nonrelativistic  expression  for  kinetic  energy.  - 


16  Consider  the  relativistic  relation  for  momentum  as  a  function 
of  velocity  (for  a  particle  with  nonzero  mass).  Expand  this  in  a 
Taylor  series,  and  find  the  first  two  nonvanishing  terms.  Show  that 
the  first  term  is  the  classical  expression. 


17  As  promised  in  subsection  7.2.8,  this  problem  will  lead  you 
through  the  steps  of  finding  an  equation  for  the  combination  of 
velocities  in  relativity,  generalizing  the  numerical  result  found  in 
problem  1.  Suppose  that  A  moves  relative  to  B  at  velocity  u,  and 
B  relative  to  C  at  v.  We  want  to  find  A’s  velocity  w  relative  to 
C,  in  terms  of  u  and  v.  Suppose  that  A  emits  light  with  a  certain 
frequency.  This  will  be  observed  by  B  with  a  Doppler  shift  D(u). 
C  detects  a  further  shift  of  D(v)  relative  to  B.  We  therefore  expect 
the  Doppler  shifts  to  multiply,  D(iv)  =  D{u)D{v),  and  this  provides 
an  implicit  rule  for  determining  w  if  u  and  v  are  known,  (a)  Using 
the  expression  for  D  given  in  section  7.2.8,  write  down  an  equation 
relating  u,  v,  and  w.  (b)  Solve  for  w  in  terms  of  u  and  v.  (c)  Show 
that  your  answer  to  part  b  satisfies  the  correspondence  principle. 

>  Solution,  p.  978  !£ 


18  The  figure  shows  seven  four-vectors,  represented  in  a  two- 
dimensional  plot  of  x  versus  t.  All  the  vectors  have  y  and  z  compo¬ 
nents  that  are  zero.  Which  of  these  vectors  are  congruent  to  others, 
i.e.,  which  represent  spacetime  intervals  that  are  equal  to  one  an¬ 
other?  If  you  reason  based  on  Euclidean  geometry,  you  will  get  the 
wrong  answers.  >  Solution,  p.  979  ^ 


19  Four-vectors  can  be  timelike,  lightlike,  or  spacelike.  What  can 
you  say  about  the  inherent  properties  of  particles  whose  momentum 
four-vectors  fall  in  these  various  categories?  - 


Aa 


b  c 


Problem  18. 
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20  The  following  are  the  three  most  common  ways  in  which 
gamma  rays  interact  with  matter: 

Photoelectric  effect:  The  gamma  ray  hits  an  electron,  is  annihi¬ 
lated,  and  gives  all  of  its  energy  to  the  electron. 

Compton  scattering:  The  gamma  ray  bounces  off  of  an  electron, 
exiting  in  some  direction  with  some  amount  of  energy. 

Pair  production:  The  gamma  ray  is  annihilated,  creating  an 
electron  and  a  positron. 

Example  24  on  p.  425  shows  that  pair  production  can’t  occur  in  a 
vacuum  due  to  conservation  of  the  energy- momentum  four- vector. 
What  about  the  other  two  processes?  Can  the  photoelectric  effect 
occur  without  the  presence  of  some  third  particle  such  as  an  atomic 
nucleus?  Can  Compton  scattering  happen  without  a  third  particle? 


0  2  4  6  8 


time  (days) 

Problem  25b.  Redrawn  from 
Van  Baak,  Physics  Today  60 
(2007)  16. 


21  Expand  the  relativistic  equation  for  the  longitudinal  Doppler 
shift  of  light  D(v)  in  a  Taylor  series,  and  find  the  first  two  nonvanish¬ 
ing  terms.  Show  that  these  two  terms  agree  with  the  nonrelativistic 
expression,  so  that  any  relativistic  effect  is  of  higher  order  in  v. 

22  Prove,  as  claimed  in  the  caption  of  figure  a  on  p.  429,  that 

S  —  180°  =  4(s  —  180°),  where  S  is  the  sum  of  the  angles  of  the  large 
equilateral  triangle  and  s  is  the  corresponding  sum  for  one  of  the 
four  small  ones.  >  Solution,  p.  979  - 

23  If  a  two-dimensional  being  lived  on  the  surface  of  a  cone, 
would  it  say  that  its  space  was  curved,  or  not? 

24  (a)  Verify  that  the  equation  1  —  gh/c 2  for  the  gravitational 
Doppler  shift  and  gravitational  time  dilation  has  units  that  make 
sense,  (b)  Does  this  equation  satisfy  the  correspondence  principle? 

25  (a)  Calculate  the  Doppler  shift  to  be  expected  in  the  Pound- 

Rebka  experiment  described  on  p.  434.  (b)  In  the  1978  Iijima 

mountain- valley  experiment  (p.  388),  analysis  was  complicated  by 
the  clock’s  sensitivity  to  pressure,  humidity,  and  temperature.  A 
cleaner  version  of  the  experiment  was  done  in  2005  by  hobbyist 
Tom  Van  Baak.  He  put  his  kids  and  three  of  his  atomic  clocks  in  a 
minivan  and  drove  from  Bellevue,  Washington  to  a  lodge  on  Mount 
Rainier,  1340  meters  higher  in  elevation.  At  home,  he  compared  the 
clocks  to  others  that  had  stayed  at  his  house.  Verify  that  the  effect 
shown  in  the  graph  is  as  predicted  by  general  relativity. 
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26  The  International  Space  Station  orbits  at  an  altitude  of 
about  350  km  and  a  speed  of  about  8000  m/s  relative  to  the  ground. 
Compare  the  gravitational  and  kinematic  time  dilations.  Over  all, 
does  time  run  faster  on  the  ISS  than  on  the  ground,  or  more  slowly? 

I 

27  Section  7.4.3  presented  a  Newtonian  estimate  of  how  compact 
an  object  would  have  to  be  in  order  to  be  a  black  hole.  Although 
this  estimate  is  not  really  right,  it  turns  out  to  give  the  right  answer 
to  within  about  a  factor  of  2.  To  roughly  what  size  would  the  earth 
have  to  be  compressed  in  order  to  become  a  black  hole? 

28  Clock  A  sits  on  a  desk.  Clock  B  is  tossed  up  in  the  air  from 

the  same  height  as  the  desk  and  then  comes  back  down.  Compare 
the  elapsed  times.  >  Hint,  p.  960  >  Solution,  p.  979  - 

29  The  angular  defect  d  of  a  triangle  (measured  in  radians) 

is  defined  as  s  —  7r,  where  s  is  the  sum  of  the  interior  angles.  The 
angular  defect  is  proportional  to  the  area  A  of  the  triangle.  Consider 
the  geometry  measured  by  a  two-dimensional  being  who  lives  on  the 
surface  of  a  sphere  of  radius  R.  First  find  some  triangle  on  the  sphere 
whose  area  and  angular  defect  are  easy  to  calculate.  Then  determine 
the  general  equation  for  d  in  terms  of  A  and  R.  V  ^ 

30  (a)  In  this  chapter  we’ve  represented  Lorentz  transforma¬ 
tions  as  distortions  of  a  square  into  various  parallelograms,  with 
the  degree  of  distortion  depending  on  the  velocity  of  one  frame  of 
reference  relative  to  another.  Suppose  that  one  frame  of  reference 
was  moving  at  c  relative  to  another.  Discuss  what  would  happen  in 
terms  of  distortion  of  a  square,  and  show  that  this  is  impossible  by 
using  an  argument  similar  to  the  one  used  to  rule  out  transforma¬ 
tions  like  the  one  in  figure  e  on  page  391. 

(b)  Resolve  the  following  paradox.  Two  pen-pointer  lasers  are  placed 
side  by  side  and  aimed  in  parallel  directions.  Their  beams  both 
travel  at  c  relative  to  the  hardware,  but  each  beam  has  a  velocity  of 
zero  relative  to  the  neighboring  beam.  But  the  speed  of  light  can’t 
be  zero;  it’s  supposed  to  be  the  same  in  all  frames  of  reference. 

31  The  products  of  a  certain  radioactive  decay  are  a  massive 
particle  and  a  gamma  ray,  which  is  massless.  See  example  21  on 
p.  424  for  a  discussion  of  the  energy  and  momentum  of  a  gamma 
ray.  (a)  Show  that,  in  the  center  of  mass  frame,  the  energy  of  the 
gamma  is  less  than  the  mass-energy  of  the  massive  particle. 

(b)  Show  that  the  opposite  inequality  holds  if  we  compare  the  kinetic 
energy  of  the  massive  particle  to  the  energy  of  the  gamma.  [Problem 
by  B.  Shotwell.]  ^ 


Key  to  symbols: 

easy  -  typical  K  challenging  )i  difficult  ■  very  difficult 
V  An  answer  check  is  available  at  www.lightandmatter.com. 
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Exercises 

Exercise  7A:  The  Michelson-Morley  Experiment 
B 


A 

laboratory's  x,t  frame  of  reference 


B 


A 

sun's  x',t'  frame  (lab  moving  to  the  right) 


In  this  exercise  you  will  analyze  the  Michelson-Morley  experiment,  and  find  what  the  results 
should  have  been  according  to  Galilean  relativity  and  Einstein’s  theory  of  relativity.  A  beam  of 
light  coming  from  the  west  (not  shown)  comes  to  the  half-silvered  mirror  A.  Half  the  light  goes 
through  to  the  east,  is  reflected  by  mirror  C,  and  comes  back  to  A.  The  other  half  is  reflected 
north  by  A,  is  reflected  by  B,  and  also  comes  back  to  A.  When  the  beams  reunite  at  A,  part 
of  each  ends  up  going  south,  and  these  parts  interfere  with  one  another.  If  the  time  taken 
for  a  round  trip  differs  by,  for  example,  half  the  period  of  the  wave,  there  will  be  destructive 
interference. 

The  point  of  the  experiment  was  to  search  for  a  difference  in  the  experimental  results  between 
the  daytime,  when  the  laboratory  was  moving  west  relative  to  the  sun,  and  the  nighttime,  when 
the  laboratory  was  moving  east  relative  to  the  sun.  Galilean  relativity  and  Einstein’s  theory 
of  relativity  make  different  predictions  about  the  results.  According  to  Galilean  relativity,  the 
speed  of  light  cannot  be  the  same  in  all  reference  frames,  so  it  is  assumed  that  there  is  one  special 
reference  frame,  perhaps  the  sun’s,  in  which  light  travels  at  the  same  speed  in  all  directions; 
in  other  frames,  Galilean  relativity  predicts  that  the  speed  of  light  will  be  different  in  different 
directions,  e.g.,  slower  if  the  observer  is  chasing  a  beam  of  light.  There  are  four  different  ways 
to  analyze  the  experiment: 

•  Laboratory ’s  frame  of  reference,  Galilean  relativity.  This  is  not  a  useful  way  to  analyze 
the  experiment,  since  one  does  not  know  how  fast  light  will  travel  in  various  directions. 

•  Sun’s  frame  of  reference,  Galilean  relativity.  We  assume  that  in  this  special  frame  of 
reference,  the  speed  of  light  is  the  same  in  all  directions:  we  call  this  speed  c.  In  this 
frame,  the  laboratory  moves  with  velocity  v,  and  mirrors  A,  B,  and  C  move  while  the  light 
beam  is  in  flight. 

•  Laboratory’s  frame  of  reference,  Einstein’s  theory  of  relativity.  The  analysis  is  extremely 
simple.  Let  the  length  of  each  arm  be  L.  Then  the  time  required  to  get  from  A  to  either 
mirror  is  L/c ,  so  each  beam’s  round-trip  time  is  2 L/c. 

•  Sun’s  frame  of  reference,  Einstein’s  theory  of  relativity.  We  analyze  this  case  by  starting 
with  the  laboratory’s  frame  of  reference  and  then  transforming  to  the  sun’s  frame. 
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Groups  1-4  work  in  the  sun’s  frame  of  reference  according  to  Galilean  relativity. 

Group  1  finds  time  AC.  Group  2  finds  time  CA.  Group  3  finds  time  AB.  Group  4  finds  time 
BA. 

Groups  5  and  6  transform  the  lab-frame  results  into  the  sun’s  frame  according  to  Einstein’s 
theory. 

Group  5  transforms  the  x  and  t  when  ray  ACA  gets  back  to  A  into  the  sun’s  frame  of  reference, 
and  group  6  does  the  same  for  ray  ABA. 

Discussion: 

Michelson  and  Morley  found  no  change  in  the  interference  of  the  waves  between  day  and  night. 
Which  version  of  relativity  is  consistent  with  their  results? 

What  does  each  theory  predict  if  v  approaches  c? 

What  if  the  arms  are  not  exactly  equal  in  length? 

Does  it  matter  if  the  “special”  frame  is  some  frame  other  than  the  sun’s? 
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Exercise  7B:  Sports  in  Slowlightland 

In  Slowlightland,  the  speed  of  light  is  20  mi/hr  ~  32  km/hr  ~  9  m/s.  Think  of  an  example  of 
how  relativistic  effects  would  work  in  sports.  Things  can  get  very  complex  very  quickly,  so  try 
to  think  of  a  simple  example  that  focuses  on  just  one  of  the  following  effects: 

-  relativistic  momentum 

-  relativistic  kinetic  energy 

-  relativistic  addition  of  velocities 

-  time  dilation  and  length  contraction 

-  Doppler  shifts  of  light 

-  equivalence  of  mass  and  energy 

-  time  it  takes  for  light  to  get  to  an  athlete’s  eye 

-  deflection  of  light  rays  by  gravity 
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Exercise  7C:  Events  and  Spacetime 

Bell's  spaceship  paradox 


A  difficult  philosophical  question  is  whether  the  time  dilation  and  length  contractions 
predicted  by  relativity  are  "real."  This  depends,  of  course,  on  what  one  means  by 
"real."  They  are  frame-dependent,  i.e„  observers  in  different  frames  of  reference 
disagree  about  them.  But  this  doesn't  tell  us  much  about  their  reality,  since  velocities 
are  frame-dependent  in  Newtonian  mechanics,  but  nobody  worries  about  whether 
velocities  are  real.  John  Bell  (1 928-1 990)  proposed  the  following  thought  experiment  to 
physicists  in  the  CERN  cafeteria,  and  found  that  nearly  all  of  them  got  it  wrong.  He  took 
this  as  evidence  that  their  intuitions  had  been  misguided  by  the  standard  way  of 
approaching  this  question  of  the  reality  of  Lorentz  contractions. 

Let  spaceships  A  and  B  accelerate  as  shown  in  the  figure,  along  a  straight  line. 
Observer  C,  whose  frame  of  reference  is  indicated  by  the  square,  does  not  accelerate. 
The  accelerations,  as  judged  by  C,  are  constant,  and  equal  for  the  two  ships.  Each 
ship  is  equipped  with  a  yard-arm,  and  a  thread  is  tied  between  the  two  arms.  Does  the 
thread  break,  due  to  length  contraction? 

Use  the  spacetime  diagram  below  to  find  the  answer. 

What  does  C  say  about  the  distance  between  A  and  B  as  time  goes  by?  You  may 
want  to  use  a  ruler  to  verify  that  the  tracks  in  the  diagram  do  correspond  properly  to  the 
statement  of  the  paradox. 

At  what  time  does  the  parallelogram  correspond  to  A's  frame  of  reference?  B's? 

What  does  an  observer  in  the  parallelogram  frame  say  about  the  distance  between  A 
and  B  as  time  goes  by? 

What  would  the  result  be  if,  instead  of  the  Lorentz  transformation,  we  used  the 
Galilean  version? 
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The  following  paper-and-pencil  exercises  involve  graphs  of  position  versus  time.  These  are  often  referred  to  as  spacetime  diagrams,  and  tracks  on 
them  are  called  world-lines.  As  in  the  other  examples  in  this  book,  a  time  axis  is  always  closer  to  horizontal,  and  a  position  axis  closer  to  vertical,  and 
the  units  are  such  that  the  universal  speed  is  a  slope  of  +1  or  -1 . 

Draw  spacetime  diagrams  of:  (1 )  a  box  with  a  particle  bouncing  back  and  forth  inside  it,  (2)  a  ray  of  light  being  absorbed  by  a  leaf  on  a  tree,  (3)  an 
atomic  nucleus  splitting  up  into  two  parts  (fissioning),  (4)  a  cloud  of  gas  collapsing  gravitationally  to  form  our  solar  system 

12  3  4 


5.  Event  A  is  given.  Mark  examples  of  events  B-F  that  satisfy  these  criteria.  A  material  object  travels  from  A  to 
B.  An  object  traveled  from  C  to  A.  A  ray  of  light  emitted  at  A  is  received  at  D.  A  ray  of  light  emitted  at  E  is 
received  at  A.  F  can  have  no  possible  cause-and-effect  relationship  with  A. 


6.  These  are  five  pairs  of  spacetime  diagrams,  organized  so  that  there  is  some  resemblance  between  the  left  and  right  of  each  pair.  In  each  case, 
decide  whether  the  two  could  actually  be  the  same  thing  represented  in  a  different  inertial  frame  of  reference.  Write  yes  or  no,  and  explain  why. 
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7.  The  diagram  shows  three  events  and  two  frames  of  reference. 
Describe  the  time-ordering  of  the  events  in  these  frames.  Is  any 
other  time-ordering  possible  in  any  other  frame? 


8.  The  top  row  shows  three  clocks  located  in  three  different  places.  They  have  been  synchronized  in  the  frame  of  reference  of  the  earth, 
represented  by  the  paper.  This  synchronization  is  carried  out  by  exchanging  light  signals.  For  example,  if  the  front  and  back  clocks  both  send  out 
flashes  of  light  when  they  think  it's  2  o'clock,  the  one  in  the  middle  will  receive  them  both  at  the  same  time.  Event  A  is  the  one  at  which  the  back 
clock  A  reads  2  o'clock,  etc.  The  second  row  represents  clocks  that  are  synchronized  aboard  the  train,  which  is  moving  to  the  right  at  a 
substantial  fraction  of  the  speed  of  light.  How  should  the  clocks  shown  with  dashed  outlines  compare  with  the  one  at  the  middle  of  the  train? 


9  The  figure  shows  three  events  and  a 
square  representing  the  t  and  x  axes  of  a 
frame  of  reference.  In  this  frame,  the  time¬ 
ordering  of  the  events  is  ABC.  If  we  switch  to 
another  frame,  what  other  orderings,  if  any, 
are  possible? 


1 0.  The  figure  shows  the  motion  of  an  object,  with  two  different  frames  of 
reference  superimposed.  Is  anything  strange  going  on? 


1 1 .  Suppose  that  the  train  in  8  is  hooked  up  in  a  circle,  like  a  chain  necklace.  What  happens? 
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Exercise  7D:  Misconceptions  about  Relativity 

The  following  is  a  list  of  common  misconceptions  about  relativity.  The  class  will  be  split 
up  into  random  groups,  and  each  group  will  cooperate  on  developing  an  explanation  of  the 
misconception,  and  then  the  groups  will  present  their  explanations  to  the  class.  There  may 
be  multiple  rounds,  with  students  assigned  to  different  randomly  chosen  groups  in  successive 
rounds. 

1.  How  can  light  have  momentum  if  it  has  zero  mass? 

2.  What  does  the  world  look  like  in  a  frame  of  reference  moving  at  c? 

3.  Alice  observes  Betty  coming  toward  her  from  the  left  at  c/2,  and  Carol  from  the  right  at 
c/2.  Therefore  Betty  is  moving  at  the  speed  of  light  relative  to  Carol. 

4.  Are  relativistic  effects  such  as  length  contraction  and  time  dilation  real,  or  do  they  just 
seem  to  be  that  way? 

5.  Special  relativity  only  matters  if  you’re  moving  close  to  the  speed  of  light. 

6.  Special  relativity  says  that  everything  is  relative. 

7.  There  is  a  common  misconception  that  relativistic  length  contraction  is  what  we  would 
actually  see.  Refute  this  by  drawing  a  spacetime  diagram  for  an  object  approaching  an 
observer,  and  tracing  rays  of  light  emitted  from  the  object’s  front  and  back  that  both 
reach  the  observer’s  eye  at  the  same  time. 

8.  When  you  travel  close  to  the  speed  of  light,  your  time  slows  down. 

9.  Is  a  light  wave’s  wavelength  relativistically  length  contracted  by  a  factor  of  gamma? 

10.  Accelerate  a  baseball  to  ultrarelativistic  speeds.  Does  it  become  a  black  hole? 

11.  Where  did  the  Big  Bang  happen? 

12.  The  universe  can’t  be  infinite  in  size,  because  it’s  only  had  a  finite  amount  of  time  to 
expand  from  the  point  where  the  Big  Bang  happened. 
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Exercise  7E:  The  sum  of  observer-vectors  is  an  observer-vector. 

The  figure  gives  four  pairs  of  four-vectors,  oriented  in  our  customary  way  as  shown  by  the 
light-cone  on  the  left. 


1.  Of  the  types  shown  in  the  four  cases  i-iv,  which  types  of  vectors  could  represent  the  world- line 
of  an  observer? 


2.  Suppose  that  U  and  V  are  both  observer- vectors.  What  would  it  mean  physically  to  compute 

U  + V? 

3.  Determine  the  sign  of  each  inner  product  A  •  B. 

4.  Given  an  observer  whose  world-line  is  along  a  four- vector  O,  suppose  we  want  to  determine 
whether  some  other  four-vector  P  is  also  a  possible  world-line  of  an  observer.  Show  that  knowl¬ 
edge  of  the  signs  of  the  inner  products  O  •  P  and  P  •  P  is  necessary  and  sufficient  to  determine 
this.  Hint:  Consider  various  possibilities  like  i-iv  for  vector  P,  and  see  how  the  signs  would  turn 
out. 

5.  For  vectors  as  described  in  4,  determine  the  signs  of 

(U  + V)  •  (U  + V) 


and 

(U  +  V)  •  u 

by  multiplying  them  out.  Interpret  the  result  physically. 
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Chapter  8 

Atoms  and  Electromagnetism 


8.1  The  Electric  Glue 

Where  the  telescope  ends,  the  microscope  begins.  Which  of  the  two 
has  the  grander  view?  Victor  Hugo 

His  father  died  during  his  mother’s  pregnancy.  Rejected  by  her 
as  a  boy,  he  was  packed  off  to  boarding  school  when  she  remarried. 
He  himself  never  married,  but  in  middle  age  he  formed  an  intense 
relationship  with  a  much  younger  man,  a  relationship  that  he  ter¬ 
minated  when  he  underwent  a  psychotic  break.  Following  his  early 
scientific  successes,  he  spent  the  rest  of  his  professional  life  mostly 
in  frustration  over  his  inability  to  unlock  the  secrets  of  alchemy. 

The  man  being  described  is  Isaac  Newton,  but  not  the  triumphant 
Newton  of  the  standard  textbook  hagiography.  Why  dwell  on  the 
sad  side  of  his  life?  To  the  modern  science  educator,  Newton’s  life¬ 
long  obsession  with  alchemy  may  seem  an  embarrassment,  a  distrac¬ 
tion  from  his  main  achievement,  the  creation  the  modern  science  of 
mechanics.  To  Newton,  however,  his  alchemical  researches  were  nat¬ 
urally  related  to  his  investigations  of  force  and  motion.  What  was 
radical  about  Newton’s  analysis  of  motion  was  its  universality:  it 
succeeded  in  describing  both  the  heavens  and  the  earth  with  the 
same  equations,  whereas  previously  it  had  been  assumed  that  the 
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sun,  moon,  stars,  and  planets  were  fundamentally  different  from 
earthly  objects.  But  Newton  realized  that  if  science  was  to  describe 
all  of  nature  in  a  unified  way,  it  was  not  enough  to  unite  the  human 
scale  with  the  scale  of  the  universe:  he  would  not  be  satisfied  until 
he  fit  the  microscopic  universe  into  the  picture  as  well. 

It  should  not  surprise  us  that  Newton  failed.  Although  he  was  a 
firm  believer  in  the  existence  of  atoms,  there  was  no  more  experimen¬ 
tal  evidence  for  their  existence  than  there  had  been  when  the  ancient 
Greeks  first  posited  them  on  purely  philosophical  grounds.  Alchemy 
labored  under  a  tradition  of  secrecy  and  mysticism.  Newton  had 
already  almost  single-handedly  transformed  the  fuzzyheaded  held 
of  “natural  philosophy”  into  something  we  would  recognize  as  the 
modern  science  of  physics,  and  it  would  be  unjust  to  criticize  him 
for  failing  to  change  alchemy  into  modern  chemistry  as  well.  The 
time  was  not  ripe.  The  microscope  was  a  new  invention,  and  it  was 
cutting-edge  science  when  Newton’s  contemporary  Hooke  discovered 
that  living  things  were  made  out  of  cells. 

8.1 .1  The  quest  for  the  atomic  force 

Newton  was  not  the  first  of  the  age  of  reason.  He  was  the  last 
of  the  magicians.  John  Maynard  Keynes 

Newton’s  quest 

Nevertheless  it  will  be  instructive  to  pick  up  Newton’s  train  of 
thought  and  see  where  it  leads  us  with  the  benefit  of  modern  hind¬ 
sight.  In  uniting  the  human  and  cosmic  scales  of  existence,  he  had 
reimagined  both  as  stages  on  which  the  actors  were  objects  (trees 
and  houses,  planets  and  stars)  that  interacted  through  attractions 
and  repulsions.  He  was  already  convinced  that  the  objects  inhab¬ 
iting  the  microworld  were  atoms,  so  it  remained  only  to  determine 
what  kinds  of  forces  they  exerted  on  each  other. 

His  next  insight  was  no  less  brilliant  for  his  inability  to  bring  it  to 
fruition.  He  realized  that  the  many  human-scale  forces  —  friction, 
sticky  forces,  the  normal  forces  that  keep  objects  from  occupying 
the  same  space,  and  so  on  —  must  all  simply  be  expressions  of  a 
more  fundamental  force  acting  between  atoms.  Tape  sticks  to  paper 
because  the  atoms  in  the  tape  attract  the  atoms  in  the  paper.  My 
house  doesn’t  fall  to  the  center  of  the  earth  because  its  atoms  repel 
the  atoms  of  the  dirt  under  it. 

Here  he  got  stuck.  It  was  tempting  to  think  that  the  atomic  force 
was  a  form  of  gravity,  which  he  knew  to  be  universal,  fundamental, 
and  mathematically  simple.  Gravity,  however,  is  always  attractive, 
so  how  could  he  use  it  to  explain  the  existence  of  both  attractive 
and  repulsive  atomic  forces?  The  gravitational  force  between  ob¬ 
jects  of  ordinary  size  is  also  extremely  small,  which  is  why  we  never 
notice  cars  and  houses  attracting  us  gravitationally.  It  would  be 
hard  to  understand  how  gravity  could  be  responsible  for  anything 
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as  vigorous  as  the  beating  of  a  heart  or  the  explosion  of  gunpowder. 
Newton  went  on  to  write  a  million  words  of  alchemical  notes  filled 
with  speculation  about  some  other  force,  perhaps  a  “divine  force”  or 
“vegetative  force”  that  would  for  example  be  carried  by  the  sperm 
to  the  egg. 

Luckily,  we  now  know  enough  to  investigate  a  different  suspect 
as  a  candidate  for  the  atomic  force:  electricity.  Electric  forces  are 
often  observed  between  objects  that  have  been  prepared  by  rubbing 
(or  other  surface  interactions),  for  instance  when  clothes  rub  against 
each  other  in  the  dryer.  A  useful  example  is  shown  in  figure  a/1: 
stick  two  pieces  of  tape  on  a  tabletop,  and  then  put  two  more  pieces 
on  top  of  them.  Lift  each  pair  from  the  table,  and  then  separate 
them.  The  two  top  pieces  will  then  repel  each  other,  a/2,  as  will 
the  two  bottom  pieces.  A  bottom  piece  will  attract  a  top  piece, 
however,  a/3.  Electrical  forces  like  these  are  similar  in  certain  ways 
to  gravity,  the  other  force  that  we  already  know  to  be  fundamental: 

•  Electrical  forces  are  universal.  Although  some  substances, 
such  as  fur,  rubber,  and  plastic,  respond  more  strongly  to 
electrical  preparation  than  others,  all  matter  participates  in 
electrical  forces  to  some  degree.  There  is  no  such  thing  as  a 
“nonelectric”  substance.  Matter  is  both  inherently  gravita¬ 
tional  and  inherently  electrical. 

•  Experiments  show  that  the  electrical  force,  like  the  gravita¬ 
tional  force,  is  an  inverse  square  force.  That  is,  the  electrical 
force  between  two  spheres  is  proportional  to  1/r2,  where  r  is 
the  center-to-center  distance  between  them. 


a  /  Four  pieces  of  tape  are 
prepared,  1,  as  described  in  the 
text.  Depending  on  which  com¬ 
bination  is  tested,  the  interaction 
can  be  either  repulsive,  2,  or 
attractive,  3. 


Furthermore,  electrical  forces  make  more  sense  than  gravity  as 
candidates  for  the  fundamental  force  between  atoms,  because  we 
have  observed  that  they  can  be  either  attractive  or  repulsive. 


8.1.2  Charge,  electricity  and  magnetism 

Charge 

“Charge”  is  the  technical  term  used  to  indicate  that  an  object 
has  been  prepared  so  as  to  participate  in  electrical  forces.  This  is 
to  be  distinguished  from  the  common  usage,  in  which  the  term  is 
used  indiscriminately  for  anything  electrical.  For  example,  although 
we  speak  colloquially  of  “charging”  a  battery,  you  may  easily  verify 
that  a  battery  has  no  charge  in  the  technical  sense,  e.g.,  it  does  not 
exert  any  electrical  force  on  a  piece  of  tape  that  has  been  prepared 
as  described  in  the  previous  section. 
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Two  types  of  charge 

We  can  easily  collect  reams  of  data  on  electrical  forces  between 
different  substances  that  have  been  charged  in  different  ways.  We 
find  for  example  that  cat  fur  prepared  by  rubbing  against  rabbit 
fur  will  attract  glass  that  has  been  rubbed  on  silk.  How  can  we 
make  any  sense  of  all  this  information?  A  vast  simplification  is 
achieved  by  noting  that  there  are  really  only  two  types  of  charge. 
Suppose  we  pick  cat  fur  rubbed  on  rabbit  fur  as  a  representative  of 
type  A,  and  glass  rubbed  on  silk  for  type  B.  We  will  now  find  that 
there  is  no  “type  C.”  Any  object  electrified  by  any  method  is  either 
A-like,  attracting  things  A  attracts  and  repelling  those  it  repels,  or 
B-like,  displaying  the  same  attractions  and  repulsions  as  B.  The  two 
types,  A  and  B,  always  display  opposite  interactions.  If  A  displays 
an  attraction  with  some  charged  object,  then  B  is  guaranteed  to 
undergo  repulsion  with  it,  and  vice-versa. 

The  coulomb 

Although  there  are  only  two  types  of  charge,  each  type  can  come 
in  different  amounts.  The  metric  unit  of  charge  is  the  coulomb 
(rhymes  with  “drool  on”),  defined  as  follows: 

One  Coulomb  (C)  is  the  amount  of  charge  such  that  a  force  of 
9.0  x  109  N  occurs  between  two  pointlike  objects  with  charges 
of  1  C  separated  by  a  distance  of  1  m. 

The  notation  for  an  amount  of  charge  is  q.  The  numerical  factor 
in  the  definition  is  historical  in  origin,  and  is  not  worth  memoriz¬ 
ing.  The  definition  is  stated  for  pointlike,  i.e.,  very  small,  objects, 
because  otherwise  different  parts  of  them  would  be  at  different  dis¬ 
tances  from  each  other. 

A  model  of  two  types  of  charged  particles 

Experiments  show  that  all  the  methods  of  rubbing  or  otherwise 
charging  objects  involve  two  objects,  and  both  of  them  end  up  get¬ 
ting  charged.  If  one  object  acquires  a  certain  amount  of  one  type  of 
charge,  then  the  other  ends  up  with  an  equal  amount  of  the  other 
type.  Various  interpretations  of  this  are  possible,  but  the  simplest 
is  that  the  basic  building  blocks  of  matter  come  in  two  flavors,  one 
with  each  type  of  charge.  Rubbing  objects  together  results  in  the 
transfer  of  some  of  these  particles  from  one  object  to  the  other.  In 
this  model,  an  object  that  has  not  been  electrically  prepared  may  ac¬ 
tually  possesses  a  great  deal  of  both  types  of  charge,  but  the  amounts 
are  equal  and  they  are  distributed  in  the  same  way  throughout  it. 
Since  type  A  repels  anything  that  type  B  attracts,  and  vice  versa, 
the  object  will  make  a  total  force  of  zero  on  any  other  object.  The 
rest  of  this  chapter  fleshes  out  this  model  and  discusses  how  these 
mysterious  particles  can  be  understood  as  being  internal  parts  of 
atoms. 
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Use  of  positive  and  negative  signs  for  charge 

Because  the  two  types  of  charge  tend  to  cancel  out  each  other’s 
forces,  it  makes  sense  to  label  them  using  positive  and  negative  signs, 
and  to  discuss  the  total  charge  of  an  object.  It  is  entirely  arbitrary 
which  type  of  charge  to  call  negative  and  which  to  call  positive. 
Benjamin  Franklin  decided  to  describe  the  one  we’ve  been  calling 
“A”  as  negative,  but  it  really  doesn’t  matter  as  long  as  everyone  is 
consistent  with  everyone  else.  An  object  with  a  total  charge  of  zero 
(equal  amounts  of  both  types)  is  referred  to  as  electrically  neutral. 

self-check  A 

Criticize  the  following  statement 

tive  and  repulsive.” 

965 

A  large  body  of  experimental  observations  can  be  summarized 
as  follows: 


:  “There  are  two  types  of  charge,  attrac- 

>  Answer,  p. 


Coulomb’s  law:  The  magnitude  of  the  force  acting  between  point- 
like  charged  objects  at  a  center-to-center  distance  r  is  given  by  the 
equation 


where  the  constant  k  equals  9.0  x  109  N-m2/C2.  The  force  is  attrac¬ 
tive  if  the  charges  are  of  different  signs,  and  repulsive  if  they  have 
the  same  sign. 


Clever  modern  techniques  have  allowed  the  1/r2  form  of  Coulomb’s 
law  to  be  tested  to  incredible  accuracy,  showing  that  the  exponent 
is  in  the  range  from  1.9999999999999998  to  2.0000000000000002. 

Note  that  Coulomb’s  law  is  closely  analogous  to  Newton’s  law 
of  gravity,  where  the  magnitude  of  the  force  is  Gmim-z/r2 ,  except 
that  there  is  only  one  type  of  mass,  not  two,  and  gravitational  forces 
are  never  repulsive.  Because  of  this  close  analogy  between  the  two 
types  of  forces,  we  can  recycle  a  great  deal  of  our  knowledge  of 
gravitational  forces.  For  instance,  there  is  an  electrical  equivalent 
of  the  shell  theorem:  the  electrical  forces  exerted  externally  by  a 
uniformly  charged  spherical  shell  are  the  same  as  if  all  the  charge 
was  concentrated  at  its  center,  and  the  forces  exerted  internally  are 
zero. 

Conservation  of  charge 

An  even  more  fundamental  reason  for  using  positive  and  nega¬ 
tive  signs  for  electrical  charge  is  that  experiments  show  that  charge 
is  conserved  according  to  this  definition:  in  any  closed  system,  the 
total  amount  of  charge  is  a  constant.  This  is  why  we  observe  that 
rubbing  initially  uncharged  substances  together  always  has  the  re¬ 
sult  that  one  gains  a  certain  amount  of  one  type  of  charge,  while 
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b  /  A  charged  piece  of  tape 
attracts  uncharged  pieces  of 
paper  from  a  distance,  and  they 
leap  up  to  it. 


c  /  The  paper  has  zero  total 
charge,  but  it  does  have  charged 
particles  in  it  that  can  move. 


the  other  acquires  an  equal  amount  of  the  other  type.  Conservation 
of  charge  seems  natural  in  our  model  in  which  matter  is  made  of 
positive  and  negative  particles.  If  the  charge  on  each  particle  is  a 
fixed  property  of  that  type  of  particle,  and  if  the  particles  themselves 
can  be  neither  created  nor  destroyed,  then  conservation  of  charge  is 
inevitable. 

Electrical  forces  involving  neutral  objects 

As  shown  in  figure  b,  an  electrically  charged  object  can  attract 
objects  that  are  uncharged.  How  is  this  possible?  The  key  is  that 
even  though  each  piece  of  paper  has  a  total  charge  of  zero,  it  has  at 
least  some  charged  particles  in  it  that  have  some  freedom  to  move. 
Suppose  that  the  tape  is  positively  charged,  c.  Mobile  particles 
in  the  paper  will  respond  to  the  tape’s  forces,  causing  one  end  of 
the  paper  to  become  negatively  charged  and  the  other  to  become 
positive.  The  attraction  between  the  paper  and  the  tape  is  now 
stronger  than  the  repulsion,  because  the  negatively  charged  end  is 
closer  to  the  tape. 

self-check  B 

What  would  have  happened  if  the  tape  was  negatively  charged?  > 

Answer,  p.  965 

The  path  ahead 

We  have  begun  to  encounter  complex  electrical  behavior  that  we 
would  never  have  realized  was  occurring  just  from  the  evidence  of  our 
eyes.  Unlike  the  pulleys,  blocks,  and  inclined  planes  of  mechanics, 
the  actors  on  the  stage  of  electricity  and  magnetism  are  invisible 
phenomena  alien  to  our  everyday  experience.  For  this  reason,  the 
flavor  of  the  second  half  of  your  physics  education  is  dramatically 
different,  focusing  much  more  on  experiments  and  techniques.  Even 
though  you  will  never  actually  see  charge  moving  through  a  wire, 
you  can  learn  to  use  an  ammeter  to  measure  the  flow. 

Students  also  tend  to  get  the  impression  from  their  first  semester 
of  physics  that  it  is  a  dead  science.  Not  so!  We  are  about  to  pick 
up  the  historical  trail  that  leads  directly  to  the  cutting-edge  physics 
research  you  read  about  in  the  newspaper.  The  atom-smashing  ex¬ 
periments  that  began  around  1900,  which  we  will  be  studying  in  this 
chapter,  were  not  that  different  from  the  ones  of  the  year  2000  - 
just  smaller,  simpler,  and  much  cheaper. 

Magnetic  forces 

A  detailed  mathematical  treatment  of  magnetism  won’t  come 
until  much  later  in  this  book,  but  we  need  to  develop  a  few  simple 
ideas  about  magnetism  now  because  magnetic  forces  are  used  in  the 
experiments  and  techniques  we  come  to  next.  Everyday  magnets 
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come  in  two  general  types.  Permanent  magnets,  such  as  the  ones 
on  your  refrigerator,  are  made  of  iron  or  substances  like  steel  that 
contain  iron  atoms.  (Certain  other  substances  also  work,  but  iron 
is  the  cheapest  and  most  common.)  The  other  type  of  magnet, 
an  example  of  which  is  the  ones  that  make  your  stereo  speakers 
vibrate,  consist  of  coils  of  wire  through  which  electric  charge  flows. 
Both  types  of  magnets  are  able  to  attract  iron  that  has  not  been 
magnetically  prepared,  for  instance  the  door  of  the  refrigerator. 

A  single  insight  makes  these  apparently  complex  phenomena 
much  simpler  to  understand:  magnetic  forces  are  interactions  be¬ 
tween  moving  charges,  occurring  in  addition  to  the  electric  forces. 
Suppose  a  permanent  magnet  is  brought  near  a  magnet  of  the  coiled- 
wire  type.  The  coiled  wire  has  moving  charges  in  it  because  we  force 
charge  to  flow.  The  permanent  magnet  also  has  moving  charges  in 
it,  but  in  this  case  the  charges  that  naturally  swirl  around  inside  the 
iron.  (What  makes  a  magnetized  piece  of  iron  different  from  a  block 
of  wood  is  that  the  motion  of  the  charge  in  the  wood  is  random 
rather  than  organized.)  The  moving  charges  in  the  coiled- wire  mag¬ 
net  exert  a  force  on  the  moving  charges  in  the  permanent  magnet, 
and  vice-versa. 

The  mathematics  of  magnetism  is  significantly  more  complex 
than  the  Coulomb  force  law  for  electricity,  which  is  why  we  will 
wait  until  chapter  11  before  delving  deeply  into  it.  Two  simple  facts 
will  suffice  for  now: 

(1)  If  a  charged  particle  is  moving  in  a  region  of  space  near  where 
other  charged  particles  are  also  moving,  their  magnetic  force  on  it 
is  directly  proportional  to  its  velocity. 

(2)  The  magnetic  force  on  a  moving  charged  particle  is  always 
perpendicular  to  the  direction  the  particle  is  moving. 

A  magnetic  compass  example  1 

The  Earth  is  molten  inside,  and  like  a  pot  of  boiling  water,  it  roils 
and  churns.  To  make  a  drastic  oversimplification,  electric  charge 
can  get  carried  along  with  the  churning  motion,  so  the  Earth  con¬ 
tains  moving  charge.  The  needle  of  a  magnetic  compass  is  itself 
a  small  permanent  magnet.  The  moving  charge  inside  the  earth 
interacts  magnetically  with  the  moving  charge  inside  the  compass 
needle,  causing  the  compass  needle  to  twist  around  and  point 
north. 

A  television  tube  example  2 

A  TV  picture  is  painted  by  a  stream  of  electrons  coming  from 
the  back  of  the  tube  to  the  front.  The  beam  scans  across  the 
whole  surface  of  the  tube  like  a  reader  scanning  a  page  of  a  book. 
Magnetic  forces  are  used  to  steer  the  beam.  As  the  beam  comes 
from  the  back  of  the  tube  to  the  front,  up-down  and  left-right  forces 
are  needed  for  steering.  But  magnetic  forces  cannot  be  used 


Section  8.1  The  Electric  Glue 


465 


to  get  the  beam  up  to  speed  in  the  first  place,  since  they  can 
only  push  perpendicular  to  the  electrons’  direction  of  motion,  not 
forward  along  it. 

Discussion  Questions 

A  If  the  electrical  attraction  between  two  pointlike  objects  at  a  distance 
of  1  m  is  9  x  1 09  N,  why  can’t  we  infer  that  their  charges  are  +1  and  -1  C? 
What  further  observations  would  we  need  to  do  in  order  to  prove  this? 

B  An  electrically  charged  piece  of  tape  will  be  attracted  to  your  hand. 
Does  that  allow  us  to  tell  whether  the  mobile  charged  particles  in  your 
hand  are  positive  or  negative,  or  both? 

8.1.3  Atoms 

I  was  brought  up  to  look  at  the  atom  as  a  nice,  hard  fellow,  red 
or  grey  in  color  according  to  taste.  Rutherford 

Atomism 

The  Greeks  have  been  kicked  around  a  lot  in  the  last  couple  of 
millennia:  dominated  by  the  Romans,  bullied  during  the  crusades 
by  warlords  going  to  and  from  the  Holy  Land,  and  occupied  by 
Turkey  until  recently.  It’s  no  wonder  they  prefer  to  remember  their 
salad  days,  when  their  best  thinkers  came  up  with  concepts  like 
democracy  and  atoms.  Greece  is  democratic  again  after  a  period 
of  military  dictatorship,  and  an  atom  is  proudly  pictured  on  one  of 
their  coins.  That’s  why  it  hurts  me  to  have  to  say  that  the  ancient 
Greek  hypothesis  that  matter  is  made  of  atoms  was  pure  guess¬ 
work.  There  was  no  real  experimental  evidence  for  atoms,  and  the 
18th-century  revival  of  the  atom  concept  by  Dalton  owed  little  to 
the  Greeks  other  than  the  name,  which  means  “unsplittable.”  Sub¬ 
tracting  even  more  cruelly  from  Greek  glory,  the  name  was  shown 
to  be  inappropriate  in  1897  when  physicist  J.J.  Thomson  proved  ex¬ 
perimentally  that  atoms  had  even  smaller  things  inside  them,  which 
could  be  extracted.  (Thomson  called  them  “electrons.”)  The  “un¬ 
splittable”  was  splittable  after  all. 

But  that’s  getting  ahead  of  our  story.  What  happened  to  the 
atom  concept  in  the  intervening  two  thousand  years?  Educated  peo¬ 
ple  continued  to  discuss  the  idea,  and  those  who  were  in  favor  of  it 
could  often  use  it  to  give  plausible  explanations  for  various  facts  and 
phenomena.  One  fact  that  was  readily  explained  was  conservation 
of  mass.  For  example,  if  you  mix  1  kg  of  water  with  1  kg  of  dirt, 
you  get  exactly  2  kg  of  mud,  no  more  and  no  less.  The  same  is  true 
for  the  a  variety  of  processes  such  as  freezing  of  water,  fermenting 
beer,  or  pulverizing  sandstone.  If  you  believed  in  atoms,  conserva¬ 
tion  of  mass  made  perfect  sense,  because  all  these  processes  could 
be  interpreted  as  mixing  and  rearranging  atoms,  without  changing 
the  total  number  of  atoms.  Still,  this  is  nothing  like  a  proof  that 
atoms  exist. 

If  atoms  did  exist,  what  types  of  atoms  were  there,  and  what  dis- 
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tinguished  the  different  types  from  each  other?  Was  it  their  sizes, 
their  shapes,  their  weights,  or  some  other  quality?  The  chasm  be¬ 
tween  the  ancient  and  modern  atomisms  becomes  evident  when  we 
consider  the  wild  speculations  that  existed  on  these  issues  until  the 
present  century.  The  ancients  decided  that  there  were  four  types  of 
atoms,  earth,  water,  air  and  fire;  the  most  popular  view  was  that 
they  were  distinguished  by  their  shapes.  Water  atoms  were  spher¬ 
ical,  hence  water’s  ability  to  flow  smoothly.  Fire  atoms  had  sharp 
points,  which  was  why  fire  hurt  when  it  touched  one’s  skin.  (There 
was  no  concept  of  temperature  until  thousands  of  years  later.)  The 
drastically  different  modern  understanding  of  the  structure  of  atoms 
was  achieved  in  the  course  of  the  revolutionary  decade  stretching 
1895  to  1905.  The  main  purpose  of  this  chapter  is  to  describe  those 
momentous  experiments. 

Atoms,  light,  and  everything  else 

Although  I  tend  to  ridicule  ancient  Greek  philosophers  like  Aris¬ 
totle,  let’s  take  a  moment  to  praise  him  for  something.  If  you  read 
Aristotle’s  writings  on  physics  (or  just  skim  them,  which  is  all  I’ve 
done),  the  most  striking  thing  is  how  careful  he  is  about  classifying 
phenomena  and  analyzing  relationships  among  phenomena.  The  hu¬ 
man  brain  seems  to  naturally  make  a  distinction  between  two  types 
of  physical  phenomena:  objects  and  motion  of  objects.  When  a 
phenomenon  occurs  that  does  not  immediately  present  itself  as  one 
of  these,  there  is  a  strong  tendency  to  conceptualize  it  as  one  or 
the  other,  or  even  to  ignore  its  existence  completely.  For  instance, 
physics  teachers  shudder  at  students’  statements  that  “the  dynamite 
exploded,  and  force  came  out  of  it  in  all  directions.”  In  these  exam¬ 
ples,  the  nonmaterial  concept  of  force  is  being  mentally  categorized 
as  if  it  was  a  physical  substance.  The  statement  that  “winding  the 
clock  stores  motion  in  the  spring”  is  a  miscategorization  of  electrical 
energy  as  a  form  of  motion.  An  example  of  ignoring  the  existence 
of  a  phenomenon  altogether  can  be  elicited  by  asking  people  why 
we  need  lamps.  The  typical  response  that  “the  lamp  illuminates 
the  room  so  we  can  see  things,”  ignores  the  necessary  role  of  light 
coming  into  our  eyes  from  the  things  being  illuminated. 

If  you  ask  someone  to  tell  you  briefly  about  atoms,  the  likely 
response  is  that  “everything  is  made  of  atoms,”  but  we’ve  now  seen 
that  it’s  far  from  obvious  which  “everything”  this  statement  would 
properly  refer  to.  For  the  scientists  of  the  early  1900s  who  were 
trying  to  investigate  atoms,  this  was  not  a  trivial  issue  of  defini¬ 
tions.  There  was  a  new  gizmo  called  the  vacuum  tube,  of  which  the 
only  familiar  example  today  is  the  picture  tube  of  a  TV.  In  short 
order,  electrical  tinkerers  had  discovered  a  whole  flock  of  new  phe¬ 
nomena  that  occurred  in  and  around  vacuum  tubes,  and  given  them 
picturesque  names  like  “x-rays,”  “cathode  rays,”  “Hertzian  waves,” 
and  “N-rays.”  These  were  the  types  of  observations  that  ended  up 
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telling  us  that  we  know  about  matter,  but  fierce  controversies  ensued 
over  whether  these  were  themselves  forms  of  matter. 

Let’s  bring  ourselves  up  to  the  level  of  classification  of  phenom¬ 
ena  employed  by  physicists  in  the  year  1900.  They  recognized  three 
categories: 

•  Matter  has  mass,  can  have  kinetic  energy,  and  can  travel 
through  a  vacuum,  transporting  its  mass  and  kinetic  energy 
with  it.  Matter  is  conserved,  both  in  the  sense  of  conservation 
of  mass  and  conservation  of  the  number  of  atoms  of  each  ele¬ 
ment.  Atoms  can’t  occupy  the  same  space  as  other  atoms,  so 
a  convenient  way  to  prove  something  is  not  a  form  of  matter 
is  to  show  that  it  can  pass  through  a  solid  material,  in  which 
the  atoms  are  packed  together  closely. 

•  Light  has  no  mass,  always  has  energy,  and  can  travel  through  a 
vacuum,  transporting  its  energy  with  it.  Two  light  beams  can 
penetrate  through  each  other  and  emerge  from  the  collision 
without  being  weakened,  deflected,  or  affected  in  any  other 
way.  Light  can  penetrate  certain  kinds  of  matter,  e.g.,  glass. 

•  The  third  category  is  everything  that  doesn’t  fit  the  defini¬ 
tion  of  light  or  matter.  This  catch-all  category  includes,  for 
example,  time,  velocity,  heat,  and  force. 


The  chemical  elements 


mHe 

=  3.97 

mH 

?BNe 

=  20.01 

mH 

msc 

=  44.60 

mH 

d  /  Examples  of  masses  of 
atoms  compared  to  that  of  hydro¬ 
gen.  Note  how  some,  but  not  all, 
are  close  to  integers. 


How  would  one  find  out  what  types  of  atoms  there  were?  To¬ 
day,  it  doesn’t  seem  like  it  should  have  been  very  difficult  to  work 
out  an  experimental  program  to  classify  the  types  of  atoms.  For 
each  type  of  atom,  there  should  be  a  corresponding  element,  i.e.,  a 
pure  substance  made  out  of  nothing  but  that  type  of  atom.  Atoms 
are  supposed  to  be  unsplittable,  so  a  substance  like  milk  could  not 
possibly  be  elemental,  since  churning  it  vigorously  causes  it  to  split 
up  into  two  separate  substances:  butter  and  whey.  Similarly,  rust 
could  not  be  an  element,  because  it  can  be  made  by  combining  two 
substances:  iron  and  oxygen.  Despite  its  apparent  reasonableness, 
no  such  program  was  carried  out  until  the  eighteenth  century.  The 
ancients  presumably  did  not  do  it  because  observation  was  not  uni¬ 
versally  agreed  on  as  the  right  way  to  answer  questions  about  nature, 
and  also  because  they  lacked  the  necessary  techniques  or  the  tech¬ 
niques  were  the  province  of  laborers  with  low  social  status,  such  as 
smiths  and  miners.  Alchemists  were  hindered  by  atomism’s  repu¬ 
tation  for  subversiveness,  and  by  a  tendency  toward  mysticism  and 
secrecy.  (The  most  celebrated  challenge  facing  the  alchemists,  that 
of  converting  lead  into  gold,  is  one  we  now  know  to  be  impossible, 
since  lead  and  gold  are  both  elements.) 

By  1900,  however,  chemists  had  done  a  reasonably  good  job  of 
finding  out  what  the  elements  were.  They  also  had  determined  the 
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ratios  of  the  different  atoms’  masses  fairly  accurately.  A  typical 
technique  would  be  to  measure  how  many  grams  of  sodium  (Na) 
would  combine  with  one  gram  of  chlorine  (Cl)  to  make  salt  (NaCl). 
(This  assumes  you’ve  already  decided  based  on  other  evidence  that 
salt  consisted  of  equal  numbers  of  Na  and  Cl  atoms.)  The  masses  of 
individual  atoms,  as  opposed  to  the  mass  ratios,  were  known  only 
to  within  a  few  orders  of  magnitude  based  on  indirect  evidence,  and 
plenty  of  physicists  and  chemists  denied  that  individual  atoms  were 
anything  more  than  convenient  symbols. 

Making  sense  of  the  elements 

As  the  information  accumulated,  the  challenge  was  to  find  a 
way  of  systematizing  it;  the  modern  scientist’s  aesthetic  sense  rebels 
against  complication.  This  hodgepodge  of  elements  was  an  embar¬ 
rassment.  One  contemporary  observer,  William  Crookes,  described 
the  elements  as  extending  “before  us  as  stretched  the  wide  Atlantic 
before  the  gaze  of  Columbus,  mocking,  taunting  and  murmuring 
strange  riddles,  which  no  man  has  yet  been  able  to  solve.”  It  wasn’t 
long  before  people  started  recognizing  that  many  atoms’  masses  were 
nearly  integer  multiples  of  the  mass  of  hydrogen,  the  lightest  ele¬ 
ment.  A  few  excitable  types  began  speculating  that  hydrogen  was 
the  basic  building  block,  and  that  the  heavier  elements  were  made 
of  clusters  of  hydrogen.  It  wasn’t  long,  however,  before  their  parade 
was  rained  on  by  more  accurate  measurements,  which  showed  that 
not  all  of  the  elements  had  atomic  masses  that  were  near  integer 
multiples  of  hydrogen,  and  even  the  ones  that  were  close  to  being 
integer  multiples  were  off  by  one  percent  or  so. 
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Chemistry  professor  Dmitri  Mendeleev,  preparing  his  lectures  in 
1869,  wanted  to  find  some  way  to  organize  his  knowledge  for  his  stu¬ 
dents  to  make  it  more  understandable.  He  wrote  the  names  of  all 
the  elements  on  cards  and  began  arranging  them  in  different  ways 
on  his  desk,  trying  to  find  an  arrangement  that  would  make  sense  of 
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e  /  A  modern  periodic  table.  Ele¬ 
ments  in  the  same  column  have 
similar  chemical  properties.  The 
modern  atomic  numbers,  dis¬ 
cussed  in  section  8.2,  were  not 
known  in  Mendeleev’s  time,  since 
the  table  could  be  flipped  in  vari¬ 
ous  ways. 
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the  muddle.  The  row-and-column  scheme  he  came  up  with  is  essen¬ 
tially  our  modern  periodic  table.  The  columns  of  the  modern  version 
represent  groups  of  elements  with  similar  chemical  properties,  and 
each  row  is  more  massive  than  the  one  above  it.  Going  across  each 
row,  this  almost  always  resulted  in  placing  the  atoms  in  sequence 
by  weight  as  well.  What  made  the  system  significant  was  its  predic¬ 
tive  value.  There  were  three  places  where  Mendeleev  had  to  leave 
gaps  in  his  checkerboard  to  keep  chemically  similar  elements  in  the 
same  column.  He  predicted  that  elements  would  exist  to  £11  these 
gaps,  and  extrapolated  or  interpolated  from  other  elements  in  the 
same  column  to  predict  their  numerical  properties,  such  as  masses, 
boiling  points,  and  densities.  Mendeleev’s  professional  stock  sky¬ 
rocketed  when  his  three  elements  (later  named  gallium,  scandium 
and  germanium)  were  discovered  and  found  to  have  very  nearly  the 
properties  he  had  predicted. 

One  thing  that  Mendeleev’s  table  made  clear  was  that  mass  was 
not  the  basic  property  that  distinguished  atoms  of  different  ele¬ 
ments.  To  make  his  table  work,  he  had  to  deviate  from  ordering 
the  elements  strictly  by  mass.  For  instance,  iodine  atoms  are  lighter 
than  tellurium,  but  Mendeleev  had  to  put  iodine  after  tellurium  so 
that  it  would  lie  in  a  column  with  chemically  similar  elements. 

Direct  proof  that  atoms  existed 

The  success  of  the  kinetic  theory  of  heat  was  taken  as  strong  evi¬ 
dence  that,  in  addition  to  the  motion  of  any  object  as  a  whole,  there 
is  an  invisible  type  of  motion  all  around  us:  the  random  motion  of 
atoms  within  each  object.  But  many  conservatives  were  not  con¬ 
vinced  that  atoms  really  existed.  Nobody  had  ever  seen  one,  after 
all.  It  wasn’t  until  generations  after  the  kinetic  theory  of  heat  was 
developed  that  it  was  demonstrated  conclusively  that  atoms  really 
existed  and  that  they  participated  in  continuous  motion  that  never 
died  out. 

The  smoking  gun  to  prove  atoms  were  more  than  mathematical 
abstractions  came  when  some  old,  obscure  observations  were  reex¬ 
amined  by  an  unknown  Swiss  patent  clerk  named  Albert  Einstein. 
A  botanist  named  Brown,  using  a  microscope  that  was  state  of  the 
art  in  1827,  observed  tiny  grains  of  pollen  in  a  drop  of  water  on  a 
microscope  slide,  and  found  that  they  jumped  around  randomly  for 
no  apparent  reason.  Wondering  at  first  if  the  pollen  he’d  assumed  to 
be  dead  was  actually  alive,  he  tried  looking  at  particles  of  soot,  and 
found  that  the  soot  particles  also  moved  around.  The  same  results 
would  occur  with  any  small  grain  or  particle  suspended  in  a  liquid. 
The  phenomenon  came  to  be  referred  to  as  Brownian  motion,  and 
its  existence  was  filed  away  as  a  quaint  and  thoroughly  unimportant 
fact,  really  just  a  nuisance  for  the  microscopist. 

It  wasn’t  until  1906  that  Einstein  found  the  correct  interpreta- 
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tion  for  Brown’s  observation:  the  water  molecules  were  in  continuous 
random  motion,  and  were  colliding  with  the  particle  all  the  time, 
kicking  it  in  random  directions.  After  all  the  millennia  of  speculation 
about  atoms,  at  last  there  was  solid  proof.  Einstein’s  calculations 
dispelled  all  doubt,  since  he  was  able  to  make  accurate  predictions 
of  things  like  the  average  distance  traveled  by  the  particle  in  a  cer¬ 
tain  amount  of  time.  (Einstein  received  the  Nobel  Prize  not  for  his 
theory  of  relativity  but  for  his  papers  on  Brownian  motion  and  the 
photoelectric  effect.) 

Discussion  Questions 

A  How  could  knowledge  of  the  size  of  an  individual  aluminum  atom  be 
used  to  infer  an  estimate  of  its  mass,  or  vice  versa? 

B  How  could  one  test  Einstein’s  interpretation  of  Brownian  motion  by 
observing  it  at  different  temperatures? 


8.1.4  Quantization  of  charge 

Proving  that  atoms  actually  existed  was  a  big  accomplishment, 
but  demonstrating  their  existence  was  different  from  understanding 
their  properties.  Note  that  the  Brown-Einstein  observations  had 
nothing  at  all  to  do  with  electricity,  and  yet  we  know  that  matter 
is  inherently  electrical,  and  we  have  been  successful  in  interpreting 
certain  electrical  phenomena  in  terms  of  mobile  positively  and  nega¬ 
tively  charged  particles.  Are  these  particles  atoms?  Parts  of  atoms? 
Particles  that  are  entirely  separate  from  atoms?  It  is  perhaps  pre¬ 
mature  to  attempt  to  answer  these  questions  without  any  conclusive 
evidence  in  favor  of  the  charged-particle  model  of  electricity. 

Strong  support  for  the  charged-particle  model  came  from  a  1911 
experiment  by  physicist  Robert  Millikan  at  the  University  of  Chicago. 
Consider  a  jet  of  droplets  of  perfume  or  some  other  liquid  made  by 
blowing  it  through  a  tiny  pinhole.  The  droplets  emerging  from  the 
pinhole  must  be  smaller  than  the  pinhole,  and  in  fact  most  of  them 
are  even  more  microscopic  than  that,  since  the  turbulent  flow  of  air 
tends  to  break  them  up.  Millikan  reasoned  that  the  droplets  would 
acquire  a  little  bit  of  electric  charge  as  they  rubbed  against  the  chan¬ 
nel  through  which  they  emerged,  and  if  the  charged-particle  model 
of  electricity  was  right,  the  charge  might  be  split  up  among  so  many 
minuscule  liquid  drops  that  a  single  drop  might  have  a  total  charge 
amounting  to  an  excess  of  only  a  few  charged  particles  —  perhaps 
an  excess  of  one  positive  particle  on  a  certain  drop,  or  an  excess  of 
two  negative  ones  on  another. 

Millikan’s  ingenious  apparatus,  g,  consisted  of  two  metal  plates, 
which  could  be  electrically  charged  as  needed.  He  sprayed  a  cloud  of 
oil  droplets  into  the  space  between  the  plates,  and  selected  one  drop 
through  a  microscope  for  study.  First,  with  no  charge  on  the  plates, 
he  would  determine  the  drop’s  mass  by  letting  it  fall  through  the 
air  and  measuring  its  terminal  velocity,  i.e.,  the  velocity  at  which 


f/A  young  Robert  Mil¬ 
likan .  (  Contemporary) 
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g/A  simplified  diagram  of 
Millikan’s  apparatus. 
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h  /  A  few  samples 
data. 


the  force  of  air  friction  canceled  out  the  force  of  gravity.  The  force 
of  air  drag  on  a  slowly  moving  sphere  had  already  been  found  by 
experiment  to  be  bvr 2,  where  b  was  a  constant.  Setting  the  total 
force  equal  to  zero  when  the  drop  is  at  terminal  velocity  gives 

bvr 2  —  mg  =  0, 


and  setting  the  known  density  of  oil  equal  to  the  drop’s  mass  divided 
by  its  volume  gives  a  second  equation, 


rn 


Everything  in  these  equations  can  be  measured  directly  except  for 
m  and  r,  so  these  are  two  equations  in  two  unknowns,  which  can  be 
solved  in  order  to  determine  how  big  the  drop  is. 

Next  Millikan  charged  the  metal  plates,  adjusting  the  amount 
of  charge  so  as  to  exactly  counteract  gravity  and  levitate  the  drop. 
If,  for  instance,  the  drop  being  examined  happened  to  have  a  total 
charge  that  was  negative,  then  positive  charge  put  on  the  top  plate 
would  attract  it,  pulling  it  up,  and  negative  charge  on  the  bottom 
plate  would  repel  it,  pushing  it  up.  (Theoretically  only  one  plate 
would  be  necessary,  but  in  practice  a  two-plate  arrangement  like  this 
gave  electrical  forces  that  were  more  uniform  in  strength  throughout 
the  space  where  the  oil  drops  were.)  The  amount  of  charge  on  the 
plates  required  to  levitate  the  charged  drop  gave  Millikan  a  handle 
on  the  amount  of  charge  the  drop  carried.  The  more  charge  the 
drop  had,  the  stronger  the  electrical  forces  on  it  would  be,  and  the 
less  charge  would  have  to  be  put  on  the  plates  to  do  the  trick.  Un¬ 
fortunately,  expressing  this  relationship  using  Coulomb’s  law  would 
have  been  impractical,  because  it  would  require  a  perfect  knowledge 
of  how  the  charge  was  distributed  on  each  plate,  plus  the  ability 
to  perform  vector  addition  of  all  the  forces  being  exerted  on  the 
drop  by  all  the  charges  on  the  plate.  Instead,  Millikan  made  use  of 
the  fact  that  the  electrical  force  experienced  by  a  pointlike  charged 
object  at  a  certain  point  in  space  is  proportional  to  its  charge, 


Q 

/  (1.64 
xlO-19  C) 
-12.02 
-6.02 
-16.93 

of  Millikan’s 


F 

—  =  constant. 

q 

With  a  given  amount  of  charge  on  the  plates,  this  constant  could  be 
determined  for  instance  by  discarding  the  oil  drop,  inserting  between 
the  plates  a  larger  and  more  easily  handled  object  with  a  known 
charge  on  it,  and  measuring  the  force  with  conventional  methods. 
(Millikan  actually  used  a  slightly  different  set  of  techniques  for  de¬ 
termining  the  constant,  but  the  concept  is  the  same.)  The  amount 
of  force  on  the  actual  oil  drop  had  to  equal  mg,  since  it  was  just 
enough  to  levitate  it,  and  once  the  calibration  constant  had  been 
determined,  the  charge  of  the  drop  could  then  be  found  based  on  its 
previously  determined  mass. 
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The  table  on  the  left  shows  a  few  of  the  results  from  Millikan’s 
1911  paper.  (Millikan  took  data  on  both  negatively  and  positively 
charged  drops,  but  in  his  paper  he  gave  only  a  sample  of  his  data  on 
negatively  charged  drops,  so  these  numbers  are  all  negative.)  Even  a 
quick  look  at  the  data  leads  to  the  suspicion  that  the  charges  are  not 
simply  a  series  of  random  numbers.  For  instance,  the  second  charge 
is  almost  exactly  equal  to  half  the  first  one.  Millikan  explained  the 
observed  charges  as  all  being  integer  multiples  of  a  single  number, 
1.64  x  10-19  C.  In  the  second  column,  dividing  by  this  constant  gives 
numbers  that  are  essentially  integers,  allowing  for  the  random  errors 
present  in  the  experiment.  Millikan  states  in  his  paper  that  these 
results  were  a 

. . .  direct  and  tangible  demonstration  ...  of  the  correct¬ 
ness  of  the  view  advanced  many  years  ago  and  supported 
by  evidence  from  many  sources  that  all  electrical  charges, 
however  produced,  are  exact  multiples  of  one  definite, 
elementary  electrical  charge,  or  in  other  words,  that  an 
electrical  charge  instead  of  being  spread  uniformly  over 
the  charged  surface  has  a  definite  granular  structure, 
consisting,  in  fact,  of  . . .  specks,  or  atoms  of  electric¬ 
ity,  all  precisely  alike,  peppered  over  the  surface  of  the 
charged  body. 

In  other  words,  he  had  provided  direct  evidence  for  the  charged- 
particle  model  of  electricity  and  against  models  in  which  electricity 
was  described  as  some  sort  of  fluid.  The  basic  charge  is  notated  e, 
and  the  modern  value  is  e  =  1.60  x  ICE19  C.  The  word  “ quantized ” 
is  used  in  physics  to  describe  a  quantity  that  can  only  have  certain 
numerical  values,  and  cannot  have  any  of  the  values  between  those. 
In  this  language,  we  would  say  that  Millikan  discovered  that  charge 
is  quantized.  The  charge  e  is  referred  to  as  the  quantum  of  charge. 

A  historical  note  on  Millikan ’s  fraud 

Very  few  undergraduate  physics  textbooks  mention  the  well- 
documented  fact  that  although  Millikan’s  conclusions  were  correct, 
he  was  guilty  of  scientific  fraud.  His  technique  was  difficult  and 
painstaking  to  perform,  and  his  original  notebooks,  which  have  been 
preserved,  show  that  the  data  were  far  less  perfect  than  he  claimed 
in  his  published  scientific  papers.  In  his  publications,  he  stated  cat¬ 
egorically  that  every  single  oil  drop  observed  had  had  a  charge  that 
was  a  multiple  of  e,  with  no  exceptions  or  omissions.  But  his  note¬ 
books  are  replete  with  notations  such  as  “beautiful  data,  keep,”  and 
“bad  run,  throw  out.”  Millikan,  then,  appears  to  have  earned  his 
Nobel  Prize  by  advocating  a  correct  position  with  dishonest  descrip¬ 
tions  of  his  data. 

Why  do  textbook  authors  fail  to  mention  Millikan’s  fraud?  It 
may  be  that  they  think  students  are  too  unsophisticated  to  cor- 


Section  8.1  The  Electric  Glue 


473 


light  emitted 
when  cathode 
rays  hit  air 


^  ^  light  emitted 


when  cathode 
rays  hit  the 
glass 

i  /  Cathode  rays  observed  in 
a  vacuum  tube. 


rectly  evaluate  the  implications  of  the  fact  that  scientific  fraud  has 
sometimes  existed  and  even  been  rewarded  by  the  scientific  estab¬ 
lishment.  Maybe  they  are  afraid  students  will  reason  that  fudging 
data  is  OK,  since  Millikan  got  the  Nobel  Prize  for  it.  But  falsifying 
history  in  the  name  of  encouraging  truthfulness  is  more  than  a  little 
ironic.  English  teachers  don’t  edit  Shakespeare’s  tragedies  so  that 
the  bad  characters  are  always  punished  and  the  good  ones  never 
suffer! 

self-check  C 

Is  money  quantized?  What  is  the  quantum  of  money?  >  Answer,  p. 

965 

8.1.5  The  electron 

Cathode  rays 

Nineteenth-century  physicists  spent  a  lot  of  time  trying  to  come 
up  with  wild,  random  ways  to  play  with  electricity.  The  best  ex¬ 
periments  of  this  kind  were  the  ones  that  made  big  sparks  or  pretty 
colors  of  light. 

One  such  parlor  trick  was  the  cathode  ray.  To  produce  it,  you 
first  had  to  hire  a  good  glassblower  and  find  a  good  vacuum  pump. 
The  glassblower  would  create  a  hollow  tube  and  embed  two  pieces  of 
metal  in  it,  called  the  electrodes,  which  were  connected  to  the  out¬ 
side  via  metal  wires  passing  through  the  glass.  Before  letting  him 
seal  up  the  whole  tube,  you  would  hook  it  up  to  a  vacuum  pump, 
and  spend  several  hours  huffing  and  puffing  away  at  the  pump’s 
hand  crank  to  get  a  good  vacuum  inside.  Then,  while  you  were  still 
pumping  on  the  tube,  the  glassblower  would  melt  the  glass  and  seal 
the  whole  thing  shut.  Finally,  you  would  put  a  large  amount  of  pos¬ 
itive  charge  on  one  wire  and  a  large  amount  of  negative  charge  on 
the  other.  Metals  have  the  property  of  letting  charge  move  through 
them  easily,  so  the  charge  deposited  on  one  of  the  wires  would 
quickly  spread  out  because  of  the  repulsion  of  each  part  of  it  for 
every  other  part.  This  spreading-out  process  would  result  in  nearly 
all  the  charge  ending  up  in  the  electrodes,  where  there  is  more  room 
to  spread  out  than  there  is  in  the  wire.  For  obscure  historical  rea¬ 
sons  a  negative  electrode  is  called  a  cathode  and  a  positive  one  is 
an  anode. 

Figure  i  shows  the  light-emitting  stream  that  was  observed.  If, 
as  shown  in  this  figure,  a  hole  was  made  in  the  anode,  the  beam 
would  extend  on  through  the  hole  until  it  hit  the  glass.  Drilling  a 
hole  in  the  cathode,  however  would  not  result  in  any  beam  coming 
out  on  the  left  side,  and  this  indicated  that  the  stuff,  whatever  it 
was,  was  coming  from  the  cathode.  The  rays  were  therefore  chris¬ 
tened  “cathode  rays.”  (The  terminology  is  still  used  today  in  the 
term  “cathode  ray  tube”  or  “CRT”  for  the  picture  tube  of  a  TV  or 
computer  monitor.) 
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Were  cathode  rays  a  form  of  light,  or  of  matter ? 

Were  cathode  rays  a  form  of  light,  or  matter?  At  first  no  one  re¬ 
ally  cared  what  they  were,  but  as  their  scientific  importance  became 
more  apparent,  the  light- versus- matter  issue  turned  into  a  contro¬ 
versy  along  nationalistic  lines,  with  the  Germans  advocating  light 
and  the  English  holding  out  for  matter.  The  supporters  of  the  ma¬ 
terial  interpretation  imagined  the  rays  as  consisting  of  a  stream  of 
atoms  ripped  from  the  substance  of  the  cathode. 

One  of  our  defining  characteristics  of  matter  is  that  material 
objects  cannot  pass  through  each  other.  Experiments  showed  that 
cathode  rays  could  penetrate  at  least  some  small  thickness  of  matter, 
such  as  a  metal  foil  a  tenth  of  a  millimeter  thick,  implying  that  they 
were  a  form  of  light. 

Other  experiments,  however,  pointed  to  the  contrary  conclusion. 
Light  is  a  wave  phenomenon,  and  one  distinguishing  property  of 
waves  is  demonstrated  by  speaking  into  one  end  of  a  paper  towel 
roll.  The  sound  waves  do  not  emerge  from  the  other  end  of  the 
tube  as  a  focused  beam.  Instead,  they  begin  spreading  out  in  all 
directions  as  soon  as  they  emerge.  This  shows  that  waves  do  not 
necessarily  travel  in  straight  lines.  If  a  piece  of  metal  foil  in  the  shape 
of  a  star  or  a  cross  was  placed  in  the  way  of  the  cathode  ray,  then 
a  “shadow”  of  the  same  shape  would  appear  on  the  glass,  showing 
that  the  rays  traveled  in  straight  lines.  This  straight-line  motion 
suggested  that  they  were  a  stream  of  small  particles  of  matter. 

These  observations  were  inconclusive,  so  what  was  really  needed 
was  a  determination  of  whether  the  rays  had  mass  and  weight.  The 
trouble  was  that  cathode  rays  could  not  simply  be  collected  in  a  cup 
and  put  on  a  scale.  When  the  cathode  ray  tube  is  in  operation,  one 
does  not  observe  any  loss  of  material  from  the  cathode,  or  any  crust 
being  deposited  on  the  anode. 

Nobody  could  think  of  a  good  way  to  weigh  cathode  rays,  so  the 
next  most  obvious  way  of  settling  the  light/matter  debate  was  to 
check  whether  the  cathode  rays  possessed  electrical  charge.  Light 
was  known  to  be  uncharged.  If  the  cathode  rays  carried  charge, 
they  were  definitely  matter  and  not  light,  and  they  were  presum¬ 
ably  being  made  to  jump  the  gap  by  the  simultaneous  repulsion  of 
the  negative  charge  in  the  cathode  and  attraction  of  the  positive 
charge  in  the  anode.  The  rays  would  overshoot  the  anode  because 
of  their  momentum.  (Although  electrically  charged  particles  do  not 
normally  leap  across  a  gap  of  vacuum,  very  large  amounts  of  charge 
were  being  used,  so  the  forces  were  unusually  intense.) 

Thomson ’s  experiments 

Physicist  J.J.  Thomson  at  Cambridge  carried  out  a  series  of 
definitive  experiments  on  cathode  rays  around  the  year  1897.  By 
turning  them  slightly  off  course  with  electrical  forces,  k,  he  showed 


j  /  J.J.  Thomson  in  the  lab. 
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that  they  were  indeed  electrically  charged,  which  was  strong  evi¬ 
dence  that  they  were  material.  Not  only  that,  but  he  proved  that 
they  had  mass,  and  measured  the  ratio  of  their  mass  to  their  charge, 
m/q.  Since  their  mass  was  not  zero,  he  concluded  that  they  were 
a  form  of  matter,  and  presumably  made  up  of  a  stream  of  micro¬ 
scopic,  negatively  charged  particles.  When  Millikan  published  his 
results  fourteen  years  later,  it  was  reasonable  to  assume  that  the 
charge  of  one  such  particle  equaled  minus  one  fundamental  charge, 
q  =  — e,  and  from  the  combination  of  Thomson’s  and  Millikan’s  re¬ 
sults  one  could  therefore  determine  the  mass  of  a  single  cathode  ray 
particle. 


k  /  Thomson’s  experiment  proving 
cathode  rays  had  electric  charge 
(redrawn  from  his  original  paper). 
The  cathode,  C,  and  anode,  A, 
are  as  in  any  cathode  ray  tube. 
The  rays  pass  through  a  slit  in 
the  anode,  and  a  second  slit,  B, 
is  interposed  in  order  to  make 
the  beam  thinner  and  eliminate 
rays  that  were  not  going  straight. 
Charging  plates  D  and  E  shows 
that  cathode  rays  have  charge: 
they  are  attracted  toward  the  pos¬ 
itive  plate  D  and  repelled  by  the 
negative  plate  E. 


The  basic  technique  for  determining  m/q  was  simply  to  measure 
the  angle  through  which  the  charged  plates  bent  the  beam.  The 
electric  force  acting  on  a  cathode  ray  particle  while  it  was  between 
the  plates  would  be  proportional  to  its  charge, 


Feiec  =  (known  constant)  •  q. 


Application  of  Newton’s  second  law,  a  =  F/m,  would  allow  m/q 
to  be  determined: 


m  known  constant 
q  a 

There  was  just  one  catch.  Thomson  needed  to  know  the  cathode 
ray  particles’  velocity  in  order  to  figure  out  their  acceleration.  At 
that  point,  however,  nobody  had  even  an  educated  guess  as  to  the 
speed  of  the  cathode  rays  produced  in  a  given  vacuum  tube.  The 
beam  appeared  to  leap  across  the  vacuum  tube  practically  instan¬ 
taneously,  so  it  was  no  simple  matter  of  timing  it  with  a  stopwatch! 

Thomson’s  clever  solution  was  to  observe  the  effect  of  both  elec¬ 
tric  and  magnetic  forces  on  the  beam.  The  magnetic  force  exerted 
by  a  particular  magnet  would  depend  on  both  the  cathode  ray’s 
charge  and  its  velocity: 

Fmag  =  (known  constant  #2)  •  qv 
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Thomson  played  with  the  electric  and  magnetic  forces  until  ei¬ 
ther  one  would  produce  an  equal  effect  on  the  beam,  allowing  him 
to  solve  for  the  velocity, 


(known  constant) 

v  =  — - - . 

(known  constant  #2) 

Knowing  the  velocity  (which  was  on  the  order  of  10%  of  the 
speed  of  light  for  his  setup),  he  was  able  to  find  the  acceleration 
and  thus  the  mass-to-charge  ratio  m/q.  Thomson’s  techniques  were 
relatively  crude  (or  perhaps  more  charitably  we  could  say  that  they 
stretched  the  state  of  the  art  of  the  time),  so  with  various  methods 
he  came  up  with  m/q  values  that  ranged  over  about  a  factor  of  two, 
even  for  cathode  rays  extracted  from  a  cathode  made  of  a  single 
material.  The  best  modern  value  is  m/q  =  5.69  x  ICE12  kg/C, 
which  is  consistent  with  the  low  end  of  Thomson’s  range. 

The  cathode  ray  as  a  subatomic  particle:  the  electron 

What  was  significant  about  Thomson’s  experiment  was  not  the 
actual  numerical  value  of  m/q,  however,  so  much  as  the  fact  that, 
combined  with  Millikan’s  value  of  the  fundamental  charge,  it  gave 
a  mass  for  the  cathode  ray  particles  that  was  thousands  of  times 
smaller  than  the  mass  of  even  the  lightest  atoms.  Even  without 
Millikan’s  results,  which  were  14  years  in  the  future,  Thomson  rec¬ 
ognized  that  the  cathode  rays’  m/q  was  thousands  of  times  smaller 
than  the  m/q  ratios  that  had  been  measured  for  electrically  charged 
atoms  in  chemical  solutions.  He  correctly  interpreted  this  as  evi¬ 
dence  that  the  cathode  rays  were  smaller  building  blocks  —  he  called 
them  electrons  —  out  of  which  atoms  themselves  were  formed.  This 
was  an  extremely  radical  claim,  coming  at  a  time  when  atoms  had 
not  yet  been  proven  to  exist!  Even  those  who  used  the  word  “atom” 
often  considered  them  no  more  than  mathematical  abstractions,  not 
literal  objects.  The  idea  of  searching  for  structure  inside  of  “un- 
splittable”  atoms  was  seen  by  some  as  lunacy,  but  within  ten  years 
Thomson’s  ideas  had  been  amply  verified  by  many  more  detailed 
experiments. 

Discussion  Questions 

A  Thomson  started  to  become  convinced  during  his  experiments  that 
the  “cathode  rays”  observed  coming  from  the  cathodes  of  vacuum  tubes 
were  building  blocks  of  atoms  —  what  we  now  call  electrons.  He  then 
carried  out  observations  with  cathodes  made  of  a  variety  of  metals,  and 
found  that  m/q  was  roughly  the  same  in  every  case,  considering  his  lim¬ 
ited  accuracy.  Given  his  suspicion,  why  did  it  make  sense  to  try  different 
metals?  How  would  the  consistent  values  of  m/q  serve  to  test  his  hypoth¬ 
esis? 

B  My  students  have  frequently  asked  whether  the  m/q  that  Thomson 
measured  was  the  value  for  a  single  electron,  or  for  the  whole  beam.  Can 
you  answer  this  question? 
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C  Thomson  found  that  the  m/q  of  an  electron  was  thousands  of  times 
smaller  than  that  of  charged  atoms  in  chemical  solutions.  Would  this  imply 
that  the  electrons  had  more  charge?  Less  mass?  Would  there  be  no  way 
to  tell?  Explain.  Remember  that  Millikan’s  results  were  still  many  years  in 
the  future,  so  q  was  unknown. 

D  Can  you  guess  any  practical  reason  why  Thomson  couldn’t  just 
let  one  electron  fly  across  the  gap  before  disconnecting  the  battery  and 
turning  off  the  beam,  and  then  measure  the  amount  of  charge  deposited 
on  the  anode,  thus  allowing  him  to  measure  the  charge  of  a  single  electron 
directly? 

E  Why  is  it  not  possible  to  determine  m  and  q  themselves,  rather  than 
just  their  ratio,  by  observing  electrons’  motion  in  electric  and  magnetic 
fields? 


I /The  raisin  cookie  model  of 
the  atom  with  four  units  of 
charge,  which  we  now  know  to  be 
beryllium. 


8.1.6  The  raisin  cookie  model  of  the  atom 

Based  on  his  experiments,  Thomson  proposed  a  picture  of  the 
atom  which  became  known  as  the  raisin  cookie  model.  In  the  neutral 
atom,  1,  there  are  four  electrons  with  a  total  charge  of  —  4e,  sitting  in 
a  sphere  (the  “cookie”)  with  a  charge  of  +4e  spread  throughout  it. 
It  was  known  that  chemical  reactions  could  not  change  one  element 
into  another,  so  in  Thomson’s  scenario,  each  element’s  cookie  sphere 
had  a  permanently  fixed  radius,  mass,  and  positive  charge,  different 
from  those  of  other  elements.  The  electrons,  however,  were  not  a 
permanent  feature  of  the  atom,  and  could  be  tacked  on  or  pulled  out 
to  make  charged  ions.  Although  we  now  know,  for  instance,  that  a 
neutral  atom  with  four  electrons  is  the  element  beryllium,  scientists 
at  the  time  did  not  know  how  many  electrons  the  various  neutral 
atoms  possessed. 

This  model  is  clearly  different  from  the  one  you’ve  learned  in 
grade  school  or  through  popular  culture,  where  the  positive  charge 
is  concentrated  in  a  tiny  nucleus  at  the  atom’s  center.  An  equally 
important  change  in  ideas  about  the  atom  has  been  the  realization 
that  atoms  and  their  constituent  subatomic  particles  behave  entirely 
differently  from  objects  on  the  human  scale.  For  instance,  we’ll  see 
later  that  an  electron  can  be  in  more  than  one  place  at  one  time. 
The  raisin  cookie  model  was  part  of  a  long  tradition  of  attempts 
to  make  mechanical  models  of  phenomena,  and  Thomson  and  his 
contemporaries  never  questioned  the  appropriateness  of  building  a 
mental  model  of  an  atom  as  a  machine  with  little  parts  inside.  To¬ 
day,  mechanical  models  of  atoms  are  still  used  (for  instance  the 
tinker-toy-style  molecular  modeling  kits  like  the  ones  used  by  Wat¬ 
son  and  Crick  to  figure  out  the  double  helix  structure  of  DNA),  but 
scientists  realize  that  the  physical  objects  are  only  aids  to  help  our 
brains’  symbolic  and  visual  processes  think  about  atoms. 

Although  there  was  no  clear-cut  experimental  evidence  for  many 
of  the  details  of  the  raisin  cookie  model,  physicists  went  ahead  and 
started  working  out  its  implications.  For  instance,  suppose  you  had 
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a  four-electron  atom.  All  four  electrons  would  be  repelling  each 
other,  but  they  would  also  all  be  attracted  toward  the  center  of  the 
“cookie”  sphere.  The  result  should  be  some  kind  of  stable,  sym¬ 
metric  arrangement  in  which  all  the  forces  canceled  out.  People 
sufficiently  clever  with  math  soon  showed  that  the  electrons  in  a 
four-electron  atom  should  settle  down  at  the  vertices  of  a  pyramid 
with  one  less  side  than  the  Egyptian  kind,  i.e. ,  a  regular  tetrahe¬ 
dron.  This  deduction  turns  out  to  be  wrong  because  it  was  based 
on  incorrect  features  of  the  model,  but  the  model  also  had  many 
successes,  a  few  of  which  we  will  now  discuss. 

Flow  of  electrical  charge  in  wires  example  3 

One  of  my  former  students  was  the  son  of  an  electrician,  and 
had  become  an  electrician  himself.  He  related  to  me  how  his 
father  had  remained  refused  to  believe  all  his  life  that  electrons 
really  flowed  through  wires.  If  they  had,  he  reasoned,  the  metal 
would  have  gradually  become  more  and  more  damaged,  eventu¬ 
ally  crumbling  to  dust. 

His  opinion  is  not  at  all  unreasonable  based  on  the  fact  that  elec¬ 
trons  are  material  particles,  and  that  matter  cannot  normally  pass 
through  matter  without  making  a  hole  through  it.  Nineteenth- 
century  physicists  would  have  shared  his  objection  to  a  charged- 
particle  model  of  the  flow  of  electrical  charge.  In  the  raisin-cookie 
model,  however,  the  electrons  are  very  low  in  mass,  and  there¬ 
fore  presumably  very  small  in  size  as  well.  It  is  not  surprising  that 
they  can  slip  between  the  atoms  without  damaging  them. 

Flow  of  electrical  charge  across  cell  membranes  example  4 
Your  nervous  system  is  based  on  signals  carried  by  charge  mov¬ 
ing  from  nerve  cell  to  nerve  cell.  Your  body  is  essentially  all  liquid, 
and  atoms  in  a  liquid  are  mobile.  This  means  that,  unlike  the  case 
of  charge  flowing  in  a  solid  wire,  entire  charged  atoms  can  flow  in 
your  nervous  system 

Emission  of  electrons  in  a  cathode  ray  tube  example  5 

Why  do  electrons  detach  themselves  from  the  cathode  of  a  vac¬ 
uum  tube?  Certainly  they  are  encouraged  to  do  so  by  the  re¬ 
pulsion  of  the  negative  charge  placed  on  the  cathode  and  the 
attraction  from  the  net  positive  charge  of  the  anode,  but  these  are 
not  strong  enough  to  rip  electrons  out  of  atoms  by  main  force  — 
if  they  were,  then  the  entire  apparatus  would  have  been  instantly 
vaporized  as  every  atom  was  simultaneously  ripped  apart! 

The  raisin  cookie  model  leads  to  a  simple  explanation.  We  know 
that  heat  is  the  energy  of  random  motion  of  atoms.  The  atoms  in 
any  object  are  therefore  violently  jostling  each  other  all  the  time, 
and  a  few  of  these  collisions  are  violent  enough  to  knock  electrons 
out  of  atoms.  If  this  occurs  near  the  surface  of  a  solid  object,  the 
electron  may  can  come  loose.  Ordinarily,  however,  this  loss  of 
electrons  is  a  self-limiting  process;  the  loss  of  electrons  leaves 
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the  object  with  a  net  positive  charge,  which  attracts  the  lost  sheep 
home  to  the  fold.  (For  objects  immersed  in  air  rather  than  vacuum, 
there  will  also  be  a  balanced  exchange  of  electrons  between  the 
air  and  the  object.) 

This  interpretation  explains  the  warm  and  friendly  yellow  glow  of 
the  vacuum  tubes  in  an  antique  radio.  To  encourage  the  emission 
of  electrons  from  the  vacuum  tubes’  cathodes,  the  cathodes  are 
intentionally  warmed  up  with  little  heater  coils. 

Discussion  Questions 

A  Today  many  people  would  define  an  ion  as  an  atom  (or  molecule) 
with  missing  electrons  or  extra  electrons  added  on.  How  would  people 
have  defined  the  word  “ion”  before  the  discovery  of  the  electron? 

B  Since  electrically  neutral  atoms  were  known  to  exist,  there  had  to  be 
positively  charged  subatomic  stuff  to  cancel  out  the  negatively  charged 
electrons  in  an  atom.  Based  on  the  state  of  knowledge  immediately  after 
the  Millikan  and  Thomson  experiments,  was  it  possible  that  the  positively 
charged  stuff  had  an  unquantized  amount  of  charge?  Could  it  be  quan¬ 
tized  in  units  of  +e?  In  units  of  +2e?  In  units  of  +5/7e? 

This  chapter  is  summarized  on  page  998.  Notation  and  terminology 
are  tabulated  on  pages  984-985. 

8.2  The  Nucleus 

8.2.1  Radioactivity 

Becquerel’s  discovery  of  radioactivity 

How  did  physicists  figure  out  that  the  raisin  cookie  model  was 
incorrect,  and  that  the  atom’s  positive  charge  was  concentrated  in 
a  tiny,  central  nucleus?  The  story  begins  with  the  discovery  of  ra¬ 
dioactivity  by  the  French  chemist  Becquerel.  Up  until  radioactivity 
was  discovered,  all  the  processes  of  nature  were  thought  to  be  based 
on  chemical  reactions,  which  were  rearrangements  of  combinations 
of  atoms.  Atoms  exert  forces  on  each  other  when  they  are  close  to¬ 
gether,  so  sticking  or  unsticking  them  would  either  release  or  store 
electrical  energy.  That  energy  could  be  converted  to  and  from  other 
forms,  as  when  a  plant  uses  the  energy  in  sunlight  to  make  sugars 
and  carbohydrates,  or  when  a  child  eats  sugar,  releasing  the  energy 
in  the  form  of  kinetic  energy. 

Becquerel  discovered  a  process  that  seemed  to  release  energy 
from  an  unknown  new  source  that  was  not  chemical.  Becquerel, 
whose  father  and  grandfather  had  also  been  physicists,  spent  the 
first  twenty  years  of  his  professional  life  as  a  successful  civil  engi¬ 
neer,  teaching  physics  on  a  part-time  basis.  He  was  awarded  the 
chair  of  physics  at  the  Musee  d’Histoire  Naturelle  in  Paris  after  the 
death  of  his  father,  who  had  previously  occupied  it.  Having  now  a 
significant  amount  of  time  to  devote  to  physics,  he  began  studying 
the  interaction  of  light  and  matter.  He  became  interested  in  the  phe- 


480 


Chapter  8  Atoms  and  Electromagnetism 


nomenon  of  phosphorescence,  in  which  a  substance  absorbs  energy 
from  light,  then  releases  the  energy  via  a  glow  that  only  gradually 
goes  away.  One  of  the  substances  he  investigated  was  a  uranium 
compound,  the  salt  UKSO5.  One  day  in  1896,  cloudy  weather  in¬ 
terfered  with  his  plan  to  expose  this  substance  to  sunlight  in  order 
to  observe  its  fluorescence.  He  stuck  it  in  a  drawer,  coincidentally  on 
top  of  a  blank  photographic  plate  —  the  old-fashioned  glass-backed 
counterpart  of  the  modern  plastic  roll  of  film.  The  plate  had  been 
carefully  wrapped,  but  several  days  later  when  Becquerel  checked  it 
in  the  darkroom  before  using  it,  he  found  that  it  was  ruined,  as  if  it 
had  been  completely  exposed  to  light. 

History  provides  many  examples  of  scientific  discoveries  that 
happened  this  way:  an  alert  and  inquisitive  mind  decides  to  in¬ 
vestigate  a  phenomenon  that  most  people  would  not  have  worried 
about  explaining.  Becquerel  first  determined  by  further  experiments 
that  the  effect  was  produced  by  the  uranium  salt,  despite  a  thick 
wrapping  of  paper  around  the  plate  that  blocked  out  all  light.  He 
tried  a  variety  of  compounds,  and  found  that  it  was  the  uranium 
that  did  it:  the  effect  was  produced  by  any  uranium  compound,  but 
not  by  any  compound  that  didn’t  include  uranium  atoms.  The  effect 
could  be  at  least  partially  blocked  by  a  sufficient  thickness  of  metal, 
and  he  was  able  to  produce  silhouettes  of  coins  by  interposing  them 
between  the  uranium  and  the  plate.  This  indicated  that  the  effect 
traveled  in  a  straight  line.,  so  that  it  must  have  been  some  kind  of 
ray  rather  than,  e.g.,  the  seepage  of  chemicals  through  the  paper. 
He  used  the  word  “radiations,”  since  the  effect  radiated  out  from 
the  uranium  salt. 

At  this  point  Becquerel  still  believed  that  the  uranium  atoms 
were  absorbing  energy  from  light  and  then  gradually  releasing  the 
energy  in  the  form  of  the  mysterious  rays,  and  this  was  how  he 
presented  it  in  his  first  published  lecture  describing  his  experiments. 
Interesting,  but  not  earth-shattering.  But  he  then  tried  to  determine 
how  long  it  took  for  the  uranium  to  use  up  all  the  energy  that  had 
supposedly  been  stored  in  it  by  light,  and  he  found  that  it  never 
seemed  to  become  inactive,  no  matter  how  long  he  waited.  Not  only 
that,  but  a  sample  that  had  been  exposed  to  intense  sunlight  for  a 
whole  afternoon  was  no  more  or  less  effective  than  a  sample  that 
had  always  been  kept  inside.  Was  this  a  violation  of  conservation 
of  energy?  If  the  energy  didn’t  come  from  exposure  to  light,  where 
did  it  come  from? 

Three  kinds  of  “radiations” 

Unable  to  determine  the  source  of  the  energy  directly,  turn-of- 
the-century  physicists  instead  studied  the  behavior  of  the  “radia¬ 
tions”  once  they  had  been  emitted.  Becquerel  had  already  shown 
that  the  radioactivity  could  penetrate  through  cloth  and  paper,  so 
the  first  obvious  thing  to  do  was  to  investigate  in  more  detail  what 


a  /  Henri  Becquerel  (1852-1908). 


b  /  Becquerel’s  photographic 
plate.  In  the  exposure  at  the 
bottom  of  the  image,  he  has 
found  that  he  could  absorb  the 
radiations,  casting  the  shadow 
of  a  Maltese  cross  that  was 
placed  between  the  plate  and  the 
uranium  salts. 
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thickness  of  material  the  radioactivity  could  get  through.  They  soon 
learned  that  a  certain  fraction  of  the  radioactivity’s  intensity  would 
be  eliminated  by  even  a  few  inches  of  air,  but  the  remainder  was 
not  eliminated  by  passing  through  more  air.  Apparently,  then,  the 
radioactivity  was  a  mixture  of  more  than  one  type,  of  which  one  was 
blocked  by  air.  They  then  found  that  of  the  part  that  could  pene¬ 
trate  air,  a  further  fraction  could  be  eliminated  by  a  piece  of  paper 
or  a  very  thin  metal  foil.  What  was  left  after  that,  however,  was 
a  third,  extremely  penetrating  type,  some  of  whose  intensity  would 
still  remain  even  after  passing  through  a  brick  wall.  They  decided 
that  this  showed  there  were  three  types  of  radioactivity,  and  with¬ 
out  having  the  faintest  idea  of  what  they  really  were,  they  made  up 
names  for  them.  The  least  penetrating  type  was  arbitrarily  labeled 
a  (alpha),  the  first  letter  of  the  Greek  alphabet,  and  so  on  through 
fd  (beta)  and  finally  7  (gamma)  for  the  most  penetrating  type. 

Radium:  a  more  intense  source  of  radioactivity 

The  measuring  devices  used  to  detect  radioactivity  were  crude: 
photographic  plates  or  even  human  eyeballs  (radioactivity  makes 
flashes  of  light  in  the  jelly-like  fluid  inside  the  eye,  which  can  be 
seen  by  the  eyeball’s  owner  if  it  is  otherwise  very  dark).  Because  the 
ways  of  detecting  radioactivity  were  so  crude  and  insensitive,  further 
progress  was  hindered  by  the  fact  that  the  amount  of  radioactivity 
emitted  by  uranium  was  not  really  very  great.  The  vital  contribu¬ 
tion  of  physicist/chemist  Marie  Curie  and  her  husband  Pierre  was 
to  discover  the  element  radium,  and  to  purify  and  isolate  significant 
quantities  it.  Radium  emits  about  a  million  times  more  radioactivity 
per  unit  mass  than  uranium,  making  it  possible  to  do  the  experi¬ 
ments  that  were  needed  to  learn  the  true  nature  of  radioactivity. 
The  dangers  of  radioactivity  to  human  health  were  then  unknown, 
and  Marie  died  of  leukemia  thirty  years  later.  (Pierre  was  run  over 
and  killed  by  a  horsecart.) 

Tracking  down  the  nature  of  alphas,  betas,  and  gammas 

As  radium  was  becoming  available,  an  apprentice  scientist  named 
Ernest  Rutherford  arrived  in  England  from  his  native  New  Zealand 
and  began  studying  radioactivity  at  the  Cavendish  Laboratory.  The 
young  colonial’s  first  success  was  to  measure  the  mass-to-charge  ra¬ 
tio  of  beta  rays.  The  technique  was  essentially  the  same  as  the  one 
Thomson  had  used  to  measure  the  mass-to-charge  ratio  of  cathode 
rays  by  measuring  their  deflections  in  electric  and  magnetic  fields. 
The  only  difference  was  that  instead  of  the  cathode  of  a  vacuum 
tube,  a  nugget  of  radium  was  used  to  supply  the  beta  rays.  Not 
only  was  the  technique  the  same,  but  so  was  the  result.  Beta  rays 
had  the  same  m/q  ratio  as  cathode  rays,  which  suggested  they  were 
one  and  the  same.  Nowadays,  it  would  make  sense  simply  to  use 
the  term  “electron,”  and  avoid  the  archaic  “cathode  ray”  and  “beta 
particle,”  but  the  old  labels  are  still  widely  used,  and  it  is  unfortu- 
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nately  necessary  for  physics  students  to  memorize  all  three  names 
for  the  same  thing. 

At  first,  it  seemed  that  neither  alphas  or  gammas  could  be  de¬ 
flected  in  electric  or  magnetic  fields,  making  it  appear  that  neither 
was  electrically  charged.  But  soon  Rutherford  obtained  a  much  more 
powerful  magnet,  and  was  able  to  use  it  to  deflect  the  alphas  but 
not  the  gammas.  The  alphas  had  a  much  larger  value  of  m/q  than 
the  betas  (about  4000  times  greater),  which  was  why  they  had  been 
so  hard  to  deflect.  Gammas  are  uncharged,  and  were  later  found  to 
be  a  form  of  light. 

The  m/q  ratio  of  alpha  particles  turned  out  to  be  the  same 
as  those  of  two  different  types  of  ions,  He++  (a  helium  atom  with 
two  missing  electrons)  and  (two  hydrogen  atoms  bonded  into  a 
molecule,  with  one  electron  missing),  so  it  seemed  likely  that  they 
were  one  or  the  other  of  those.  The  diagram  shows  a  simplified  ver¬ 
sion  of  Rutherford’s  ingenious  experiment  proving  that  they  were 
He++  ions.  The  gaseous  element  radon,  an  alpha  emitter,  was  in¬ 
troduced  into  one  half  of  a  double  glass  chamber.  The  glass  wall 
dividing  the  chamber  was  made  extremely  thin,  so  that  some  of  the 
rapidly  moving  alpha  particles  were  able  to  penetrate  it.  The  other 
chamber,  which  was  initially  evacuated,  gradually  began  to  accu¬ 
mulate  a  population  of  alpha  particles  (which  would  quickly  pick  up 
electrons  from  their  surroundings  and  become  electrically  neutral). 
Rutherford  then  determined  that  it  was  helium  gas  that  had  ap¬ 
peared  in  the  second  chamber.  Thus  alpha  particles  were  proved  to 
be  He++  ions.  The  nucleus  was  yet  to  be  discovered,  but  in  modern 
terms,  we  would  describe  a  He++  ion  as  the  nucleus  of  a  He  atom. 

To  summarize,  here  are  the  three  types  of  radiation  emitted  by 
radioactive  elements,  and  their  descriptions  in  modern  terms: 


a  particle 

stopped  by  a  few  inches  of  air 

He  nucleus 

(3  particle 

stopped  by  a  piece  of  paper 

electron 

7  ray 

penetrates  thick  shielding 

a  type  of  light 

Discussion  Question 

A  Most  sources  of  radioactivity  emit  alphas,  betas,  and  gammas,  not 
just  one  of  the  three.  In  the  radon  experiment,  how  did  Rutherford  know 
that  he  was  studying  the  alphas? 

8.2.2  The  planetary  model 

The  stage  was  now  set  for  the  unexpected  discovery  that  the 
positively  charged  part  of  the  atom  was  a  tiny,  dense  lump  at  the 
atom’s  center  rather  than  the  “cookie  dough”  of  the  raisin  cookie 
model.  By  1909,  Rutherford  was  an  established  professor,  and  had 
students  working  under  him.  For  a  raw  undergraduate  named  Mars- 
den,  he  picked  a  research  project  he  thought  would  be  tedious  but 
straightforward . 


alpha  particles 


thin  glass  wall 


•  • 


radon  gas 


•  •  . 


c  /  A  simplified  version  of  Ruther¬ 
ford’s  1908  experiment,  showing 
that  alpha  particles  were  doubly 
ionized  helium  atoms. 


d/ These  pellets  of  uranium 
fuel  will  be  inserted  into  the  metal 
fuel  rod  and  used  in  a  nuclear 
reactor.  The  pellets  emit  alpha 
and  beta  radiation,  which  the 
gloves  are  thick  enough  to  stop. 
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e  /  Ernest  Rutherford  (1871- 
1937). 


f  /  Marsden  and  Rutherford’s 
apparatus. 


It  was  already  known  that  although  alpha  particles  would  be 
stopped  completely  by  a  sheet  of  paper,  they  could  pass  through  a 
sufficiently  thin  metal  foil.  Marsden  was  to  work  with  a  gold  foil 
only  1000  atoms  thick.  (The  foil  was  probably  made  by  evaporating 
a  little  gold  in  a  vacuum  chamber  so  that  a  thin  layer  would  be 
deposited  on  a  glass  microscope  slide.  The  foil  would  then  be  lifted 
off  the  slide  by  submerging  the  slide  in  water.) 

Rutherford  had  already  determined  in  his  previous  experiments 
the  speed  of  the  alpha  particles  emitted  by  radium,  a  fantastic  1.5  x 
107  m/s.  The  experimenters  in  Rutherford’s  group  visualized  them 
as  very  small,  very  fast  cannonballs  penetrating  the  “cookie  dough” 
part  of  the  big  gold  atoms.  A  piece  of  paper  has  a  thickness  of  a 
hundred  thousand  atoms  or  so,  which  would  be  sufficient  to  stop 
them  completely,  but  crashing  through  a  thousand  would  only  slow 
them  a  little  and  turn  them  slightly  off  of  their  original  paths. 

Marsden’s  supposedly  ho-hum  assignment  was  to  use  the  appa¬ 
ratus  shown  in  figure  f  to  measure  how  often  alpha  particles  were 
deflected  at  various  angles.  A  tiny  lump  of  radium  in  a  box  emit¬ 
ted  alpha  particles,  and  a  thin  beam  was  created  by  blocking  all 
the  alphas  except  those  that  happened  to  pass  out  through  a  tube. 
Typically  deflected  in  the  gold  by  only  a  small  amount,  they  would 
reach  a  screen  very  much  like  the  screen  of  a  TV’s  picture  tube, 
which  would  make  a  flash  of  light  when  it  was  hit.  Here  is  the  first 
example  we  have  encountered  of  an  experiment  in  which  a  beam  of 
particles  is  detected  one  at  a  time.  This  was  possible  because  each 
alpha  particle  carried  so  much  kinetic  energy;  they  were  moving  at 
about  the  same  speed  as  the  electrons  in  the  Thomson  experiment, 
but  had  ten  thousand  times  more  mass. 

Marsden  sat  in  a  dark  room,  watching  the  apparatus  hour  after 
hour  and  recording  the  number  of  flashes  with  the  screen  moved  to 
various  angles.  The  rate  of  the  flashes  was  highest  when  he  set  the 
screen  at  an  angle  close  to  the  line  of  the  alphas’  original  path,  but  if 
he  watched  an  area  farther  off  to  the  side,  he  would  also  occasionally 
see  an  alpha  that  had  been  deflected  through  a  larger  angle.  After 
seeing  a  few  of  these,  he  got  the  crazy  idea  of  moving  the  screen  to 
see  if  even  larger  angles  ever  occurred,  perhaps  even  angles  larger 
than  90  degrees. 


The  crazy  idea  worked:  a  few  alpha  particles  were  deflected 
through  angles  of  up  to  180  degrees,  and  the  routine  experiment 
had  become  an  epoch-making  one.  Rutherford  said,  “We  have  been 
able  to  get  some  of  the  alpha  particles  coming  backwards.  It  was 
almost  as  incredible  as  if  you  fired  a  15-inch  shell  at  a  piece  of  tissue 
paper  and  it  came  back  and  hit  you.”  Explanations  were  hard  to 
come  by  in  the  raisin  cookie  model.  What  intense  electrical  forces 
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•  gold  nucleus 


g/ Alpha  particles  being  scattered  by  a  gold  nucleus.  On  this  scale, 
the  gold  atom  is  the  size  of  a  car,  so  all  the  alpha  particles  shown  here 
are  ones  that  just  happened  to  come  unusually  close  to  the  nucleus. 
For  these  exceptional  alpha  particles,  the  forces  from  the  electrons  are 
unimportant,  because  they  are  so  much  more  distant  than  the  nucleus. 


could  have  caused  some  of  the  alpha  particles,  moving  at  such  astro¬ 
nomical  speeds,  to  change  direction  so  drastically?  Since  each  gold 
atom  was  electrically  neutral,  it  would  not  exert  much  force  on  an 
alpha  particle  outside  it.  True,  if  the  alpha  particle  was  very  near  to 
or  inside  of  a  particular  atom,  then  the  forces  would  not  necessarily 
cancel  out  perfectly;  if  the  alpha  particle  happened  to  come  very 
close  to  a  particular  electron,  the  1/r2  form  of  the  Coulomb  force 
law  would  make  for  a  very  strong  force.  But  Marsden  and  Ruther¬ 
ford  knew  that  an  alpha  particle  was  8000  times  more  massive  than 
an  electron,  and  it  is  simply  not  possible  for  a  more  massive  object 
to  rebound  backwards  from  a  collision  with  a  less  massive  object 
while  conserving  momentum  and  energy.  It  might  be  possible  in 
principle  for  a  particular  alpha  to  follow  a  path  that  took  it  very 
close  to  one  electron,  and  then  very  close  to  another  electron,  and  so 
on,  with  the  net  result  of  a  large  deflection,  but  careful  calculations 
showed  that  such  multiple  “close  encounters”  with  electrons  would 
be  millions  of  times  too  rare  to  explain  what  was  actually  observed. 

At  this  point,  Rutherford  and  Marsden  dusted  off  an  unpop¬ 
ular  and  neglected  model  of  the  atom,  in  which  all  the  electrons 
orbited  around  a  small,  positively  charged  core  or  “nucleus,”  just 
like  the  planets  orbiting  around  the  sun.  All  the  positive  charge 
and  nearly  all  the  mass  of  the  atom  would  be  concentrated  in  the 
nucleus,  rather  than  spread  throughout  the  atom  as  in  the  raisin 
cookie  model.  The  positively  charged  alpha  particles  would  be  re¬ 
pelled  by  the  gold  atom’s  nucleus,  but  most  of  the  alphas  would  not 
come  close  enough  to  any  nucleus  to  have  their  paths  drastically 
altered.  The  few  that  did  come  close  to  a  nucleus,  however,  could 
rebound  backwards  from  a  single  such  encounter,  since  the  nucleus  of 
a  heavy  gold  atom  would  be  fifty  times  more  massive  than  an  alpha 
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orbiting  electrons 


h/The  planetary  model  of 
the  atom. 
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particle.  It  turned  out  that  it  was  not  even  too  difficult  to  derive  a 
formula  giving  the  relative  frequency  of  deflections  through  various 
angles,  and  this  calculation  agreed  with  the  data  well  enough  (to 
within  15%),  considering  the  difficulty  in  getting  good  experimental 
statistics  on  the  rare,  very  large  angles. 

What  had  started  out  as  a  tedious  exercise  to  get  a  student 
started  in  science  had  ended  as  a  revolution  in  our  understanding 
of  nature.  Indeed,  the  whole  thing  may  sound  a  little  too  much 
like  a  moralistic  fable  of  the  scientific  method  with  overtones  of 
the  Horatio  Alger  genre.  The  skeptical  reader  may  wonder  why 
the  planetary  model  was  ignored  so  thoroughly  until  Marsden  and 
Rutherford’s  discovery.  Is  science  really  more  of  a  sociological  enter¬ 
prise,  in  which  certain  ideas  become  accepted  by  the  establishment, 
and  other,  equally  plausible  explanations  are  arbitrarily  discarded? 
Some  social  scientists  are  currently  ruffling  a  lot  of  scientists’  feath¬ 
ers  with  critiques  very  much  like  this,  but  in  this  particular  case, 
there  were  very  sound  reasons  for  rejecting  the  planetary  model.  As 
you’ll  learn  in  more  detail  later  in  this  course,  any  charged  particle 
that  undergoes  an  acceleration  dissipate  energy  in  the  form  of  light. 
In  the  planetary  model,  the  electrons  were  orbiting  the  nucleus  in 
circles  or  ellipses,  which  meant  they  were  undergoing  acceleration, 
just  like  the  acceleration  you  feel  in  a  car  going  around  a  curve.  They 
should  have  dissipated  energy  as  light,  and  eventually  they  should 
have  lost  all  their  energy.  Atoms  don’t  spontaneously  collapse  like 
that,  which  was  why  the  raisin  cookie  model,  with  its  stationary 
electrons,  was  originally  preferred.  There  were  other  problems  as 
well.  In  the  planetary  model,  the  one-electron  atom  would  have 
to  be  flat,  which  would  be  inconsistent  with  the  success  of  molecu¬ 
lar  modeling  with  spherical  balls  representing  hydrogen  and  atoms. 
These  molecular  models  also  seemed  to  work  best  if  specific  sizes 
were  used  for  different  atoms,  but  there  is  no  obvious  reason  in  the 
planetary  model  why  the  radius  of  an  electron’s  orbit  should  be  a 
fixed  number.  In  view  of  the  conclusive  Marsden- Rutherford  results, 
however,  these  became  fresh  puzzles  in  atomic  physics,  not  reasons 
for  disbelieving  the  planetary  model. 

Some  phenomena  explained  with  the  planetary  model 

The  planetary  model  may  not  be  the  ultimate,  perfect  model  of 
the  atom,  but  don’t  underestimate  its  power.  It  already  allows  us 
to  visualize  correctly  a  great  many  phenomena. 

As  an  example,  let’s  consider  the  distinctions  among  nonmetals, 
metals  that  are  magnetic,  and  metals  that  are  nonmagnetic.  As 
shown  in  figure  i,  a  metal  differs  from  a  nonmetal  because  its  outer¬ 
most  electrons  are  free  to  wander  rather  than  owing  their  allegiance 
to  a  particular  atom.  A  metal  that  can  be  magnetized  is  one  that 
is  willing  to  line  up  the  rotations  of  some  of  its  electrons  so  that 
their  axes  are  parallel.  Recall  that  magnetic  forces  are  forces  made 
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by  moving  charges;  we  have  not  yet  discussed  the  mathematics  and 
geometry  of  magnetic  forces,  but  it  is  easy  to  see  how  random  ori¬ 
entations  of  the  atoms  in  the  nonmagnetic  substance  would  lead  to 
cancellation  of  the  forces. 

Even  if  the  planetary  model  does  not  immediately  answer  such 
questions  as  why  one  element  would  be  a  metal  and  another  a  non- 
metal,  these  ideas  would  be  difficult  or  impossible  to  conceptualize 
in  the  raisin  cookie  model. 

Discussion  Question 

A  In  reality,  charges  of  the  same  type  repel  one  another  and  charges 
of  different  types  are  attracted.  Suppose  the  rules  were  the  other  way 
around,  giving  repulsion  between  opposite  charges  and  attraction  be¬ 
tween  similar  ones.  What  would  the  universe  be  like? 
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8.2.3  Atomic  number 

As  alluded  to  in  a  discussion  question  in  the  previous  section, 
scientists  of  this  period  had  only  a  very  approximate  idea  of  how 
many  units  of  charge  resided  in  the  nuclei  of  the  various  chemical 
elements.  Although  we  now  associate  the  number  of  units  of  nu¬ 
clear  charge  with  the  element’s  position  on  the  periodic  table,  and 
call  it  the  atomic  number,  they  had  no  idea  that  such  a  relation¬ 
ship  existed.  Mendeleev’s  table  just  seemed  like  an  organizational 
tool,  not  something  with  any  necessary  physical  significance.  And 
everything  Mendeleev  had  done  seemed  equally  valid  if  you  turned 
the  table  upside-down  or  reversed  its  left  and  right  sides,  so  even  if 
you  wanted  to  number  the  elements  sequentially  with  integers,  there 
was  an  ambiguity  as  to  how  to  do  it.  Mendeleev’s  original  table  was 
in  fact  upside-down  compared  to  the  modern  one. 
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i/The  planetary  model  ap¬ 
plied  to  a  nonmetal,  1,  an 
unmagnetized  metal,  2,  and  a 
magnetized  metal,  3.  Note  that 
these  figures  are  all  simplified  in 
several  ways.  For  one  thing,  the 
electrons  of  an  individual  atom  do 
not  all  revolve  around  the  nucleus 
in  the  same  plane.  It  is  also  very 
unusual  for  a  metal  to  become  so 
strongly  magnetized  that  100% 
of  its  atoms  have  their  rotations 
aligned  as  shown  in  this  figure. 


j/A  modern  periodic  table,  labeled  with  atomic  numbers.  Mendeleev’s 
original  table  was  upside-down  compared  to  this  one. 


In  the  period  immediately  following  the  discovery  of  the  nucleus, 
physicists  only  had  rough  estimates  of  the  charges  of  the  various  nu- 
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clei.  In  the  case  of  the  very  lightest  nuclei,  they  simply  found  the 
maximum  number  of  electrons  they  could  strip  off  by  various  meth¬ 
ods:  chemical  reactions,  electric  sparks,  ultraviolet  light,  and  so  on. 
For  example  they  could  easily  strip  off  one  or  two  electrons  from 
helium,  making  He+  or  He++,  but  nobody  could  make  He+++,  pre¬ 
sumably  because  the  nuclear  charge  of  helium  was  only  +2e.  Unfor¬ 
tunately  only  a  few  of  the  lightest  elements  could  be  stripped  com¬ 
pletely,  because  the  more  electrons  were  stripped  off,  the  greater  the 
positive  net  charge  remaining,  and  the  more  strongly  the  rest  of  the 
negatively  charged  electrons  would  be  held  on.  The  heavy  elements’ 
atomic  numbers  could  only  be  roughly  extrapolated  from  the  light 
elements,  where  the  atomic  number  was  about  half  the  atom’s  mass 
expressed  in  units  of  the  mass  of  a  hydrogen  atom.  Gold,  for  ex¬ 
ample,  had  a  mass  about  197  times  that  of  hydrogen,  so  its  atomic 
number  was  estimated  to  be  about  half  that,  or  somewhere  around 
100.  We  now  know  it  to  be  79. 

How  did  we  finally  find  out?  The  riddle  of  the  nuclear  charges 
was  at  last  successfully  attacked  using  two  different  techniques, 
which  gave  consistent  results.  One  set  of  experiments,  involving 
x-rays,  was  performed  by  the  young  Henry  Mosely,  whose  scientific 
brilliance  was  soon  to  be  sacrificed  in  a  battle  between  European  im¬ 
perialists  over  who  would  own  the  Dardanelles,  during  that  pointless 
conflict  then  known  as  the  War  to  End  All  Wars,  and  now  referred 
to  as  World  War  I. 


k  /  An  alpha  particle  has  to  come 
much  closer  to  the  low-charged 
copper  nucleus  in  order  to  be  de¬ 
flected  through  the  same  angle. 


Since  Mosely’s  analysis  requires  several  concepts  with  which  you 
are  not  yet  familiar,  we  will  instead  describe  the  technique  used 
by  James  Chadwick  at  around  the  same  time.  An  added  bonus  of 
describing  Chadwick’s  experiments  is  that  they  presaged  the  impor¬ 
tant  modern  technique  of  studying  collisions  of  subatomic  particles. 
In  grad  school,  I  worked  with  a  professor  whose  thesis  adviser’s  the¬ 
sis  adviser  was  Chadwick,  and  he  related  some  interesting  stories 
about  the  man.  Chadwick  was  apparently  a  little  nutty  and  a  corn- 
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plete  fanatic  about  science,  to  the  extent  that  when  he  was  held  in  a 
German  prison  camp  during  World  War  II,  he  managed  to  cajole  his 
captors  into  allowing  him  to  scrounge  up  parts  from  broken  radios 
so  that  he  could  attempt  to  do  physics  experiments. 

Chadwick’s  experiment  worked  like  this.  Suppose  you  perform 
two  Rutherford-type  alpha  scattering  measurements,  first  one  with 
a  gold  foil  as  a  target  as  in  Rutherford’s  original  experiment,  and 
then  one  with  a  copper  foil.  It  is  possible  to  get  large  angles  of 
deflection  in  both  cases,  but  as  shown  in  figure  1,  the  alpha  particle 
must  be  heading  almost  straight  for  the  copper  nucleus  to  get  the 
same  angle  of  deflection  that  would  have  occurred  with  an  alpha 
that  was  much  farther  off  the  mark;  the  gold  nucleus’  charge  is  so 
much  greater  than  the  copper’s  that  it  exerts  a  strong  force  on  the 
alpha  particle  even  from  far  off.  The  situation  is  very  much  like 
that  of  a  blindfolded  person  playing  darts.  Just  as  it  is  impossible 
to  aim  an  alpha  particle  at  an  individual  nucleus  in  the  target,  the 
blindfolded  person  cannot  really  aim  the  darts.  Achieving  a  very 
close  encounter  with  the  copper  atom  would  be  akin  to  hitting  an 
inner  circle  on  the  dartboard.  It’s  much  more  likely  that  one  would 
have  the  luck  to  hit  the  outer  circle,  which  covers  a  greater  number 
of  square  inches.  By  analogy,  if  you  measure  the  frequency  with 
which  alphas  are  scattered  by  copper  at  some  particular  angle,  say 
between  19  and  20  degrees,  and  then  perform  the  same  measurement 
at  the  same  angle  with  gold,  you  get  a  much  higher  percentage  for 
gold  than  for  copper. 


I /An  alpha  particle  must  be 
headed  for  the  ring  on  the  front 
of  the  imaginary  cylindrical  pipe 
in  order  to  produce  scattering  at 
an  angle  between  19  and  20  de¬ 
grees.  The  area  of  this  ring 
is  called  the  “cross-section”  for 
scattering  at  19-20°because  it  is 
the  cross-sectional  area  of  a  cut 
through  the  pipe. 


In  fact,  the  numerical  ratio  of  the  two  nuclei’s  charges  can  be 
derived  from  this  same  experimentally  determined  ratio.  Using  the 
standard  notation  Z  for  the  atomic  number  (charge  of  the  nucleus 
divided  by  e),  the  following  equation  can  be  proved  (example  6): 


<7 2 

^ copper 


number  of  alphas  scattered  by  gold  at  19-20° 
number  of  alphas  scattered  by  copper  at  19-20° 


By  making  such  measurements  for  targets  constructed  from  all  the 
elements,  one  can  infer  the  ratios  of  all  the  atomic  numbers,  and 
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since  the  atomic  numbers  of  the  light  elements  were  already  known, 
atomic  numbers  could  be  assigned  to  the  entire  periodic  table.  Ac¬ 
cording  to  Mosely,  the  atomic  numbers  of  copper,  silver  and  plat¬ 
inum  were  29,  47,  and  78,  which  corresponded  well  with  their  posi¬ 
tions  on  the  periodic  table.  Chadwick’s  figures  for  the  same  elements 
were  29.3,  46.3,  and  77.4,  with  error  bars  of  about  1.5  times  the  fun¬ 
damental  charge,  so  the  two  experiments  were  in  good  agreement. 

The  point  here  is  absolutely  not  that  you  should  be  ready  to  plug 
numbers  into  the  above  equation  for  a  homework  or  exam  question! 
My  overall  goal  in  this  chapter  is  to  explain  how  we  know  what  we 
know  about  atoms.  An  added  bonus  of  describing  Chadwick’s  ex¬ 
periment  is  that  the  approach  is  very  similar  to  that  used  in  modern 
particle  physics  experiments,  and  the  ideas  used  in  the  analysis  are 
closely  related  to  the  now-ubiquitous  concept  of  a  “cross-section.” 
In  the  dartboard  analogy,  the  cross-section  would  be  the  area  of  the 
circular  ring  you  have  to  hit.  The  reasoning  behind  the  invention  of 
the  term  “cross-section”  can  be  visualized  as  shown  in  figure  1.  In 
this  language,  Rutherford’s  invention  of  the  planetary  model  came 
from  his  unexpected  discovery  that  there  was  a  nonzero  cross-section 
for  alpha  scattering  from  gold  at  large  angles,  and  Chadwick  con¬ 
firmed  Mosely’s  determinations  of  the  atomic  numbers  by  measuring 
cross-sections  for  alpha  scattering. 

Proof  of  the  relationship  between  Z  and  scattering  example  6 
The  equation  above  can  be  derived  by  the  following  not  very  rigor¬ 
ous  proof.  To  deflect  the  alpha  particle  by  a  certain  angle  requires 
that  it  acquire  a  certain  momentum  component  in  the  direction 
perpendicular  to  its  original  momentum.  Although  the  nucleus’s 
force  on  the  alpha  particle  is  not  constant,  we  can  pretend  that 
it  is  approximately  constant  during  the  time  when  the  alpha  is 
within  a  distance  equal  to,  say,  150%  of  its  distance  of  closest 
approach,  and  that  the  force  is  zero  before  and  after  that  part  of 
the  motion.  (If  we  chose  120%  or  200%,  it  shouldn’t  make  any 
difference  in  the  final  result,  because  the  final  result  is  a  ratio, 
and  the  effects  on  the  numerator  and  denominator  should  cancel 
each  other.)  In  the  approximation  of  constant  force,  the  change 
in  the  alpha’s  perpendicular  momentum  component  is  then  equal 
to  FAt.  The  Coulomb  force  law  says  the  force  is  proportional  to 
Z/r2.  Although  r  does  change  somewhat  during  the  time  interval 
of  interest,  it’s  good  enough  to  treat  it  as  a  constant  number,  since 
we’re  only  computing  the  ratio  between  the  two  experiments’  re¬ 
sults.  Since  we  are  approximating  the  force  as  acting  over  the 
time  during  which  the  distance  is  not  too  much  greater  than  the 
distance  of  closest  approach,  the  time  interval  At  must  be  propor¬ 
tional  to  r,  and  the  sideways  momentum  imparted  to  the  alpha, 
FAt,  is  proportional  to  ( Z/r2)r ,  or  Z/r.  If  we’re  comparing  alphas 
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scattered  at  the  same  angle  from  gold  and  from  copper,  then  A p 
is  the  same  in  both  cases,  and  the  proportionality  Ap  oc  Z/r  tells 
us  that  the  ones  scattered  from  copper  at  that  angle  had  to  be 
headed  in  along  a  line  closer  to  the  central  axis  by  a  factor  equal¬ 
ing  Zgoid /^copper-  If  you  imagine  a  “dartboard  ring”  that  the  alphas 
have  to  hit,  then  the  ring  for  the  gold  experiment  has  the  same 
proportions  as  the  one  for  copper,  but  it  is  enlarged  by  a  factor 
equal  to  Zg0|d/ZC0pper-  That  is,  not  only  is  the  radius  of  the  ring 
greater  by  that  factor,  but  unlike  the  rings  on  a  normal  dartboard, 
the  thickness  of  the  outer  ring  is  also  greater  in  proportion  to  its 
radius.  When  you  take  a  geometric  shape  and  scale  it  up  in  size 
like  a  photographic  enlargement,  its  area  is  increased  in  propor¬ 
tion  to  the  square  of  the  enlargement  factor,  so  the  area  of  the 
dartboard  ring  in  the  gold  experiment  is  greater  by  a  factor  equal 
to  (Zg0|d/ZCopper)2.  Since  the  alphas  are  aimed  entirely  randomly, 
the  chances  of  an  alpha  hitting  the  ring  are  in  proportion  to  the 
area  of  the  ring,  which  proves  the  equation  given  above. 

As  an  example  of  the  modern  use  of  scattering  experiments  and 
cross-section  measurements,  you  may  have  heard  of  the  recent  ex¬ 
perimental  evidence  for  the  existence  of  a  particle  called  the  top 
quark.  Of  the  twelve  subatomic  particles  currently  believed  to  be  the 
smallest  constituents  of  matter,  six  form  a  family  called  the  quarks, 
distinguished  from  the  other  six  by  the  intense  attractive  forces  that 
make  the  quarks  stick  to  each  other.  (The  other  six  consist  of  the 
electron  plus  five  other,  more  exotic  particles.)  The  only  two  types  of 
quarks  found  in  naturally  occurring  matter  are  the  “up  quark”  and 
“down  quark,”  which  are  what  protons  and  neutrons  are  made  of, 
but  four  other  types  were  theoretically  predicted  to  exist,  for  a  total 
of  six.  (The  whimsical  term  “quark”  comes  from  a  line  by  James 
Joyce  reading  “Three  quarks  for  master  Mark.”)  Until  recently,  only 
five  types  of  quarks  had  been  proven  to  exist  via  experiments,  and 
the  sixth,  the  top  quark,  was  only  theorized.  There  was  no  hope 
of  ever  detecting  a  top  quark  directly,  since  it  is  radioactive,  and 
only  exists  for  a  zillionth  of  a  second  before  evaporating.  Instead, 
the  researchers  searching  for  it  at  the  Fermi  National  Accelerator 
Laboratory  near  Chicago  measured  cross-sections  for  scattering  of 
nuclei  off  of  other  nuclei.  The  experiment  was  much  like  those  of 
Rutherford  and  Chadwick,  except  that  the  incoming  nuclei  had  to 
be  boosted  to  much  higher  speeds  in  a  particle  accelerator.  The 
resulting  encounter  with  a  target  nucleus  was  so  violent  that  both 
nuclei  were  completely  demolished,  but,  as  Einstein  proved,  energy 
can  be  converted  into  matter,  and  the  energy  of  the  collision  creates 
a  spray  of  exotic,  radioactive  particles,  like  the  deadly  shower  of 
wood  fragments  produced  by  a  cannon  ball  in  an  old  naval  battle. 
Among  those  particles  were  some  top  quarks.  The  cross-sections 
being  measured  were  the  cross-sections  for  the  production  of  certain 
combinations  of  these  secondary  particles.  However  different  the 
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details,  the  principle  was  the  same  as  that  employed  at  the  turn  of 
the  century:  you  smash  things  together  and  look  at  the  fragments 
that  fly  off  to  see  what  was  inside  them.  The  approach  has  been 
compared  to  shooting  a  clock  with  a  rifle  and  then  studying  the 
pieces  that  fly  off  to  figure  out  how  the  clock  worked. 

Discussion  Questions 

A  The  diagram,  showing  alpha  particles  being  deflected  by  a  gold 
nucleus,  was  drawn  with  the  assumption  that  alpha  particles  came  in  on 
lines  at  many  different  distances  from  the  nucleus.  Why  wouldn’t  they  all 
come  in  along  the  same  line,  since  they  all  came  out  through  the  same 
tube? 

B  Why  does  it  make  sense  that,  as  shown  in  the  figure,  the  trajectories 
that  result  in  19°  and  20°  scattering  cross  each  other? 

C  Rutherford  knew  the  velocity  of  the  alpha  particles  emitted  by  radium, 
and  guessed  that  the  positively  charged  part  of  a  gold  atom  had  a  charge 
of  about  +1  OOe  (we  now  know  it  is  +79e).  Considering  the  fact  that  some 
alpha  particles  were  deflected  by  180°,  how  could  he  then  use  conserva¬ 
tion  of  energy  to  derive  an  upper  limit  on  the  size  of  a  gold  nucleus?  (For 
simplicity,  assume  the  size  of  the  alpha  particle  is  negligible  compared  to 
that  of  the  gold  nucleus,  and  ignore  the  fact  that  the  gold  nucleus  recoils 
a  little  from  the  collision,  picking  up  a  little  kinetic  energy.) 

8.2.4  The  structure  of  nuclei 

The  proton 

The  fact  that  the  nuclear  charges  were  all  integer  multiples  of  e 
suggested  to  many  physicists  that  rather  than  being  a  pointlike  ob¬ 
ject,  the  nucleus  might  contain  smaller  particles  having  individual 
charges  of  +e.  Evidence  in  favor  of  this  idea  was  not  long  in  arriv¬ 
ing.  Rutherford  reasoned  that  if  he  bombarded  the  atoms  of  a  very 
light  element  with  alpha  particles,  the  small  charge  of  the  target 
nuclei  would  give  a  very  weak  repulsion.  Perhaps  those  few  alpha 
particles  that  happened  to  arrive  on  head-on  collision  courses  would 
get  so  close  that  they  would  physically  crash  into  some  of  the  target 
nuclei.  An  alpha  particle  is  itself  a  nucleus,  so  this  would  be  a  col¬ 
lision  between  two  nuclei,  and  a  violent  one  due  to  the  high  speeds 
involved.  Rutherford  hit  pay  dirt  in  an  experiment  with  alpha  par¬ 
ticles  striking  a  target  containing  nitrogen  atoms.  Charged  particles 
were  detected  flying  out  of  the  target  like  parts  flying  off  of  cars  in 
a  high-speed  crash.  Measurements  of  the  deflection  of  these  parti¬ 
cles  in  electric  and  magnetic  fields  showed  that  they  had  the  same 
charge-to-mass  ratio  as  singly-ionized  hydrogen  atoms.  Rutherford 
concluded  that  these  were  the  conjectured  singly-charged  particles 
that  held  the  charge  of  the  nucleus,  and  they  were  later  named 
protons.  The  hydrogen  nucleus  consists  of  a  single  proton,  and  in 
general,  an  element’s  atomic  number  gives  the  number  of  protons 
contained  in  each  of  its  nuclei.  The  mass  of  the  proton  is  about  1800 
times  greater  than  the  mass  of  the  electron. 


492 


Chapter  8  Atoms  and  Electromagnetism 


The  neutron 

It  would  have  been  nice  and  simple  if  all  the  nuclei  could  have 
been  built  only  from  protons,  but  that  couldn’t  be  the  case.  If  you 
spend  a  little  time  looking  at  a  periodic  table,  you  will  soon  notice 
that  although  some  of  the  atomic  masses  are  very  nearly  integer 
multiples  of  hydrogen’s  mass,  many  others  are  not.  Even  where  the 
masses  are  close  whole  numbers,  the  masses  of  an  element  other 
than  hydrogen  is  always  greater  than  its  atomic  number,  not  equal 
to  it.  Helium,  for  instance,  has  two  protons,  but  its  mass  is  four 
times  greater  than  that  of  hydrogen. 

Chadwick  cleared  up  the  confusion  by  proving  the  existence  of 
a  new  subatomic  particle.  Unlike  the  electron  and  proton,  which 
are  electrically  charged,  this  particle  is  electrically  neutral,  and  he 
named  it  the  neutron.  Chadwick’s  experiment  has  been  described 
in  detail  on  p.  138,  but  briefly  the  method  was  to  expose  a  sample 
of  the  light  element  beryllium  to  a  stream  of  alpha  particles  from  a 
lump  of  radium.  Beryllium  has  only  four  protons,  so  an  alpha  that 
happens  to  be  aimed  directly  at  a  beryllium  nucleus  can  actually  hit 
it  rather  than  being  stopped  short  of  a  collision  by  electrical  repul¬ 
sion.  Neutrons  were  observed  as  a  new  form  of  radiation  emerging 
from  the  collisions,  and  Chadwick  correctly  inferred  that  they  were 
previously  unsuspected  components  of  the  nucleus  that  had  been 
knocked  out.  As  described  earlier,  Chadwick  also  determined  the 
mass  of  the  neutron;  it  is  very  nearly  the  same  as  that  of  the  pro¬ 
ton. 


To  summarize,  atoms  are  made  of  three  types  of  particles: 


charge 

mass  in  units  of 
the  proton’s  mass 

location  in  atom 

proton 

+e 

1 

in  nucleus 

neutron 

0 

1.001 

in  nucleus 

electron 

— e 

1/1836 

orbiting  nucleus 

The  existence  of  neutrons  explained  the  mysterious  masses  of 
the  elements.  Helium,  for  instance,  has  a  mass  very  close  to  four 
times  greater  than  that  of  hydrogen.  This  is  because  it  contains 
two  neutrons  in  addition  to  its  two  protons.  The  mass  of  an  atom  is 
essentially  determined  by  the  total  number  of  neutrons  and  protons. 
The  total  number  of  neutrons  plus  protons  is  therefore  referred  to 
as  the  atom’s  mass  number. 


1ST 


m  /  Examples  of  the  construction 
of  atoms:  hydrogen  (top)  and 
helium  (bottom).  On  this  scale, 
the  electrons’  orbits  would  be  the 
size  of  a  college  campus. 


Isotopes 

We  now  have  a  clear  interpretation  of  the  fact  that  helium  is 
close  to  four  times  more  massive  than  hydrogen,  and  similarly  for 
all  the  atomic  masses  that  are  close  to  an  integer  multiple  of  the 
mass  of  hydrogen.  But  what  about  copper,  for  instance,  which  had 
an  atomic  mass  63.5  times  that  of  hydrogen?  It  didn’t  seem  rea¬ 
sonable  to  think  that  it  possessed  an  extra  half  of  a  neutron!  The 
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/  extra  charged  plates 
■  or  magnets  for  de¬ 


*»  fleeting  the  beam 

*  * 

*  * 

-  -  - 

r  — 

1 _ 
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+ 

+ 

+ 

+ 

+ 

oven 


vacuum  chamber 

n/A  version  of  the  Thomson 
apparatus  modified  for  measuring 
the  mass-to-charge  ratios  of 
ions  rather  than  electrons.  A 
small  sample  of  the  element  in 
question,  copper  in  our  example, 
is  boiled  in  the  oven  to  create 
a  thin  vapor.  (A  vacuum  pump 
is  continuously  sucking  on  the 
main  chamber  to  keep  it  from 
accumulating  enough  gas  to  stop 
the  beam  of  ions.)  Some  of  the 
atoms  of  the  vapor  are  ionized  by 
a  spark  or  by  ultraviolet  light.  Ions 
that  wander  out  of  the  nozzle 
and  into  the  region  between 
the  charged  plates  are  then 
accelerated  toward  the  top  of  the 
figure.  As  in  the  Thomson  experi¬ 
ment,  mass-to-charge  ratios  are 
inferred  from  the  deflection  of  the 
beam. 


solution  was  found  by  measuring  the  mass-to-charge  ratios  of  singly- 
ionized  atoms  (atoms  with  one  electron  removed).  The  technique 
is  essentially  that  same  as  the  one  used  by  Thomson  for  cathode 
rays,  except  that  whole  atoms  do  not  spontaneously  leap  out  of  the 
surface  of  an  object  as  electrons  sometimes  do.  Figure  n  shows  an 
example  of  how  the  ions  can  be  created  and  injected  between  the 
charged  plates  for  acceleration. 

Injecting  a  stream  of  copper  ions  into  the  device,  we  find  a  sur¬ 
prise  —  the  beam  splits  into  two  parts!  Chemists  had  elevated  to 
dogma  the  assumption  that  all  the  atoms  of  a  given  element  were 
identical,  but  we  find  that  69%  of  copper  atoms  have  one  mass,  and 
31%  have  another.  Not  only  that,  but  both  masses  are  very  nearly 
integer  multiples  of  the  mass  of  hydrogen  (63  and  65,  respectively). 
Copper  gets  its  chemical  identity  from  the  number  of  protons  in  its 
nucleus,  29,  since  chemical  reactions  work  by  electric  forces.  But 
apparently  some  copper  atoms  have  63  —  29  =  34  neutrons  while 
others  have  65  —  29  =  36.  The  atomic  mass  of  copper,  63.5,  reflects 
the  proportions  of  the  mixture  of  the  nrass-63  and  nrass-65  varieties. 
The  different  mass  varieties  of  a  given  element  are  called  isotopes  of 
that  element. 

Isotopes  can  be  named  by  giving  the  mass  number  as  a  subscript 
to  the  left  of  the  chemical  symbol,  e.g.,  65Cu.  Examples: 


protons 

neutrons 

mass  number 

'H 

1 

0 

0+1  =  1 

4He 

2 

2 

2+2  =  4 

6 

6 

6+6  =  12 

6 

8 

6+8  =  14 

262Ha 

105 

157 

105+157  =  262 

self-check  D 

Why  are  the  positive  and  negative  charges  of  the  accelerating  plates 
reversed  in  the  isotope-separating  apparatus  compared  to  the  Thomson 
apparatus?  >  Answer,  p.  965 

Chemical  reactions  are  all  about  the  exchange  and  sharing  of 
electrons:  the  nuclei  have  to  sit  out  this  dance  because  the  forces 
of  electrical  repulsion  prevent  them  from  ever  getting  close  enough 
to  make  contact  with  each  other.  Although  the  protons  do  have  a 
vitally  important  effect  on  chemical  processes  because  of  their  elec¬ 
trical  forces,  the  neutrons  can  have  no  effect  on  the  atom’s  chemical 
reactions.  It  is  not  possible,  for  instance,  to  separate  63Cu  from  65Cu 
by  chemical  reactions.  This  is  why  chemists  had  never  realized  that 
different  isotopes  existed.  (To  be  perfectly  accurate,  different  iso¬ 
topes  do  behave  slightly  differently  because  the  more  massive  atoms 
move  more  sluggishly  and  therefore  react  with  a  tiny  bit  less  inten¬ 
sity.  This  tiny  difference  is  used,  for  instance,  to  separate  out  the 
isotopes  of  uranium  needed  to  build  a  nuclear  bomb.  The  smallness 
of  this  effect  makes  the  separation  process  a  slow  and  difficult  one, 
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which  is  what  we  have  to  thank  for  the  fact  that  nuclear  weapons 
have  not  been  built  by  every  terrorist  cabal  on  the  planet.) 

Sizes  and  shapes  of  nuclei 

Matter  is  nearly  all  nuclei  if  you  count  by  weight,  but  in  terms 
of  volume  nuclei  don’t  amount  to  much.  The  radius  of  an  individual 
neutron  or  proton  is  very  close  to  1  fm  (1  fm=10-15  m),  so  even  a  big 
lead  nucleus  with  a  mass  number  of  208  still  has  a  diameter  of  only 
about  13  fm,  which  is  ten  thousand  times  smaller  than  the  diameter 
of  a  typical  atom.  Contrary  to  the  usual  imagery  of  the  nucleus  as  a 
small  sphere,  it  turns  out  that  many  nuclei  are  somewhat  elongated, 
like  an  American  football,  and  a  few  have  exotic  asymmetric  shapes 
like  pears  or  kiwi  fruits. 

Discussion  Questions 

A  Suppose  the  entire  universe  was  in  a  (very  large)  cereal  box,  and 
the  nutritional  labeling  was  supposed  to  tell  a  godlike  consumer  what  per¬ 
centage  of  the  contents  was  nuclei.  Roughly  what  would  the  percentage 
be  like  if  the  labeling  was  according  to  mass?  What  if  it  was  by  volume? 


o  /  A  nuclear  power  plant  at  Cat- 
tenom,  France.  Unlike  the  coal 
and  oil  plants  that  supply  most 
of  the  U.S.’s  electrical  power,  a 
nuclear  power  plant  like  this  one 
releases  no  pollution  or  green¬ 
house  gases  into  the  Earth’s  at¬ 
mosphere,  and  therefore  doesn’t 
contribute  to  global  warming.  The 
white  stuff  puffing  out  of  this 
plant  is  non-radioactive  water  va¬ 
por.  Although  nuclear  power 
plants  generate  long-lived  nuclear 
waste,  this  waste  arguably  poses 
much  less  of  a  threat  to  the  bio¬ 
sphere  than  greenhouse  gases 
would. 


8.2.5  The  strong  nuclear  force,  alpha  decay  and  fission 

Once  physicists  realized  that  nuclei  consisted  of  positively  charged 
protons  and  uncharged  neutrons,  they  had  a  problem  on  their  hands. 
The  electrical  forces  among  the  protons  are  all  repulsive,  so  the  nu¬ 
cleus  should  simply  fly  apart!  The  reason  all  the  nuclei  in  your 
body  are  not  spontaneously  exploding  at  this  moment  is  that  there 
is  another  force  acting.  This  force,  called  the  strong  nuclear  force,  is 
always  attractive,  and  acts  between  neutrons  and  neutrons,  neutrons 
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p  /  The  strong  nuclear  force 
cuts  off  very  sharply  at  a  range  of 
about  1  fm. 


and  protons,  and  protons  and  protons  with  roughly  equal  strength. 
The  strong  nuclear  force  does  not  have  any  effect  on  electrons,  which 
is  why  it  does  not  influence  chemical  reactions. 

Unlike  electric  forces,  whose  strengths  are  given  by  the  simple 
Coulomb  force  law,  there  is  no  simple  formula  for  how  the  strong 
nuclear  force  depends  on  distance.  Roughly  speaking,  it  is  effec¬ 
tive  over  ranges  of  ~  1  fm,  but  falls  off  extremely  quickly  at  larger 
distances  (much  faster  than  1/r2).  Since  the  radius  of  a  neutron  or 
proton  is  about  1  fm,  that  means  that  when  a  bunch  of  neutrons  and 
protons  are  packed  together  to  form  a  nucleus,  the  strong  nuclear 
force  is  effective  only  between  neighbors. 


q/1.  The  forces  cancel.  2.  The  forces  don’t  cancel.  3.  In  a 
heavy  nucleus,  the  large  number  of  electrical  repulsions  can  add  up  to 
a  force  that  is  comparable  to  the  strong  nuclear  attraction.  4.  Alpha 
emission.  5.  Fission. 


Figure  q  illustrates  how  the  strong  nuclear  force  acts  to  keep 
ordinary  nuclei  together,  but  is  not  able  to  keep  very  heavy  nuclei 
from  breaking  apart.  In  q/1,  a  proton  in  the  middle  of  a  carbon 
nucleus  feels  an  attractive  strong  nuclear  force  (arrows)  from  each 
of  its  nearest  neighbors.  The  forces  are  all  in  different  directions, 
and  tend  to  cancel  out.  The  same  is  true  for  the  repulsive  electrical 
forces  (not  shown).  In  figure  q/2,  a  proton  at  the  edge  of  the  nucleus 
has  neighbors  only  on  one  side,  and  therefore  all  the  strong  nuclear 
forces  acting  on  it  are  tending  to  pull  it  back  in.  Although  all  the 
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electrical  forces  from  the  other  five  protons  (dark  arrows)  are  all 
pushing  it  out  of  the  nucleus,  they  are  not  sufficient  to  overcome 
the  strong  nuclear  forces. 

In  a  very  heavy  nucleus,  q/3,  a  proton  that  finds  itself  near  the 
edge  has  only  a  few  neighbors  close  enough  to  attract  it  significantly 
via  the  strong  nuclear  force,  but  every  other  proton  in  the  nucleus 
exerts  a  repulsive  electrical  force  on  it.  If  the  nucleus  is  large  enough, 
the  total  electrical  repulsion  may  be  sufficient  to  overcome  the  at¬ 
traction  of  the  strong  force,  and  the  nucleus  may  spit  out  a  proton. 
Proton  emission  is  fairly  rare,  however;  a  more  common  type  of  ra¬ 
dioactive  decay1  in  heavy  nuclei  is  alpha  decay,  shown  in  q/4.  The 
imbalance  of  the  forces  is  similar,  but  the  chunk  that  is  ejected  is  an 
alpha  particle  (two  protons  and  two  neutrons)  rather  than  a  single 
proton. 

It  is  also  possible  for  the  nucleus  to  split  into  two  pieces  of 
roughly  equal  size,  q/5,  a  process  known  as  fission.  Note  that  in 
addition  to  the  two  large  fragments,  there  is  a  spray  of  individual 
neutrons.  In  a  nuclear  fission  bomb  or  a  nuclear  fission  reactor, 
some  of  these  neutrons  fly  off  and  hit  other  nuclei,  causing  them  to 
undergo  fission  as  well.  The  result  is  a  chain  reaction. 

When  a  nucleus  is  able  to  undergo  one  of  these  processes,  it  is 
said  to  be  radioactive,  and  to  undergo  radioactive  decay.  Some  of 
the  naturally  occurring  nuclei  on  earth  are  radioactive.  The  term 
“radioactive”  comes  from  Becquerel’s  image  of  rays  radiating  out 
from  something,  not  from  radio  waves,  which  are  a  whole  differ¬ 
ent  phenomenon.  The  term  “decay”  can  also  be  a  little  misleading, 
since  it  implies  that  the  nucleus  turns  to  dust  or  simply  disappears 
-  actually  it  is  splitting  into  two  new  nuclei  with  the  same  total 
number  of  neutrons  and  protons,  so  the  term  “radioactive  transfor¬ 
mation”  would  have  been  more  appropriate.  Although  the  original 
atom’s  electrons  are  mere  spectators  in  the  process  of  weak  radioac¬ 
tive  decay,  we  often  speak  loosely  of  “radioactive  atoms”  rather  than 
“radioactive  nuclei.” 

Randomness  in  physics 

How  does  an  atom  decide  when  to  decay?  We  might  imagine 
that  it  is  like  a  termite-infested  house  that  gets  weaker  and  weaker, 
until  finally  it  reaches  the  day  on  which  it  is  destined  to  fall  apart. 
Experiments,  however,  have  not  succeeded  in  detecting  such  “tick¬ 
ing  clock”  hidden  below  the  surface;  the  evidence  is  that  all  atoms 
of  a  given  isotope  are  absolutely  identical.  Why,  then,  would  one 
uranium  atom  decay  today  while  another  lives  for  another  million 
years?  The  answer  appears  to  be  that  it  is  entirely  random.  We 
can  make  general  statements  about  the  average  time  required  for  a 


1  Alpha  decay  is  more  common  because  an  alpha  particle  happens  to  be  a 
very  stable  arrangement  of  protons  and  neutrons. 
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certain  isotope  to  decay,  or  how  long  it  will  take  for  half  the  atoms 
in  a  sample  to  decay  (its  half-life),  but  we  can  never  predict  the 
behavior  of  a  particular  atom. 

This  is  the  first  example  we  have  encountered  of  an  inescapable 
randomness  in  the  laws  of  physics.  If  this  kind  of  randomness  makes 
you  uneasy,  you’re  in  good  company.  Einstein’s  famous  quote  is 
“...I  am  convinced  that  He  [God]  does  not  play  dice.11  Einstein’s 
distaste  for  randomness,  and  his  association  of  determinism  with 
divinity,  goes  back  to  the  Enlightenment  conception  of  the  universe 
as  a  gigantic  piece  of  clockwork  that  only  had  to  be  set  in  motion 
initially  by  the  Builder.  Physics  had  to  be  entirely  rebuilt  in  the  20th 
century  to  incorporate  the  fundamental  randomness  of  physics,  and 
this  modern  revolution  is  the  topic  of  chapter  13.  In  particular, 
we  will  delay  the  mathematical  development  of  the  half-life  concept 
until  then. 

8.2.6  The  weak  nuclear  force;  beta  decay 

All  the  nuclear  processes  we’ve  discussed  so  far  have  involved 
rearrangements  of  neutrons  and  protons,  with  no  change  in  the  total 
number  of  neutrons  or  the  total  number  of  protons.  Now  consider 
the  proportions  of  neutrons  and  protons  in  your  body  and  in  the 
planet  earth:  neutrons  and  protons  are  roughly  equally  numerous 
in  your  body’s  carbon  and  oxygen  nuclei,  and  also  in  the  nickel  and 
iron  that  make  up  most  of  the  earth.  The  proportions  are  about 
50-50.  But,  as  discussed  in  more  detail  on  p.  508,  the  only  chemical 
elements  produced  in  any  significant  quantities  by  the  big  bang2 
were  hydrogen  (about  90%)  and  helium  (about  10%).  If  the  early 
universe  was  almost  nothing  but  hydrogen  atoms,  whose  nuclei  are 
protons,  where  did  all  those  neutrons  come  from? 

The  answer  is  that  there  is  another  nuclear  force,  the  weak  nu¬ 
clear  force,  that  is  capable  of  transforming  neutrons  into  protons 
and  vice-versa.  Two  possible  reactions  are 

n  — >  p  +  e“  +  v  [electron  decay] 


and 

p  — )•  n  +  e+  +  v.  [positron  decay] 

(There  is  also  a  third  type  called  electron  capture,  in  which  a  proton 
grabs  one  of  the  atom’s  electrons  and  they  produce  a  neutron  and 
a  neutrino.) 

Whereas  alpha  decay  and  fission  are  just  a  redivision  of  the  pre¬ 
viously  existing  particles,  these  reactions  involve  the  destruction  of 
one  particle  and  the  creation  of  three  new  particles  that  did  not 
exist  before. 

2The  evidence  for  the  big  bang  theory  of  the  origin  of  the  universe  was  dis¬ 
cussed  on  p.  360. 
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There  are  three  new  particles  here  that  you  have  never  previ¬ 
ously  encountered.  The  symbol  e+  stands  for  an  antielectron,  which 
is  a  particle  just  like  the  electron  in  every  way,  except  that  its  elec¬ 
tric  charge  is  positive  rather  than  negative.  Antielectrons  are  also 
known  as  positrons.  Nobody  knows  why  electrons  are  so  common  in 
the  universe  and  antielectrons  are  scarce.  When  an  antielectron  en¬ 
counters  an  electron,  they  annihilate  each  other,  producing  gamma 
rays,  and  this  is  the  fate  of  all  the  antielectrons  that  are  produced 
by  natural  radioactivity  on  earth.  Antielectrons  are  an  example  of 
antimatter.  A  complete  atom  of  antimatter  would  consist  of  antipro¬ 
tons,  antielectrons,  and  antineutrons.  Although  individual  particles 
of  antimatter  occur  commonly  in  nature  due  to  natural  radioactivity 
and  cosmic  rays,  only  a  few  complete  atoms  of  antihydrogen  have 
ever  been  produced  artificially. 

The  notation  v  stands  for  a  particle  called  a  neutrino,  and  v 
means  an  antineutrino.  Neutrinos  and  antineutrinos  have  no  electric 
charge  (hence  the  name). 

We  can  now  list  all  four  of  the  known  fundamental  forces  of 
physics: 

•  gravity 

•  electromagnetism 

•  strong  nuclear  force 

•  weak  nuclear  force 

The  other  forces  we  have  learned  about,  such  as  friction  and  the 
normal  force,  all  arise  from  electromagnetic  interactions  between 
atoms,  and  therefore  are  not  considered  to  be  fundamental  forces  of 
physics. 

Decay  of 212  Pb  example  7 

As  an  example,  consider  the  radioactive  isotope  of  lead  212Pb.  It 
contains  82  protons  and  130  neutrons.  It  decays  by  the  process 
n  ->  p  +  e  +  v  .  The  newly  created  proton  is  held  inside  the 
nucleus  by  the  strong  nuclear  force,  so  the  new  nucleus  contains 
83  protons  and  129  neutrons.  Having  83  protons  makes  it  the 
element  bismuth,  so  it  will  be  an  atom  of  212Bi. 

In  a  reaction  like  this  one,  the  electron  flies  off  at  high  speed 
(typically  close  to  the  speed  of  light),  and  the  escaping  electrons 
are  the  things  that  make  large  amounts  of  this  type  of  radioactivity 
dangerous.  The  outgoing  electron  was  the  first  thing  that  tipped 
off  scientists  in  the  early  1900s  to  the  existence  of  this  type  of  ra¬ 
dioactivity.  Since  they  didn’t  know  that  the  outgoing  particles  were 
electrons,  they  called  them  beta  particles,  and  this  type  of  radioac¬ 
tive  decay  was  therefore  known  as  beta  decay.  A  clearer  but  less 
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common  terminology  is  to  call  the  two  processes  electron  decay  and 
positron  decay. 

The  neutrino  or  antineutrino  emitted  in  such  a  reaction  pretty 
much  ignores  all  matter,  because  its  lack  of  charge  makes  it  immune 
to  electrical  forces,  and  it  also  remains  aloof  from  strong  nuclear 
interactions.  Even  if  it  happens  to  fly  off  going  straight  down,  it 
is  almost  certain  to  make  it  through  the  entire  earth  without  in¬ 
teracting  with  any  atoms  in  any  way.  It  ends  up  flying  through 
outer  space  forever.  The  neutrino’s  behavior  makes  it  exceedingly 
difficult  to  detect,  and  when  beta  decay  was  first  discovered  nobody 
realized  that  neutrinos  even  existed.  We  now  know  that  the  neu¬ 
trino  carries  off  some  of  the  energy  produced  in  the  reaction,  but  at 
the  time  it  seemed  that  the  total  energy  afterwards  (not  counting 
the  unsuspected  neutrino’s  energy)  was  greater  than  the  total  en¬ 
ergy  before  the  reaction,  violating  conservation  of  energy.  Physicists 
were  getting  ready  to  throw  conservation  of  energy  out  the  window 
as  a  basic  law  of  physics  when  indirect  evidence  led  them  to  the 
conclusion  that  neutrinos  existed. 

Discussion  Questions 

A  In  the  reactions  n  ->  p  +  e~  +  v  and  p  ->  n  +  e+  +  v,  verify  that 
charge  is  conserved.  In  beta  decay,  when  one  of  these  reactions  happens 
to  a  neutron  or  proton  within  a  nucleus,  one  or  more  gamma  rays  may 
also  be  emitted.  Does  this  affect  conservation  of  charge?  Would  it  be 
possible  for  some  extra  electrons  to  be  released  without  violating  charge 
conservation? 

B  When  an  antielectron  and  an  electron  annihilate  each  other,  they 
produce  two  gamma  rays.  Is  charge  conserved  in  this  reaction? 


8.2.7  Fusion 

As  we  have  seen,  heavy  nuclei  tend  to  fly  apart  because  each 
proton  is  being  repelled  by  every  other  proton  in  the  nucleus,  but  is 
only  attracted  by  its  nearest  neighbors.  The  nucleus  splits  up  into 
two  parts,  and  as  soon  as  those  two  parts  are  more  than  about  1  fm 
apart,  the  strong  nuclear  force  no  longer  causes  the  two  fragments  to 
attract  each  other.  The  electrical  repulsion  then  accelerates  them, 
causing  them  to  gain  a  large  amount  of  kinetic  energy.  This  release 
of  kinetic  energy  is  what  powers  nuclear  reactors  and  fission  bombs. 

It  might  seem,  then,  that  the  lightest  nuclei  would  be  the  most 
stable,  but  that  is  not  the  case.  Let’s  compare  an  extremely  light 
nucleus  like  4He  with  a  somewhat  heavier  one,  160.  A  neutron  or 
proton  in  4 He  can  be  attracted  by  the  three  others,  but  in  16 O,  it 
might  have  five  or  six  neighbors  attracting  it.  The  16 O  nucleus  is 
therefore  more  stable. 

It  turns  out  that  the  most  stable  nuclei  of  all  are  those  around 
nickel  and  iron,  having  about  30  protons  and  30  neutrons.  Just  as  a 
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r/1.  Our  sun’s  source  of  energy  is  nuclear  fusion,  so  nuclear  fusion  is  also  the  source  of  power  for  all 
life  on  earth,  including,  2,  this  rain  forest  in  Fatu-Hiva.  3.  The  first  release  of  energy  by  nuclear  fusion  through 
human  technology  was  the  1952  Ivy  Mike  test  at  the  Enewetak  Atoll.  4.  This  array  of  gamma-ray  detectors  is 
called  GAMMASPHERE.  During  operation,  the  array  is  closed  up,  and  a  beam  of  ions  produced  by  a  particle 
accelerator  strikes  a  target  at  its  center,  producing  nuclear  fusion  reactions.  The  gamma  rays  can  be  studied  for 
information  about  the  structure  of  the  fused  nuclei,  which  are  typically  varieties  not  found  in  nature.  5.  Nuclear 
fusion  promises  to  be  a  clean,  inexhaustible  source  of  energy.  However,  the  goal  of  commercially  viable  nuclear 
fusion  power  has  remained  elusive,  due  to  the  engineering  difficulties  involved  in  magnetically  containing  a 
plasma  (ionized  gas)  at  a  sufficiently  high  temperature  and  density.  This  photo  shows  the  experimental  JET 
reactor,  with  the  device  opened  up  on  the  left,  and  in  action  on  the  right. 


nucleus  that  is  too  heavy  to  be  stable  can  release  energy  by  splitting 
apart  into  pieces  that  are  closer  to  the  most  stable  size,  light  nuclei 
can  release  energy  if  you  stick  them  together  to  make  bigger  nuclei 
that  are  closer  to  the  most  stable  size.  Fusing  one  nucleus  with 
another  is  called  nuclear  fusion.  Nuclear  fusion  is  what  powers  our 
sun  and  other  stars. 
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8.2.8  Nuclear  energy  and  binding  energies 

In  the  same  way  that  chemical  reactions  can  be  classified  as 
exothermic  (releasing  energy)  or  endothermic  (requiring  energy  to 
react),  so  nuclear  reactions  may  either  release  or  use  up  energy.  The 
energies  involved  in  nuclear  reactions  are  greater  by  a  huge  factor. 
Thousands  of  tons  of  coal  would  have  to  be  burned  to  produce  as 
much  energy  as  would  be  produced  in  a  nuclear  power  plant  by  one 
kg  of  fuel. 

Although  nuclear  reactions  that  use  up  energy  (endothermic 
reactions)  can  be  initiated  in  accelerators,  where  one  nucleus  is 
rammed  into  another  at  high  speed,  they  do  not  occur  in  nature,  not 
even  in  the  sun.  The  amount  of  kinetic  energy  required  is  simply 
not  available. 

To  find  the  amount  of  energy  consumed  or  released  in  a  nuclear 
reaction,  you  need  to  know  how  much  nuclear  interaction  energy, 
Unuc ,  was  stored  or  released.  Experimentalists  have  determined  the 
amount  of  nuclear  energy  stored  in  the  nucleus  of  every  stable  el¬ 
ement,  as  well  as  many  unstable  elements.  This  is  the  amount  of 
mechanical  work  that  would  be  required  to  pull  the  nucleus  apart 
into  its  individual  neutrons  and  protons,  and  is  known  as  the  nuclear 
binding  energy. 

A  reaction  occurring  in  the  sun  example  8 

The  sun  produces  its  energy  through  a  series  of  nuclear  fusion 
reactions.  One  of  the  reactions  is 

1H+2H^3  He  +  Y 

The  excess  energy  is  almost  all  carried  off  by  the  gamma  ray  (not 
by  the  kinetic  energy  of  the  helium-3  atom).  The  binding  energies 
in  units  of  pJ  (picojoules)  are: 

'H  0  J 

2H  0.35593  pJ 

3He  1.23489  pJ 

The  total  initial  nuclear  energy  is  0  pJ+0. 35593  pJ,  and  the  final 
nuclear  energy  is  1 .23489  pJ,  so  by  conservation  of  energy,  the 
gamma  ray  must  carry  off  0.87896  pJ  of  energy.  The  gamma  ray 
is  then  absorbed  by  the  sun  and  converted  to  heat. 

self-check  E 

Why  is  the  binding  energy  of 1 H  exactly  equal  to  zero?  >  Answer,  p. 
965 


Figure  s  is  a  compact  way  of  showing  the  vast  variety  of  the 
nuclei.  Each  box  represents  a  particular  number  of  neutrons  and 
protons.  The  black  boxes  are  nuclei  that  are  stable,  i.e.,  that  would 
require  an  input  of  energy  in  order  to  change  into  another.  The 
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s/The  known  nuclei,  represented  on  a  chart  of  proton  number  versus  neutron  number.  Note  the  two 
nuclei  in  the  bottom  row  with  zero  protons. 


gray  boxes  show  all  the  unstable  nuclei  that  have  been  studied  ex¬ 
perimentally.  Some  of  these  last  for  billions  of  years  on  the  aver¬ 
age  before  decaying  and  are  found  in  nature,  but  most  have  much 
shorter  average  lifetimes,  and  can  only  be  created  and  studied  in 
the  laboratory. 

The  curve  along  which  the  stable  nuclei  he  is  called  the  line  of 
stability.  Nuclei  along  this  line  have  the  most  stable  proportion 
of  neutrons  to  protons.  For  light  nuclei  the  most  stable  mixture 
is  about  50-50,  but  we  can  see  that  stable  heavy  nuclei  have  two 
or  three  times  more  neutrons  than  protons.  This  is  because  the 
electrical  repulsions  of  all  the  protons  in  a  heavy  nucleus  add  up 
to  a  powerful  force  that  would  tend  to  tear  it  apart.  The  presence 
of  a  large  number  of  neutrons  increases  the  distances  among  the 
protons,  and  also  increases  the  number  of  attractions  due  to  the  (■ /  map  showing  levels  of 
strong  nuclear  force.  radiation  near  the  site  of  the 

Chernobyl  nuclear  accident. 

8.2.9  Biological  effects  of  ionizing  radiation 
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THEY  WON’T  STAY  DEAD! 


JOKES  MARILYN  EASTMAN  ■  KARL  HARDMAN  JUDITH  RtOLET  -  KEITH  WAYNE 


u  /  In  this  classic  zombie  flick,  a 
newscaster  speculates  that  the 
dead  have  been  reanimated  due 
to  radiation  brought  back  to  earth 
by  a  space  probe. 


v  /  Radiation  doesn’t  mutate 
entire  multicellular  organisms. 


Units  used  to  measure  exposure 

As  a  science  educator,  I  find  it  frustrating  that  nowhere  in  the 
massive  amount  of  journalism  devoted  to  nuclear  safety  does  one 
ever  find  any  numerical  statements  about  the  amount  of  radiation 
to  which  people  have  been  exposed.  Anyone  capable  of  understand¬ 
ing  sports  statistics  or  weather  reports  ought  to  be  able  to  under¬ 
stand  such  measurements,  as  long  as  something  like  the  following 
explanatory  text  was  inserted  somewhere  in  the  article: 

Radiation  exposure  is  measured  in  units  of  Sieverts  (Sv).  The 
average  person  is  exposed  to  about  2000  pSv  (microSieverts)  each 
year  from  natural  background  sources. 

With  this  context,  people  would  be  able  to  come  to  informed 
conclusions.  For  example,  figure  t  shows  a  scary-looking  map  of  the 
levels  of  radiation  in  the  area  surrounding  the  1986  nuclear  accident 
at  Chernobyl,  Ukraine,  the  most  serious  that  has  ever  occurred.  At 
the  boundary  of  the  most  highly  contaminated  (bright  red)  areas, 
people  would  be  exposed  to  about  13,000  pSv  per  year,  or  about  four 
times  the  natural  background  level.  In  the  pink  areas,  which  are  still 
densely  populated,  the  exposure  is  comparable  to  the  natural  level 
found  in  a  high-altitude  city  such  as  Denver. 

What  is  a  Sievert?  It  measures  the  amount  of  energy  per  kilo¬ 
gram  deposited  in  the  body  by  ionizing  radiation,  multiplied  by  a 
“quality  factor”  to  account  for  the  different  health  hazards  posed 
by  alphas,  betas,  gammas,  neutrons,  and  other  types  of  radiation. 
Only  ionizing  radiation  is  counted,  since  nonionizing  radiation  sim¬ 
ply  heats  one’s  body  rather  than  killing  cells  or  altering  DNA.  For 
instance,  alpha  particles  are  typically  moving  so  fast  that  their  ki¬ 
netic  energy  is  sufficient  to  ionize  thousands  of  atoms,  but  it  is 
possible  for  an  alpha  particle  to  be  moving  so  slowly  that  it  would 
not  have  enough  kinetic  energy  to  ionize  even  one  atom. 

Unfortunately,  most  people  don’t  know  much  about  radiation 
and  tend  to  react  to  it  based  on  unscientific  cultural  notions.  These 
may,  as  in  figure  u,  be  based  on  fictional  tropes  silly  enough  to  re¬ 
quire  the  suspension  of  disbelief  by  the  audience,  but  they  can  also 
be  more  subtle.  People  of  my  kids’  generation  are  more  familiar  with 
the  2011  Fukushima  nuclear  accident  than  with  the  much  more  se¬ 
rious  Chernobyl  accident.  The  news  coverage  of  Fukushima  showed 
scary  scenes  of  devastated  landscapes  and  distraught  evacuees,  im¬ 
plying  that  people  had  been  killed  and  displaced  by  the  release  of 
radiation  from  the  reaction.  In  fact,  there  were  no  deaths  at  all  due 
to  the  radiation  released  at  Fukushima,  and  no  excess  cancer  deaths 
are  statistically  predicted  in  the  future.  The  devastation  and  the 
death  toll  of  16,000  were  caused  by  the  earthquake  and  tsunami, 
which  were  also  what  damaged  the  plant. 
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Effects  of  exposure 

Notwithstanding  the  pop  culture  images  like  figure  v,  it  is  not 
possible  for  a  multicellular  animal  to  become  “mutated”  as  a  whole. 
In  most  cases,  a  particle  of  ionizing  radiation  will  not  even  hit  the 
DNA,  and  even  if  it  does,  it  will  only  affect  the  DNA  of  a  single  cell, 
not  every  cell  in  the  animal’s  body.  Typically,  that  cell  is  simply 
killed,  because  the  DNA  becomes  unable  to  function  properly.  Once 
in  a  while,  however,  the  DNA  may  be  altered  so  as  to  make  that 
cell  cancerous.  For  instance,  skin  cancer  can  be  caused  by  UV  light 
hitting  a  single  skin  cell  in  the  body  of  a  sunbather.  If  that  cell 
becomes  cancerous  and  begins  reproducing  uncontrollably,  she  will 
end  up  with  a  tumor  twenty  years  later. 

Other  than  cancer,  the  only  other  dramatic  effect  that  can  result 
from  altering  a  single  cell’s  DNA  is  if  that  cell  happens  to  be  a 
sperm  or  ovum,  which  can  result  in  nonviable  or  mutated  offspring. 
Men  are  relatively  immune  to  reproductive  harm  from  radiation, 
because  their  sperm  cells  are  replaced  frequently.  Women  are  more 
vulnerable  because  they  keep  the  same  set  of  ova  as  long  as  they 
live. 

Effects  of  high  doses  of  radiation 

A  whole-body  exposure  of  5,000,000  /jSv  will  kill  a  person  within 
a  week  or  so.  Luckily,  only  a  small  number  of  humans  have  ever  been 
exposed  to  such  levels:  one  scientist  working  on  the  Manhattan 
Project,  some  victims  of  the  Nagasaki  and  Hiroshima  explosions, 
and  31  workers  at  Chernobyl.  Death  occurs  by  massive  killing  of 
cells,  especially  in  the  blood-producing  cells  of  the  bone  marrow. 

Effects  of  low  doses  radiation 

Lower  levels,  on  the  order  of  1,000,000  //Sv,  were  inflicted  on 
some  people  at  Nagasaki  and  Hiroshima.  No  acute  symptoms  result 
from  this  level  of  exposure,  but  certain  types  of  cancer  are  signifi¬ 
cantly  more  common  among  these  people.  It  was  originally  expected 
that  the  radiation  would  cause  many  mutations  resulting  in  birth 
defects,  but  very  few  such  inherited  effects  have  been  observed. 

A  great  deal  of  time  has  been  spent  debating  the  effects  of  very 
low  levels  of  ionizing  radiation.  The  following  table  gives  some  sam¬ 
ple  figures. 

maximum  beneficial  dose  per  day  ~  10,000  /uSv 

CT  scan  ~  10,000  gSv 

natural  background  per  year  2,000-7,000  gSv 

health  guidelines  for  exposure  to  a  fetus  1,000  gSv 

flying  from  New  York  to  Tokyo  150  gSv 

chest  x-ray  50  gSv 

Note  that  the  largest  number,  on  the  first  line  of  the  table,  is  the 
maximum  beneficial  dose.  The  most  useful  evidence  comes  from 
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w  /  A  typical  example  of  radi¬ 
ation  hormesis:  the  health  of 
mice  is  improved  by  low  levels 
of  radiation.  In  this  study,  young 
mice  were  exposed  to  fairly  high 
levels  of  x-rays,  while  a  control 
group  of  mice  was  not  exposed. 
The  mice  were  weighed,  and 
their  rate  of  growth  was  taken 
as  a  measure  of  their  health.  At 
levels  below  about  50,000  pSv, 
the  radiation  had  a  beneficial 
effect  on  the  health  of  the  mice, 
presumably  by  activating  cellular 
damage  control  mechanisms. 
The  two  highest  data  points 
are  statistically  significant  at  the 
99%  level.  The  curve  is  a  fit  to 
a  theoretical  model.  Redrawn 
from  T.D.  Luckey,  Hormesis  with 
Ionizing  Radiation,  CRC  Press, 
1980. 


experiments  in  animals,  which  can  intentionally  be  exposed  to  sig¬ 
nificant  and  well  measured  doses  of  radiation  under  controlled  con¬ 
ditions.  Experiments  show  that  low  levels  of  radiation  activate  cel¬ 
lular  damage  control  mechanisms,  increasing  the  health  of  the  or¬ 
ganism.  For  example,  exposure  to  radiation  up  to  a  certain  level 
makes  mice  grow  faster;  makes  guinea  pigs’  immune  systems  func¬ 
tion  better  against  diptheria;  increases  fertility  in  trout  and  mice; 
improves  fetal  mice’s  resistance  to  disease;  increases  the  life-spans  of 
flour  beetles  and  mice;  and  reduces  mortality  from  cancer  in  mice. 
This  type  of  effect  is  called  radiation  hormesis. 

There  is  also  some  evidence  that  in  humans,  small  doses  of  ra¬ 
diation  increase  fertility,  reduce  genetic  abnormalities,  and  reduce 
mortality  from  cancer.  The  human  data,  however,  tend  to  be  very 
poor  compared  to  the  animal  data.  Due  to  ethical  issues,  one  cannot 
do  controlled  experiments  in  humans.  For  example,  one  of  the  best 
sources  of  information  has  been  from  the  survivors  of  the  Hiroshima 
and  Nagasaki  bomb  blasts,  but  these  people  were  also  exposed  to 
high  levels  of  carcinogenic  chemicals  in  the  smoke  from  their  burning 
cities;  for  comparison,  firefighters  have  a  heightened  risk  of  cancer, 
and  there  are  also  significant  concerns  about  cancer  from  the  9/11 
attacks  in  New  York.  The  direct  empirical  evidence  about  radiation 
hormesis  in  humans  is  therefore  not  good  enough  to  tell  us  anything 
unambiguous,3  and  the  most  scientifically  reasonable  approach  is  to 
assume  that  the  results  in  animals  also  hold  for  humans:  small  doses 
of  radiation  in  humans  are  beneficial,  rather  than  harmful.  However, 
a  variety  of  cultural  and  historical  factors  have  led  to  a  situation  in 
which  public  health  policy  is  based  on  the  assumption,  known  as 
“linear  no-threshold”  (LNT),  that  even  tiny  doses  of  radiation  are 
harmful,  and  that  the  risk  they  carry  is  proportional  to  the  dose. 
In  other  words,  law  and  policy  are  made  based  on  the  assumption 
that  the  effects  of  radiation  on  humans  are  dramatically  different 
than  its  effects  on  mice  and  guinea  pigs.  Even  with  the  unrealis¬ 
tic  assumption  of  LNT,  one  can  still  evaluate  risks  by  comparing 
with  natural  background  radiation.  For  example,  we  can  see  that 
the  effect  of  a  chest  x-ray  is  about  a  hundred  times  smaller  than 
the  effect  of  spending  a  year  in  Colorado,  where  the  level  of  natural 
background  radiation  from  cosmic  rays  is  higher  than  average,  due 
to  the  high  altitude.  Dropping  the  implausible  LNT  assumption,  we 
can  see  that  the  impact  on  one’s  health  of  spending  a  year  in  Col¬ 
orado  is  likely  to  be  positive ,  because  the  excess  radiation  is  below 
the  maximum  beneficial  level. 


,sFor  two  opposing  viewpoints,  see  Tubiana  et  al.,  “The  Linear  No-Threshold 
Relationship  Is  Inconsistent  with  Radiation  Biologic  and  Experimental  Data,” 
Radiology,  251  (2009)  13  and  Little  et  al.,  “  Risks  Associated  with  Low  Doses 
and  Low  Dose  Rates  of  Ionizing  Radiation:  Why  Linearity  May  Be  (Almost)  the 
Best  We  Can  Do,”  Radiology,  251  (2009)  6. 
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The  green  case  for  nuclear  power 

In  the  late  twentieth  century,  antinuclear  activists  largely  suc¬ 
ceeded  in  bringing  construction  of  new  nuclear  power  plants  to  a 
halt  in  the  U.S.  Ironically,  we  now  know  that  the  burning  of  fossil 
fuels,  which  leads  to  global  warming,  is  a  far  more  grave  threat  to 
the  environment  than  even  the  Chernobyl  disaster.  A  team  of  bi¬ 
ologists  writes:  “During  recent  visits  to  Chernobyl,  we  experienced 
numerous  sightings  of  moose  (Alces  alces),  roe  deer  (Capreol  capre- 
olus),  Russian  wild  boar  (Sus  scrofa),  foxes  (Vulpes  vulpes),  river 
otter  (Lutra  canadensis),  and  rabbits  (Lepus  europaeus)  ...  Diver¬ 
sity  of  flowers  and  other  plants  in  the  highly  radioactive  regions  is 
impressive  and  equals  that  observed  in  protected  habitats  outside 
the  zone  ...  The  observation  that  typical  human  activity  (industrial¬ 
ization,  farming,  cattle  raising,  collection  of  firewood,  hunting,  etc.) 
is  more  devastating  to  biodiversity  and  abundance  of  local  flora  and 
fauna  than  is  the  worst  nuclear  power  plant  disaster  validates  the 
negative  impact  the  exponential  growth  of  human  populations  has 
on  wildlife.”4 

Nuclear  power  is  the  only  source  of  energy  that  is  sufficient  to 
replace  any  significant  percentage  of  energy  from  fossil  fuels  on  the 
rapid  schedule  demanded  by  the  speed  at  which  global  warming  is 
progressing.  People  worried  about  the  downside  of  nuclear  energy 
might  be  better  off  putting  their  energy  into  issues  related  to  nu¬ 
clear  weapons:  the  poor  stewardship  of  the  former  Soviet  Union’s 
warheads;  nuclear  proliferation  in  unstable  states  such  as  Pakistan; 
and  the  poor  safety  and  environmental  history  of  the  superpowers’ 
nuclear  weapons  programs,  including  the  loss  of  several  warheads 
in  plane  crashes,  and  the  environmental  disaster  at  the  Hanford, 
Washington,  weapons  plant. 

Protection  from  radiation 

People  do  sometimes  work  with  strong  enough  radioactivity  that 
there  is  a  serious  health  risk.  Typically  the  scariest  sources  are  those 
used  in  cancer  treatment  and  in  medical  and  biological  research. 
Also,  a  dental  technician,  for  example,  needs  to  take  precautions  to 
avoid  accumulating  a  large  radiation  dose  from  giving  dental  x-rays 
to  many  patients.  There  are  three  general  ways  to  reduce  exposure: 
time,  distance,  and  shielding.  This  is  why  a  dental  technician  doing 
x-rays  wears  a  lead  apron  (shielding)  and  steps  outside  of  the  x-ray 
room  while  running  an  exposure  (distance).  Reducing  the  time  of 
exposure  dictates,  for  example,  that  a  person  working  with  a  hot 
cancer-therapy  source  would  minimize  the  amount  of  time  spent 

4Baker  and  Chesser,  Env.  Toxicology  and  Chem.  19  (1231)  2000.  Similar 
effects  have  been  seen  at  the  Bikini  Atoll,  the  site  of  a  1954  hydrogen  bomb  test. 
Although  some  species  have  disappeared  from  the  area,  the  coral  reef  is  in  many 
ways  healthier  than  similar  reefs  elsewhere,  because  humans  have  tended  to  stay 
away  for  fear  of  radiation  (Richards  et  al.,  Marine  Pollution  Bulletin  56  (2008) 
503). 
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x  /  Wild  Przewalski’s  horses 
prosper  in  the  Chernobyl  area. 


y  /  Fossil  fuels  have  done  in¬ 
comparably  more  damage  to  the 
environment  than  nuclear  power 
ever  has.  Polar  bears’  habitat  is 
rapidly  being  destroyed  by  global 
warming. 
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z  /  The  Crab  Nebula  is  a  remnant 
of  a  supernova  explosion.  Al¬ 
most  all  the  elements  our  planet 
is  made  of  originated  in  such 
explosions. 


aa  /  Construction  of  the  UNI- 
LAC  accelerator  in  Germany,  one 
of  whose  uses  is  for  experiments 
to  create  very  heavy  artificial 
elements.  In  such  an  experiment, 
fusion  products  recoil  through  a 
device  called  SHIP  (not  shown) 
that  separates  them  based  on 
their  charge-to-mass  ratios  — 
it  is  essentially  just  a  scaled-up 
version  of  Thomson’s  appara¬ 
tus.  A  typical  experiment  runs 
for  several  months,  and  out  of 
the  billions  of  fusion  reactions 
induced  during  this  time,  only 
one  or  two  may  result  in  the 
production  of  superheavy  atoms. 
In  all  the  rest,  the  fused  nucleus 
breaks  up  immediately.  SHIP 
is  used  to  identify  the  small 
number  of  “good”  reactions  and 
separate  them  from  this  intense 
background. 


near  it. 

Shielding  against  alpha  and  beta  particles  is  trivial  to  accom¬ 
plish.  (Alphas  can’t  even  penetrate  the  skin.)  Gammas  and  x-rays 
interact  most  strongly  with  materials  that  are  dense  and  have  high 
atomic  numbers,  which  is  why  lead  is  so  commonly  used.  But  other 
materials  will  also  work.  For  example,  the  reason  that  bones  show 
up  so  clearly  on  x-ray  images  is  that  they  are  dense  and  contain 
plenty  of  calcium,  which  has  a  higher  atomic  number  than  the  ele¬ 
ments  found  in  most  other  body  tissues,  which  are  mostly  made  of 
water. 

Neutrons  are  difficult  to  shield  against.  Because  they  are  electri¬ 
cally  neutral,  they  don’t  interact  intensely  with  matter  in  the  same 
way  as  alphas  and  betas.  They  only  interact  if  they  happen  to  col¬ 
lide  head-on  with  a  nucleus,  and  that  doesn’t  happen  very  often 
because  nuclei  are  tiny  targets.  Kinematically,  a  collision  can  trans¬ 
fer  kinetic  energy  most  efficiently  when  the  target  is  as  low  in  mass 
as  possible  compared  to  the  projectile.  For  this  reason,  substances 
that  contain  a  lot  of  hydrogen  make  the  best  shielding  against  neu¬ 
trons.  Blocks  of  paraffin  wax  from  the  supermarket  are  often  used 
for  this  purpose. 

8.2.1 0  *  The  creation  of  the  elements 

Creation  of  hydrogen  and  helium  in  the  Big  Bang 

Did  all  the  chemical  elements  we’re  made  of  come  into  being  in 
the  big  bang?5  Temperatures  in  the  first  microseconds  after  the  big 
bang  were  so  high  that  atoms  and  nuclei  could  not  hold  together 
at  all.  After  things  had  cooled  down  enough  for  nuclei  and  atoms 
to  exist,  there  was  a  period  of  about  three  minutes  during  which 
the  temperature  and  density  were  high  enough  for  fusion  to  occur, 
but  not  so  high  that  atoms  could  hold  together.  We  have  a  good, 
detailed  understanding  of  the  laws  of  physics  that  apply  under  these 
conditions,  so  theorists  are  able  to  say  with  confidence  that  the 
only  element  heavier  than  hydrogen  that  was  created  in  significant 
quantities  was  helium. 

We  are  stardust 

In  that  case,  where  did  all  the  other  elements  come  from?  As¬ 
tronomers  came  up  with  the  answer.  By  studying  the  combinations 
of  wavelengths  of  light,  called  spectra,  emitted  by  various  stars,  they 
had  been  able  to  determine  what  kinds  of  atoms  they  contained. 
(We  will  have  more  to  say  about  spectra  at  the  end  of  this  book.) 
They  found  that  the  stars  fell  into  two  groups.  One  type  was  nearly 
100%  hydrogen  and  helium,  while  the  other  contained  99%  hydrogen 
and  helium  and  1%  other  elements.  They  interpreted  these  as  two 


5The  evidence  for  the  big  bang  theory  of  the  origin  of  the  universe  was  dis¬ 
cussed  on  p.  360. 
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generations  of  stars.  The  first  generation  had  formed  out  of  clouds 
of  gas  that  came  fresh  from  the  big  bang,  and  their  composition 
reflected  that  of  the  early  universe.  The  nuclear  fusion  reactions 
by  which  they  shine  have  mainly  just  increased  the  proportion  of 
helium  relative  to  hydrogen,  without  making  any  heavier  elements. 
The  members  of  the  first  generation  that  we  see  today,  however,  are 
only  those  that  lived  a  long  time.  Small  stars  are  more  miserly  with 
their  fuel  than  large  stars,  which  have  short  lives.  The  large  stars  of 
the  first  generation  have  already  finished  their  lives.  Near  the  end  of 
its  lifetime,  a  star  runs  out  of  hydrogen  fuel  and  undergoes  a  series 
of  violent  and  spectacular  reorganizations  as  it  fuses  heavier  and 
heavier  elements.  Very  large  stars  finish  this  sequence  of  events  by 
undergoing  supernova  explosions,  in  which  some  of  their  material  is 
flung  off  into  the  void  while  the  rest  collapses  into  an  exotic  object 
such  as  a  black  hole  or  neutron  star. 

The  second  generation  of  stars,  of  which  our  own  sun  is  an  exam¬ 
ple,  condensed  out  of  clouds  of  gas  that  had  been  enriched  in  heavy 
elements  due  to  supernova  explosions.  It  is  those  heavy  elements 
that  make  up  our  planet  and  our  bodies. 

Discussion  Questions 

A  Should  the  quality  factor  for  neutrinos  be  very  small,  because  they 
mostly  don’t  interact  with  your  body? 

B  Would  an  alpha  source  be  likely  to  cause  different  types  of  cancer 
depending  on  whether  the  source  was  external  to  the  body  or  swallowed 
in  contaminated  food?  What  about  a  gamma  source? 


Section  8.2  The  Nucleus 
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Problem  1.  Top:  A  realistic 
picture  of  a  neuron.  Bottom: 
A  simplified  diagram  of  one 
segment  of  the  tail  (axon). 

0.1  nm  0.1  nm 

A - ►  ◄ - ► 

o  o  o 

electron  nucleus  electron 

Problem  3. 


The  symbols  \/,  - ,  etc.  are  explained  on  page  513. 

1  The  figure  shows  a  neuron,  which  is  the  type  of  cell  your  nerves 
are  made  of.  Neurons  serve  to  transmit  sensory  information  to  the 
brain,  and  commands  from  the  brain  to  the  muscles.  All  this  data 
is  transmitted  electrically,  but  even  when  the  cell  is  resting  and  not 
transmitting  any  information,  there  is  a  layer  of  negative  electrical 
charge  on  the  inside  of  the  cell  membrane,  and  a  layer  of  positive 
charge  just  outside  it.  This  charge  is  in  the  form  of  various  ions 
dissolved  in  the  interior  and  exterior  fluids.  Why  would  the  negative 
charge  remain  plastered  against  the  inside  surface  of  the  membrane, 
and  likewise  why  doesn’t  the  positive  charge  wander  away  from  the 
outside  surface? 

2  The  Earth  and  Moon  are  bound  together  by  gravity.  If,  in¬ 

stead,  the  force  of  attraction  were  the  result  of  each  having  a  charge 
of  the  same  magnitude  but  opposite  in  sign,  find  the  quantity  of 
charge  that  would  have  to  be  placed  on  each  to  produce  the  re¬ 
quired  force.  V 

3  A  helium  atom  finds  itself  momentarily  in  this  arrangement. 
Find  the  direction  and  magnitude  of  the  force  acting  on  the  right- 
hand  electron.  The  two  protons  in  the  nucleus  are  so  close  together 
(~  1  fm)  that  you  can  consider  them  as  being  right  on  top  of  each 
other. 

V  ■ 

4  241Pu  decays  either  by  electron  decay  or  by  alpha  decay.  (A 
given  241Pu  nucleus  may  do  either  one;  it’s  random.)  What  are  the 
isotopes  created  as  products  of  these  two  modes  of  decay? 

5  Suppose  that  a  proton  in  a  lead  nucleus  wanders  out  to  the 

surface  of  the  nucleus,  and  experiences  a  strong  nuclear  force  of 
about  8  kN  from  the  nearby  neutrons  and  protons  pulling  it  back 
in.  Compare  this  numerically  to  the  repulsive  electrical  force  from 
the  other  protons,  and  verify  that  the  net  force  is  attractive.  A 
lead  nucleus  is  very  nearly  spherical,  is  about  6.5  fm  in  radius,  and 
contains  82  protons,  each  with  a  charge  of  +e,  where  e  =  1.60  x 
10“19  C.  V  - 

6  The  nuclear  process  of  beta  decay  by  electron  capture  is  de¬ 
scribed  parenthetically  on  page  498.  The  reaction  is  p  +  e_  — >■  n  +  v. 

(a)  Show  that  charge  is  conserved  in  this  reaction. 

(b)  Conversion  between  energy  and  mass  is  discussed  in  chapter  7. 
Based  on  these  ideas,  explain  why  electron  capture  doesn’t  occur  in 
hydrogen  atoms.  (If  it  did,  matter  wouldn’t  exist!) 
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7  In  the  semifinals  of  an  electrostatic  croquet  tournament,  Jes¬ 
sica  hits  her  positively  charged  ball,  sending  it  across  the  playing 
field,  rolling  to  the  left  along  the  x  axis.  It  is  repelled  by  two  other 
positive  charges.  These  two  charges  are  fixed  on  the  y  axis  at  the 
locations  shown  in  the  figure.  The  two  fixed  charges  are  equal  to 
each  other,  but  not  to  the  charge  of  the  ball. 

(a)  Express  the  force  on  the  ball  in  terms  of  the  ball’s  position,  x. 

(b)  At  what  value  of  x  does  the  ball  experience  the  greatest  de¬ 
celeration?  Express  you  answer  in  terms  of  b.  (Assume  the  ball 
has  enough  energy  to  continue  through,  so  you  don’t  have  to  worry 
about  whether  it  actually  gets  as  far  as  this  particular  value  of  x.) 
(Based  on  a  problem  by  Halliday  and  Resnick.) 

8  Suppose  that  at  some  instant  in  time,  a  wire  extending  from 

x  =  0  to  x  =  oo  holds  a  charge  density,  in  units  of  coulombs  per 
meter,  given  by  ae~hx .  This  type  of  charge  density,  d q/  dx,  is  typi¬ 
cally  notated  as  A  (Greek  letter  lambda).  Find  the  total  charge  on 
the  wire.  V  ■ 

9  Use  the  nutritional  information  on  some  packaged  food  to 

make  an  order-of-magnitude  estimate  of  the  amount  of  chemical 
energy  stored  in  one  atom  of  food,  in  units  of  joules.  Assume  that 
a  typical  atom  has  a  mass  of  1CU26  kg.  This  constitutes  a  rough 
estimate  of  the  amounts  of  energy  there  are  on  the  atomic  scale. 
[See  chapter  0  for  help  on  how  to  do  order-of-magnitude  estimates. 
Note  that  a  nutritional  “calorie”  is  really  a  kilocalorie;  see  page 
986.]  >/  ■ 

10  The  helium  atom  of  problem  3  has  some  new  experiences,  goes 
through  some  life  changes,  and  later  on  finds  itself  in  the  configura¬ 
tion  shown  here.  What  are  the  direction  and  magnitude  of  the  force 
acting  on  the  bottom  electron?  (Draw  a  sketch  to  make  clear  the 
definition  you  are  using  for  the  angle  that  gives  direction.)  V  - 

11  A  neon  light  consists  of  a  long  glass  tube  full  of  neon,  with 
metal  caps  on  the  ends.  Positive  charge  is  placed  on  one  end  of  the 
tube,  and  negative  charge  on  the  other.  The  electric  forces  generated 
can  be  strong  enough  to  strip  electrons  off  of  a  certain  number  of 
neon  atoms.  Assume  for  simplicity  that  only  one  electron  is  ever 
stripped  off  of  any  neon  atom.  When  an  electron  is  stripped  off  of 
an  atom,  both  the  electron  and  the  neon  atom  (now  an  ion)  have 
electric  charge,  and  they  are  accelerated  by  the  forces  exerted  by  the 
charged  ends  of  the  tube.  (They  do  not  feel  any  significant  forces 
from  the  other  ions  and  electrons  within  the  tube,  because  only 
a  tiny  minority  of  neon  atoms  ever  gets  ionized.)  Light  is  finally 
produced  when  ions  are  reunited  with  electrons.  Give  a  numerical 
comparison  of  the  magnitudes  and  directions  of  the  accelerations  of 
the  electrons  and  ions.  [You  may  need  some  data  from  appendix  5.] 

V  I 
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12  The  subatomic  particles  called  muons  behave  exactly  like 
electrons,  except  that  a  muon’s  mass  is  greater  by  a  factor  of  206.77. 
Muons  are  continually  bombarding  the  Earth  as  part  of  the  stream 
of  particles  from  space  known  as  cosmic  rays.  When  a  muon  strikes 
an  atom,  it  can  displace  one  of  its  electrons.  If  the  atom  happens 
to  be  a  hydrogen  atom,  then  the  muon  takes  up  an  orbit  that  is 
on  the  average  206.77  times  closer  to  the  proton  than  the  orbit  of 
the  ejected  electron.  How  many  times  greater  is  the  electric  force 
experienced  by  the  muon  than  that  previously  felt  by  the  electron? 

V 

13  (a)  Recall  that  the  gravitational  energy  of  two  gravitationally 

interacting  spheres  is  given  by  Ug  =  —Gm\m2/r,  where  r  is  the 
center-to-center  distance.  What  would  be  the  analogous  equation 
for  two  electrically  interacting  spheres?  Justify  your  choice  of  a 
plus  or  minus  sign  on  physical  grounds,  considering  attraction  and 
repulsion.  V 

(b)  Use  this  expression  to  estimate  the  energy  required  to  pull  apart 

a  raisin-cookie  atom  of  the  one-electron  type,  assuming  a  radius  of 
1CU10  m.  V 

(c)  Compare  this  with  the  result  of  problem  9. 

14  If  you  put  two  hydrogen  atoms  near  each  other,  they  will  feel 

an  attractive  force,  and  they  will  pull  together  to  form  a  molecule. 
(Molecules  consisting  of  two  hydrogen  atoms  are  the  normal  form 
of  hydrogen  gas.)  Why  do  they  feel  a  force  if  they  are  near  each 
other,  since  each  is  electrically  neutral?  Shouldn’t  the  attractive 
and  repulsive  forces  all  cancel  out  exactly?  Use  the  raisin  cookie 
model.  (Students  who  have  taken  chemistry  often  try  to  use  fancier 
models  to  explain  this,  but  if  you  can’t  explain  it  using  a  simple 
model,  you  probably  don’t  understand  the  fancy  model  as  well  as 
you  thought  you  did!)  - 

15  The  figure  shows  one  layer  of  the  three-dimensional  structure 
of  a  salt  crystal.  The  atoms  extend  much  farther  off  in  all  directions, 
but  only  a  six-by-six  square  is  shown  here.  The  larger  circles  are 
the  chlorine  ions,  which  have  charges  of  —  e.  The  smaller  circles  are 
sodium  ions,  with  charges  of  +e.  The  distance  between  neighboring 
ions  is  about  0.3  nm.  Real  crystals  are  never  perfect,  and  the  crystal 
shown  here  has  two  defects:  a  missing  atom  at  one  location,  and  an 
extra  lithium  atom,  shown  as  a  grey  circle,  inserted  in  one  of  the 
small  gaps.  If  the  lithium  atom  has  a  charge  of  +e,  what  is  the 
direction  and  magnitude  of  the  total  force  on  it?  Assume  there  are 
no  other  defects  nearby  in  the  crystal  besides  the  two  shown  here. 

>  Hint,  p.  979  V  fl 
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16  Suppose  you  are  holding  your  hands  in  front  of  you,  10  cm 
apart. 

(a)  Estimate  the  total  number  of  electrons  in  each  hand.  V 

(b)  Estimate  the  total  repulsive  force  of  all  the  electrons  in  one  hand 

on  all  the  electrons  in  the  other.  V 

(c)  Why  don’t  you  feel  your  hands  repelling  each  other? 

(d)  Estimate  how  much  the  charge  of  a  proton  could  differ  in  mag¬ 

nitude  from  the  charge  of  an  electron  without  creating  a  noticeable 
force  between  your  hands.  ■ 

17  Potassium  40  is  the  strongest  source  of  naturally  occurring 
beta  radioactivity  in  our  environment.  It  decays  according  to 

40K  ->■  40 Ca  +  e"  +  v. 

The  energy  released  in  the  decay  is  1.33  MeV,  where  1  eV  is  defined 
as  the  fundamental  charge  e  multiplied  by  one  volt.  The  energy 
is  shared  randomly  among  the  products,  subject  to  the  constraint 
imposed  by  conservation  of  energy- momentum,  which  dictates  that 
very  little  of  the  energy  is  carried  by  the  recoiling  calcium  nucleus. 
Determine  the  maximum  energy  of  the  calcium,  and  compare  with 
the  typical  energy  of  a  chemical  bond,  which  is  a  few  eV.  If  the 
potassium  is  part  of  a  molecule,  do  we  expect  the  molecule  to  sur¬ 
vive?  Carry  out  the  calculation  first  by  assuming  that  the  electron 
is  ultrarelativistic,  then  without  the  approximation,  and  comment 
on  the  how  good  the  approximation  is.  ■ 


Key  to  symbols: 

easy  -  typical  B  challenging  B  difficult  ■  very  difficult 
V  An  answer  check  is  available  at  www.lightandmatter.com. 
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Exercises 

Exercise  8A:  Nuclear  decay 

1.  Consulting  a  periodic  table,  find  the  N,  Z,  and  A  of  the  following: 


N 

Z 

A 

'He 

244Pu 

2.  Consider  the  following  five  decay  processes: 

•  a  decay 

•  7  decay 

•  p  — )•  n  +  e+  +  ^  (/3+  decay) 

•  n  — )•  p  +  e-  +  P  {(5~  decay) 

•  p  +  e~  — >  n  +  v  (electron  capture) 

What  would  be  the  action  of  each  of  these  on  the  chart  of  the  nuclei?  The  *  represents  the 
original  nucleus. 


3.  (a)  Suppose  that  244Pu  undergoes  perfectly  symmetric  fission,  and  also  emits  two  neutrons. 
Find  the  daughter  isotope. 

(b)  Is  the  daughter  stable,  or  is  it  neutron-rich  or  -poor  relative  to  the  line  of  stability?  (To 
estimate  what’s  stable,  you  can  use  a  large  chart  of  the  nuclei,  or,  if  you  don’t  have  one  handy, 
consult  a  periodic  table  and  use  the  average  atomic  mass  as  an  approximation  ot  the  stable 
value  of  A) 

(c)  Consulting  the  chart  of  the  nuclei  (fig.  s  on  p.  503),  explain  why  it  turns  out  this  way. 

(d)  If  the  daughter  is  unstable,  which  process  from  question  #2  would  you  expect  it  to  decay 
by? 
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Chapter  9 

Circuits 


Madam,  what  good  is  a  baby?  Michael  Faraday,  when  asked  by 
Queen  Victoria  what  the  electrical  devices  in  his  lab  were  good  for 

A  few  years  ago,  my  wife  and  I  bought  a  house  with  Character, 
Character  being  a  survival  mechanism  that  houses  have  evolved  in 
order  to  convince  humans  to  agree  to  much  larger  mortgage  pay¬ 
ments  than  they’d  originally  envisioned.  Anyway,  one  of  the  fea¬ 
tures  that  gives  our  house  Character  is  that  it  possesses,  built  into 
the  wall  of  the  family  room,  a  set  of  three  pachinko  machines.  These 
are  Japanese  gambling  devices  sort  of  like  vertical  pinball  machines. 
(The  legal  papers  we  got  from  the  sellers  hastened  to  tell  us  that 
they  were  “for  amusement  purposes  only.”)  Unfortunately,  only  one 
of  the  three  machines  was  working  when  we  moved  in,  and  it  soon 
died  on  us.  Having  become  a  pachinko  addict,  I  decided  to  fix  it, 
but  that  was  easier  said  than  done.  The  inside  is  a  veritable  Rube 
Goldberg  mechanism  of  levers,  hooks,  springs,  and  chutes.  My  hor¬ 
monal  pride,  combined  with  my  Ph.D.  in  physics,  made  me  certain 
of  success,  and  rendered  my  eventual  utter  failure  all  the  more  de¬ 
moralizing. 

Contemplating  my  defeat,  I  realized  how  few  complex  mechan¬ 
ical  devices  I  used  from  day  to  day.  Apart  from  our  cars  and  my 
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saxophone,  every  technological  tool  in  our  modern  life-support  sys¬ 
tem  was  electronic  rather  than  mechanical. 


a  /  Gymnotus  carapo ,  a  knifefish, 
uses  electrical  signals  to  sense 
its  environment  and  to  commu¬ 
nicate  with  others  of  its  species. 
( Greg  DeGreef) 


9.1  Current  and  Voltage 

9.1.1  Current 

Unity  of  all  types  of  electricity 

We  are  surrounded  by  things  we  have  been  told  are  “electrical,” 
but  it’s  far  from  obvious  what  they  have  in  common  to  justify  being 
grouped  together.  What  relationship  is  there  between  the  way  socks 
cling  together  and  the  way  a  battery  lights  a  lightbulb?  We  have 
been  told  that  both  an  electric  eel  and  our  own  brains  are  somehow 
electrical  in  nature,  but  what  do  they  have  in  common? 

British  physicist  Michael  Faraday  (1791-1867)  set  out  to  address 
this  problem.  He  investigated  electricity  from  a  variety  of  sources 
-  including  electric  eels!  —  to  see  whether  they  could  all  produce 
the  same  effects,  such  as  shocks  and  sparks,  attraction  and  repul¬ 
sion.  “Heating”  refers,  for  example,  to  the  way  a  lightbulb  filament 
gets  hot  enough  to  glow  and  emit  light.  Magnetic  induction  is  an 
effect  discovered  by  Faraday  himself  that  connects  electricity  and 
magnetism.  We  will  not  study  this  effect,  which  is  the  basis  for  the 
electric  generator,  in  detail  until  later  in  the  book. 

attraction  and 


shocks 

sparks 

repulsion 

heating 

rubbing 

V 

V 

V 

V 

battery 

V 

V 

V 

V 

animal 

V 

V 

(V) 

V 

magnetically  induced 

V 

V 

V 

V 

The  table  shows  a  summary  of  some  of  Faraday’s  results.  Check 
marks  indicate  that  Faraday  or  his  close  contemporaries  were  able  to 
verify  that  a  particular  source  of  electricity  was  capable  of  producing 
a  certain  effect.  (They  evidently  failed  to  demonstrate  attraction 
and  repulsion  between  objects  charged  by  electric  eels,  although 
modern  workers  have  studied  these  species  in  detail  and  been  able 
to  understand  all  their  electrical  characteristics  on  the  same  footing 
as  other  forms  of  electricity.) 

Faraday’s  results  indicate  that  there  is  nothing  fundamentally 
different  about  the  types  of  electricity  supplied  by  the  various  sources. 
They  are  all  able  to  produce  a  wide  variety  of  identical  effects.  Wrote 
Faraday,  “The  general  conclusion  which  must  be  drawn  from  this 
collection  of  facts  is  that  electricity,  whatever  may  be  its  source,  is 
identical  in  its  nature.” 

If  the  types  of  electricity  are  the  same  thing,  what  thing  is  that? 
The  answer  is  provided  by  the  fact  that  all  the  sources  of  electricity 
can  cause  objects  to  repel  or  attract  each  other.  We  use  the  word 
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“charge”  to  describe  the  property  of  an  object  that  allows  it  to 
participate  in  such  electrical  forces,  and  we  have  learned  that  charge 
is  present  in  matter  in  the  form  of  nuclei  and  electrons.  Evidently 
all  these  electrical  phenomena  boil  down  to  the  motion  of  charged 
particles  in  matter. 

Electric  current 

If  the  fundamental  phenomenon  is  the  motion  of  charged  parti¬ 
cles,  then  how  can  we  define  a  useful  numerical  measurement  of  it? 
We  might  describe  the  flow  of  a  river  simply  by  the  velocity  of  the 
water,  but  velocity  will  not  be  appropriate  for  electrical  purposes 
because  we  need  to  take  into  account  how  much  charge  the  moving 
particles  have,  and  in  any  case  there  are  no  practical  devices  sold 
at  Radio  Shack  that  can  tell  us  the  velocity  of  charged  particles. 
Experiments  show  that  the  intensity  of  various  electrical  effects  is 
related  to  a  different  quantity:  the  number  of  coulombs  of  charge 
that  pass  by  a  certain  point  per  second.  By  analogy  with  the  flow 
of  water,  this  quantity  is  called  the  electric  current  ,  I.  Its  units  of 
coulombs/second  are  more  conveniently  abbreviated  as  amperes,  1 
A=1  C/s.  (In  informal  speech,  one  usually  says  “amps.”) 

The  main  subtlety  involved  in  this  definition  is  how  to  account 
for  the  two  types  of  charge.  The  stream  of  water  coming  from  a 
hose  is  made  of  atoms  containing  charged  particles,  but  it  produces 
none  of  the  effects  we  associate  with  electric  currents.  For  example, 
you  do  not  get  an  electrical  shock  when  you  are  sprayed  by  a  hose. 
This  type  of  experiment  shows  that  the  effect  created  by  the  motion 
of  one  type  of  charged  particle  can  be  canceled  out  by  the  motion  of 
the  opposite  type  of  charge  in  the  same  direction.  In  water,  every 
oxygen  atom  with  a  charge  of  +8e  is  surrounded  by  eight  electrons 
with  charges  of  — e,  and  likewise  for  the  hydrogen  atoms. 

We  therefore  refine  our  definition  of  current  as  follows: 

When  charged  particles  are  exchanged  between  regions  of  space 
A  and  B,  the  electric  current  flowing  from  A  to  B  is  defined  as 


where  d q  is  the  change  in  region  B’s  total  charge  occurring  over  a 
period  of  time  dt. 

In  the  garden  hose  example,  your  body  picks  up  equal  amounts 
of  positive  and  negative  charge,  resulting  in  no  change  in  your  total 
charge,  so  the  electrical  current  flowing  into  you  is  zero. 
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b  /  Example  1 


Ions  moving  across  a  cell  membrane  example  1 

t>  Figure  b  shows  ions,  labeled  with  their  charges,  moving  in  or 
out  through  the  membranes  of  four  cells.  If  the  ions  all  cross 
the  membranes  during  the  same  interval  of  time,  how  would  the 
currents  into  the  cells  compare  with  each  other? 

>  We’re  just  assuming  the  rate  of  flow  is  constant,  so  we  can  talk 
about  A q  instead  of  d q. 

Cell  A  has  positive  current  going  into  it  because  its  charge  is  in¬ 
creased,  i.e.,  has  a  positive  value  of  A q. 

Cell  B  has  the  same  current  as  cell  A,  because  by  losing  one  unit 
of  negative  charge  it  also  ends  up  increasing  its  own  total  charge 
by  one  unit. 

Cell  C’s  total  charge  is  reduced  by  three  units,  so  it  has  a  large 
negative  current  going  into  it. 

Cell  D  loses  one  unit  of  charge,  so  it  has  a  small  negative  current 
into  it. 

Finding  current  given  charge  example  2 

>  A  charged  balloon  falls  to  the  ground,  and  its  charge  begins 
leaking  off  to  the  Earth.  Suppose  that  the  charge  on  the  balloon 
is  given  by  q  =  ae~bt.  Find  the  current  as  a  function  of  time,  and 
interpret  the  answer. 

>  Taking  the  derivative,  we  have 


=  -abe  bt 

An  exponential  function  approaches  zero  as  the  exponent  gets 
more  and  more  negative.  This  means  that  both  the  charge  and 
the  current  are  decreasing  in  magnitude  with  time.  It  makes  sense 
that  the  charge  approaches  zero,  since  the  balloon  is  losing  its 
charge.  It  also  makes  sense  that  the  current  is  decreasing  in 
magnitude,  since  charge  cannot  flow  at  the  same  rate  forever 
without  overshooting  zero. 

The  reverse  of  differentiation  is  integration,  so  if  we  know  the 
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current  as  a  function  of  time,  we  can  find  the  charge  by  integrating. 
Example  8  on  page  528  shows  such  a  calculation. 

It  may  seem  strange  to  say  that  a  negatively  charged  particle 
going  one  way  creates  a  current  going  the  other  way,  but  this  is 
quite  ordinary.  As  we  will  see,  currents  flow  through  metal  wires 
via  the  motion  of  electrons,  which  are  negatively  charged,  so  the 
direction  of  motion  of  the  electrons  in  a  circuit  is  always  opposite  to 
the  direction  of  the  current.  Of  course  it  would  have  been  convenient 
of  Benjamin  Franklin  had  defined  the  positive  and  negative  signs  of 
charge  the  opposite  way,  since  so  many  electrical  devices  are  based 
on  metal  wires. 

Number  of  electrons  flowing  through  a  lightbulb  example  3 

>  If  a  lightbulb  has  1.0  A  flowing  through  it,  how  many  electrons 
will  pass  through  the  filament  in  1 .0  s? 

>  We  are  only  calculating  the  number  of  electrons  that  flow,  so  we 
can  ignore  the  positive  and  negative  signs.  Also,  since  the  rate  of 
flow  is  constant,  we  don’t  really  need  to  think  in  terms  of  calculus; 
the  derivative  d qj  d t  that  defines  current  is  the  same  as  Aq/At  in 
this  situation.  Solving  for  Aq  =  I  At  gives  a  charge  of  1 .0  C  flowing 
in  this  time  interval.  The  number  of  electrons  is 

electrons 

number  of  electrons  =  coulombs  x - T — — 

coulomb 

,  .coulombs 

=  coulombs/  — j - 

electron 

=  1.0  C/e 
=  6.2  x  1018 


9.1.2  Circuits 

How  can  we  put  electric  currents  to  work?  The  only  method  of 
controlling  electric  charge  we  have  studied  so  far  is  to  charge  differ¬ 
ent  substances,  e.g.,  rubber  and  fur,  by  rubbing  them  against  each 
other.  Figure  c/1  shows  an  attempt  to  use  this  technique  to  light 
a  lightbulb.  This  method  is  unsatisfactory.  True,  current  will  flow 
through  the  bulb,  since  electrons  can  move  through  metal  wires,  and 
the  excess  electrons  on  the  rubber  rod  will  therefore  come  through 
the  wires  and  bulb  due  to  the  attraction  of  the  positively  charged 
fur  and  the  repulsion  of  the  other  electrons.  The  problem  is  that 
after  a  zillionth  of  a  second  of  current,  the  rod  and  fur  will  both  have 
run  out  of  charge.  No  more  current  will  flow,  and  the  lightbulb  will 
go  out. 

Figure  c/2  shows  a  setup  that  works.  The  battery  pushes  charge 
through  the  circuit,  and  recycles  it  over  and  over  again.  (We  will 
have  more  to  say  later  in  this  chapter  about  how  batteries  work.) 
This  is  called  a  complete  circuit.  Today,  the  electrical  use  of  the 
word  “circuit”  is  the  only  one  that  springs  to  mind  for  most  people, 
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c  / 1 .  Static  electricity  runs 
out  quickly.  2.  A  practical  circuit. 
3.  An  open  circuit.  4.  How  an 
ammeter  works.  5.  Measuring 
the  current  with  an  ammeter. 


but  the  original  meaning  was  to  travel  around  and  make  a  round 
trip,  as  when  a  circuit  court  judge  would  ride  around  the  boondocks, 
dispensing  justice  in  each  town  on  a  certain  date. 

Note  that  an  example  like  c/3  does  not  work.  The  wire  will 
quickly  begin  acquiring  a  net  charge,  because  it  has  no  way  to  get 
rid  of  the  charge  flowing  into  it.  The  repulsion  of  this  charge  will 
make  it  more  and  more  difficult  to  send  any  more  charge  in,  and 
soon  the  electrical  forces  exerted  by  the  battery  will  be  canceled 
out  completely.  The  whole  process  would  be  over  so  quickly  that 
the  filament  would  not  even  have  enough  time  to  get  hot  and  glow. 
This  is  known  as  an  open  circuit.  Exactly  the  same  thing  would 
happen  if  the  complete  circuit  of  figure  c/2  was  cut  somewhere  with 
a  pair  of  scissors,  and  in  fact  that  is  essentially  how  an  ordinary 
light  switch  works:  by  opening  up  a  gap  in  the  circuit. 

The  definition  of  electric  current  we  have  developed  has  the  great 
virtue  that  it  is  easy  to  measure.  In  practical  electrical  work,  one 
almost  always  measures  current,  not  charge.  The  instrument  used 
to  measure  current  is  called  an  ammeter.  A  simplified  ammeter,  c/4, 
simply  consists  of  a  coiled-wire  magnet  whose  force  twists  an  iron 
needle  against  the  resistance  of  a  spring.  The  greater  the  current, 
the  greater  the  force.  Although  the  construction  of  ammeters  may 
differ,  their  use  is  always  the  same.  We  break  into  the  path  of  the 
electric  current  and  interpose  the  meter  like  a  tollbooth  on  a  road, 
c/5.  There  is  still  a  complete  circuit,  and  as  far  as  the  battery  and 
bulb  are  concerned,  the  ammeter  is  just  another  segment  of  wire. 

Does  it  matter  where  in  the  circuit  we  place  the  ammeter?  Could 
we,  for  instance,  have  put  it  in  the  left  side  of  the  circuit  instead 
of  the  right?  Conservation  of  charge  tells  us  that  this  can  make  no 
difference.  Charge  is  not  destroyed  or  “used  up”  by  the  lightbulb, 
so  we  will  get  the  same  current  reading  on  either  side  of  it.  What  is 
“used  up”  is  energy  stored  in  the  battery,  which  is  being  converted 
into  heat  and  light  energy. 

9.1.3  Voltage 

The  volt  unit 

Electrical  circuits  can  be  used  for  sending  signals,  storing  infor¬ 
mation,  or  doing  calculations,  but  their  most  common  purpose  by 
far  is  to  manipulate  energy,  as  in  the  battery-and-bulb  example  of 
the  previous  section.  We  know  that  lightbulbs  are  rated  in  units  of 
watts,  i.e.,  how  many  joules  per  second  of  energy  they  can  convert 
into  heat  and  light,  but  how  would  this  relate  to  the  flow  of  charge  as 
measured  in  amperes?  By  way  of  analogy,  suppose  your  friend,  who 
didn’t  take  physics,  can’t  find  any  job  better  than  pitching  bales  of 
hay.  The  number  of  calories  he  burns  per  hour  will  certainly  depend 
on  how  many  bales  he  pitches  per  minute,  but  it  will  also  be  pro¬ 
portional  to  how  much  mechanical  work  he  has  to  do  on  each  bale. 
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If  his  job  is  to  toss  them  up  into  a  hayloft,  he  will  get  tired  a  lot 
more  quickly  than  someone  who  merely  tips  bales  off  a  loading  dock 
into  trucks.  In  metric  units, 

joules  haybales  joules 
second  second  haybale 

Similarly,  the  rate  of  energy  transformation  by  a  battery  will  not 
just  depend  on  how  many  coulombs  per  second  it  pushes  through  a 
circuit  but  also  on  how  much  mechanical  work  it  has  to  do  on  each 
coulomb  of  charge: 

joules  coulombs  joules 

second  second  coulomb 


or 


power  =  current  x  work  per  unit  charge. 


Units  of  joules  per  coulomb  are  abbreviated  as  volts,  1  V=1  J/C, 
named  after  the  Italian  physicist  Alessandro  Volta.  Everyone  knows 
that  batteries  are  rated  in  units  of  volts,  but  the  voltage  concept  is 
more  general  than  that;  it  turns  out  that  voltage  is  a  property  of 
every  point  in  space.  To  gain  more  insight,  let’s  think  more  carefully 
about  what  goes  on  in  the  battery  and  bulb  circuit. 

The  voltage  concept  in  general 

To  do  work  on  a  charged  particle,  the  battery  apparently  must  be 
exerting  forces  on  it.  How  does  it  do  this?  Well,  the  only  thing  that 
can  exert  an  electrical  force  on  a  charged  particle  is  another  charged 
particle.  It’s  as  though  the  haybales  were  pushing  and  pulling  each 
other  into  the  hayloft!  This  is  potentially  a  horribly  complicated 
situation.  Even  if  we  knew  how  much  excess  positive  or  negative 
charge  there  was  at  every  point  in  the  circuit  (which  realistically  we 
don’t)  we  would  have  to  calculate  zillions  of  forces  using  Coulomb’s 
law,  perform  all  the  vector  additions,  and  finally  calculate  how  much 
work  was  being  done  on  the  charges  as  they  moved  along.  To  make 
things  even  more  scary,  there  is  more  than  one  type  of  charged 
particle  that  moves:  electrons  are  what  move  in  the  wires  and  the 
bulb’s  filament,  but  ions  are  the  moving  charge  carriers  inside  the 
battery.  Luckily,  there  are  two  ways  in  which  we  can  simplify  things: 

The  situation  is  unchanging.  Unlike  the  imaginary  setup 
in  which  we  attempted  to  light  a  bulb  using  a  rubber  rod  and  a 
piece  of  fur,  this  circuit  maintains  itself  in  a  steady  state  (after 
perhaps  a  microsecond-long  period  of  settling  down  after  the 
circuit  is  first  assembled) .  The  current  is  steady,  and  as  charge 
flows  out  of  any  area  of  the  circuit  it  is  replaced  by  the  same 
amount  of  charge  flowing  in.  The  amount  of  excess  positive 
or  negative  charge  in  any  part  of  the  circuit  therefore  stays 
constant.  Similarly,  when  we  watch  a  river  flowing,  the  water 
goes  by  but  the  river  doesn’t  disappear. 
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d  /  Example  4. 


Force  depends  only  on  position.  Since  the  charge  distri¬ 
bution  is  not  changing,  the  total  electrical  force  on  a  charged 
particle  depends  only  on  its  own  charge  and  on  its  location. 
If  another  charged  particle  of  the  same  type  visits  the  same 
location  later  on,  it  will  feel  exactly  the  same  force. 

The  second  observation  tells  us  that  there  is  nothing  all  that 
different  about  the  experience  of  one  charged  particle  as  compared 
to  another’s.  If  we  single  out  one  particle  to  pay  attention  to,  and 
figure  out  the  amount  of  work  done  on  it  by  electrical  forces  as  it 
goes  from  point  A  to  point  B  along  a  certain  path,  then  this  is 
the  same  amount  of  work  that  will  be  done  on  any  other  charged 
particles  of  the  same  type  as  it  follows  the  same  path.  For  the  sake  of 
visualization,  let’s  think  about  the  path  that  starts  at  one  terminal 
of  the  battery,  goes  through  the  light  bulb’s  filament,  and  ends  at 
the  other  terminal.  When  an  object  experiences  a  force  that  depends 
only  on  its  position  (and  when  certain  other,  technical  conditions 
are  satisfied),  we  can  define  an  electrical  energy  associated  with  the 
position  of  that  object.  The  amount  of  work  done  on  the  particle 
by  electrical  forces  as  it  moves  from  A  to  B  equals  the  drop  in 
electrical  energy  between  A  and  B.  This  electrical  energy  is  what  is 
being  converted  into  other  forms  of  energy  such  as  heat  and  light. 
We  therefore  define  voltage  in  general  as  electrical  energy  per  unit 
charge: 

The  difference  in  voltage  between  two  points  in  space  is  defined 
as 

AF  =  A  Uelec/q, 

where  A  Ueiec  is  the  change  in  the  electrical  energy  of  a  particle  with 
charge  q  as  it  moves  from  the  initial  point  to  the  final  point. 

The  amount  of  power  dissipated  (i.e.,  rate  at  which  energy  is 
transformed  by  the  flow  of  electricity)  is  then  given  by  the  equation 

P  =  IAV. 


Energy  stored  in  a  battery  example  4 

>  The  1 .2  V  rechargeable  battery  in  figure  d  is  labeled  1 800  milliamp- 
hours.  What  is  the  maximum  amount  of  energy  the  battery  can 
store? 

>  An  ampere-hour  is  a  unit  of  current  multiplied  by  a  unit  of  time. 
Current  is  charge  per  unit  time,  so  an  ampere-hour  is  in  fact  a 
funny  unit  of  charge: 

(1  A)(1  hour)  =  (1  C/s)(3600  s) 

=  3600  C 


1800  milliamp-hours  is  therefore  1800  x  10~3  x  3600  C  =  6.5  x 
103  C.  That’s  a  huge  number  of  charged  particles,  but  the  total 
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loss  of  electrical  energy  will  just  be  their  total  charge  multiplied  by 
the  voltage  difference  across  which  they  move: 


AUeigQ  —  QAV 

=  (6.5  x  103  C)(1 .2  V) 
=  7.8  kJ 


Units  of  volt-amps  example  5 

>  Doorbells  are  often  rated  in  volt-amps.  What  does  this  combi¬ 
nation  of  units  mean? 

>  Current  times  voltage  gives  units  of  power,  P  =  IAV,  so  volt- 
amps  are  really  just  a  nonstandard  way  of  writing  watts.  They  are 
telling  you  how  much  power  the  doorbell  requires. 

Power  dissipated  by  a  battery  and  bulb  example  6 

>  If  a  9.0-volt  battery  causes  1 .0  A  to  flow  through  a  lightbulb,  how 
much  power  is  dissipated? 

>  The  voltage  rating  of  a  battery  tells  us  what  voltage  difference 
AV  it  is  designed  to  maintain  between  its  terminals. 


P=  I AV 
=  9.0  A  - V 


=  9.0 

=  9.0 
=  9.0 


C  J 
s  '  C 
J/s 
W 


The  only  nontrivial  thing  in  this  problem  was  dealing  with  the  units. 
One  quickly  gets  used  to  translating  common  combinations  like 
A  ■  V  into  simpler  terms. 
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Here  are  a  few  questions  and  answers  about  the  voltage  concept  . 
Question:  OK,  so  what  is  voltage,  really? 

Answer:  A  device  like  a  battery  has  positive  and  negative  charges 
inside  it  that  push  other  charges  around  the  outside  circuit.  A 
higher-voltage  battery  has  denser  charges  in  it,  which  will  do  more 
work  on  each  charged  particle  that  moves  through  the  outside  cir¬ 
cuit. 

To  use  a  gravitational  analogy,  we  can  put  a  paddlewheel  at  the 
bottom  of  either  a  tall  waterfall  or  a  short  one,  but  a  kg  of  water 
that  falls  through  the  greater  gravitational  energy  difference  will 
have  more  energy  to  give  up  to  the  paddlewheel  at  the  bottom. 

Question:  Why  do  we  define  voltage  as  electrical  energy  divided  by 
charge,  instead  of  just  defining  it  as  electrical  energy? 

Answer:  One  answer  is  that  it’s  the  only  definition  that  makes  the 
equation  P  =  IAV  work.  A  more  general  answer  is  that  we  want 
to  be  able  to  define  a  voltage  difference  between  any  two  points 
in  space  without  having  to  know  in  advance  how  much  charge  the 
particles  moving  between  them  will  have.  If  you  put  a  nine-volt 
battery  on  your  tongue,  then  the  charged  particles  that  move  across 
your  tongue  and  give  you  that  tingly  sensation  are  not  electrons  but 
ions,  which  may  have  charges  of  +e,  —  2e,  or  practically  anything. 
The  manufacturer  probably  expected  the  battery  to  be  used  mostly 
in  circuits  with  metal  wires,  where  the  charged  particles  that  flowed 
would  be  electrons  with  charges  of  — e.  If  the  ones  flowing  across 
your  tongue  happen  to  have  charges  of  — 2e,  the  electrical  energy 
difference  for  them  will  be  twice  as  much,  but  dividing  by  their 
charge  of  —  2e  in  the  definition  of  voltage  will  still  give  a  result  of  9 
V. 

Question:  Are  there  two  separate  roles  for  the  charged  particles  in 
the  circuit,  a  type  that  sits  still  and  exerts  the  forces,  and  another 
that  moves  under  the  influence  of  those  forces? 

Answer:  No.  Every  charged  particle  simultaneously  plays  both 
roles.  Newton’s  third  law  says  that  any  particle  that  has  an  electri¬ 
cal  force  acting  on  it  must  also  be  exerting  an  electrical  force  back  on 
the  other  particle.  There  are  no  “designated  movers”  or  “designated 
force-makers.” 

Question:  Why  does  the  definition  of  voltage  only  refer  to  voltage 
differences ? 

Answer:  It’s  perfectly  OK  to  define  voltage  as  V  =  Ueiec/q.  But 
recall  that  it  is  only  differences  in  interaction  energy,  U,  that  have 
direct  physical  meaning  in  physics.  Similarly,  voltage  differences  are 
really  more  useful  than  absolute  voltages.  A  voltmeter  measures 
voltage  differences,  not  absolute  voltages. 
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Discussion  Questions 

A  A  roller  coaster  is  sort  of  like  an  electric  circuit,  but  it  uses  gravitational 
forces  on  the  cars  instead  of  electric  ones.  What  would  a  high-voltage 
roller  coaster  be  like?  What  would  a  high-current  roller  coaster  be  like? 

B  Criticize  the  following  statements: 

“He  touched  the  wire,  and  1 0000  volts  went  through  him.” 

“That  battery  has  a  charge  of  9  volts.” 

“You  used  up  the  charge  of  the  battery.” 

C  When  you  touch  a  9-volt  battery  to  your  tongue,  both  positive  and 
negative  ions  move  through  your  saliva.  Which  ions  go  which  way? 

D  I  once  touched  a  piece  of  physics  apparatus  that  had  been  wired 
incorrectly,  and  got  a  several-thousand-volt  voltage  difference  across  my 
hand.  I  was  not  injured.  For  what  possible  reason  would  the  shock  have 
had  insufficient  power  to  hurt  me? 

9.1.4  Resistance 

Resistance 

So  far  we  have  simply  presented  it  as  an  observed  fact  that  a 
battery-and-bulb  circuit  quickly  settles  down  to  a  steady  flow,  but 
why  should  it?  Newton’s  second  law,  a  =  F/m ,  would  seem  to 
predict  that  the  steady  forces  on  the  charged  particles  should  make 
them  whip  around  the  circuit  faster  and  faster.  The  answer  is  that  as 
charged  particles  move  through  matter,  there  are  always  forces,  anal¬ 
ogous  to  frictional  forces,  that  resist  the  motion.  These  forces  need 
to  be  included  in  Newton’s  second  law,  which  is  really  a  =  Ftotal /m-< 
not  a  =  F/m.  If,  by  analogy,  you  push  a  crate  across  the  floor  at 
constant  speed,  i.e. ,  with  zero  acceleration,  the  total  force  on  it  must 
be  zero.  After  you  get  the  crate  going,  the  floor’s  frictional  force  is 
exactly  canceling  out  your  force.  The  chemical  energy  stored  in 
your  body  is  being  transformed  into  heat  in  the  crate  and  the  floor, 
and  no  longer  into  an  increase  in  the  crate’s  kinetic  energy.  Simi¬ 
larly,  the  battery’s  internal  chemical  energy  is  converted  into  heat, 
not  into  perpetually  increasing  the  charged  particles’  kinetic  energy. 
Changing  energy  into  heat  may  be  a  nuisance  in  some  circuits,  such 
as  a  computer  chip,  but  it  is  vital  in  a  lightbulb,  which  must  get  hot 
enough  to  glow.  Whether  we  like  it  or  not,  this  kind  of  heating  effect 
is  going  to  occur  any  time  charged  particles  move  through  matter. 

What  determines  the  amount  of  heating?  One  flashlight  bulb 
designed  to  work  with  a  9-volt  battery  might  be  labeled  1.0  watts, 
another  5.0.  How  does  this  work?  Even  without  knowing  the  details 
of  this  type  of  friction  at  the  atomic  level,  we  can  relate  the  heat 
dissipation  to  the  amount  of  current  that  flows  via  the  equation 
P  =  I AV.  If  the  two  flashlight  bulbs  can  have  two  different  values 
of  P  when  used  with  a  battery  that  maintains  the  same  AV,  it 
must  be  that  the  5.0-watt  bulb  allows  five  times  more  current  to 
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1854). 


flow  through  it. 

For  many  substances,  including  the  tungsten  from  which  light- 
bulb  filaments  are  made,  experiments  show  that  the  amount  of  cur¬ 
rent  that  will  flow  through  it  is  directly  proportional  to  the  voltage 
difference  placed  across  it.  For  an  object  made  of  such  a  substance, 
we  define  its  electrical  resistance  as  follows: 

If  an  object  inserted  in  a  circuit  displays  a  current  flow  which 
is  proportional  to  the  voltage  difference  across  it,  then  we  define  its 
resistance  as  the  constant  ratio 

R  =  AV/I. 

The  units  of  resistance  are  volts/ampere,  usually  abbreviated  as 
ohms,  symbolized  with  the  capital  Greek  letter  omega,  fh 

Resistance  of  a  lightbulb  example  7 

>  A  flashlight  bulb  powered  by  a  9-volt  battery  has  a  resistance  of 

10  O.  How  much  current  will  it  draw? 

>  Solving  the  definition  of  resistance  for  I,  we  find 


l  =  AV/R 
=  0.9  V/Q 
=  0.9  V/(V/A) 
=  0.9  A 


f  /  Four  objects  made  of  the 
same  substance  have  different 
resistances. 


Ohm’s  law  states  that  many  substances,  including  many  solids 
and  some  liquids,  display  this  kind  of  behavior,  at  least  for  voltages 
that  are  not  too  large.  The  fact  that  Ohm’s  law  is  called  a  “law” 
should  not  be  taken  to  mean  that  all  materials  obey  it,  or  that  it  has 
the  same  fundamental  importance  as  Newton’s  laws,  for  example. 
Materials  are  called  ohmic  or  nonohmic,  depending  on  whether  they 
obey  Ohm’s  law. 

If  objects  of  the  same  size  and  shape  made  from  two  different 
ohmic  materials  have  different  resistances,  we  can  say  that  one  ma¬ 
terial  is  more  resistive  than  the  other,  or  equivalently  that  it  is  less 
conductive.  Materials,  such  as  metals,  that  are  very  conductive  are 
said  to  be  good  conductors.  Those  that  are  extremely  poor  conduc¬ 
tors,  for  example  wood  or  rubber,  are  classified  as  insulators.  There 
is  no  sharp  distinction  between  the  two  classes  of  materials.  Some, 
such  as  silicon,  lie  midway  between  the  two  extremes,  and  are  called 
semiconductors . 

On  an  intuitive  level,  we  can  understand  the  idea  of  resistance 
by  making  the  sounds  “hhhhhh”  and  “ffffff.”  To  make  air  flow  out 
of  your  mouth,  you  use  your  diaphragm  to  compress  the  air  in  your 
chest.  The  pressure  difference  between  your  chest  and  the  air  outside 
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your  mouth  is  analogous  to  a  voltage  difference.  When  you  make  the 
“h”  sound,  you  form  your  mouth  and  throat  in  a  way  that  allows  air 
to  flow  easily.  The  large  flow  of  air  is  like  a  large  current.  Dividing 
by  a  large  current  in  the  definition  of  resistance  means  that  we  get 
a  small  resistance.  We  say  that  the  small  resistance  of  your  mouth 
and  throat  allows  a  large  current  to  flow.  When  you  make  the  “f” 
sound,  you  increase  the  resistance  and  cause  a  smaller  current  to 
flow. 

Note  that  although  the  resistance  of  an  object  depends  on  the 
substance  it  is  made  of,  we  cannot  speak  simply  of  the  “resistance 
of  gold”  or  the  “resistance  of  wood.”  Figure  f  shows  four  examples 
of  objects  that  have  had  wires  attached  at  the  ends  as  electrical 
connections.  If  they  were  made  of  the  same  substance,  they  would 
all  nevertheless  have  different  resistances  because  of  their  different 
sizes  and  shapes.  A  more  detailed  discussion  will  be  more  natural 
in  the  context  of  the  following  chapter,  but  it  should  not  be  too 
surprising  that  the  resistance  of  f/2  will  be  greater  than  that  of 
f/1  —  the  image  of  water  flowing  through  a  pipe,  however  incorrect, 
gives  us  the  right  intuition.  Object  f/3  will  have  a  smaller  resistance 
than  f/1  because  the  charged  particles  have  less  of  it  to  get  through. 

Superconductors 

All  materials  display  some  variation  in  resistance  according  to 
temperature  (a  fact  that  is  used  in  thermostats  to  make  a  ther¬ 
mometer  that  can  be  easily  interfaced  to  an  electric  circuit).  More 
spectacularly,  most  metals  have  been  found  to  exhibit  a  sudden 
change  to  zero  resistance  when  cooled  to  a  certain  critical  temper¬ 
ature.  They  are  then  said  to  be  superconductors.  Theoretically, 
superconductors  should  make  a  great  many  exciting  devices  possi¬ 
ble,  for  example  coiled-wire  magnets  that  could  be  used  to  levitate 
trains.  In  practice,  the  critical  temperatures  of  all  metals  are  very 
low,  and  the  resulting  need  for  extreme  refrigeration  has  made  their 
use  uneconomical  except  for  such  specialized  applications  as  particle 
accelerators  for  physics  research. 

But  scientists  have  recently  made  the  surprising  discovery  that 
certain  ceramics  are  superconductors  at  less  extreme  temperatures. 
The  technological  barrier  is  now  in  finding  practical  methods  for 
making  wire  out  of  these  brittle  materials.  Wall  Street  is  currently 
investing  billions  of  dollars  in  developing  superconducting  devices 
for  cellular  phone  relay  stations  based  on  these  materials.  In  2001, 
the  city  of  Copenhagen  replaced  a  short  section  of  its  electrical  power 
trunks  with  superconducing  cables,  and  they  are  now  in  operation 
and  supplying  power  to  customers. 

There  is  currently  no  satisfactory  theory  of  superconductivity  in 
general,  although  superconductivity  in  metals  is  understood  fairly 
well.  Unfortunately  I  have  yet  to  find  a  fundamental  explanation  of 


g/A  superconducting  seg¬ 
ment  of  the  ATLAS  accelerator 
at  Argonne  National  Laboratory 
near  Chicago.  It  is  used  to 
accelerate  beams  of  ions  to  a  few 
percent  of  the  speed  of  light  for 
nuclear  physics  research.  The 
shiny  silver-colored  surfaces  are 
made  of  the  element  niobium, 
which  is  a  superconductor  at 
relatively  high  temperatures  com¬ 
pared  to  other  metals  —  relatively 
high  meaning  the  temperature  of 
liquid  helium!  The  beam  of  ions 
passes  through  the  holes  in  the 
two  small  cylinders  on  the  ends 
of  the  curved  rods.  Charge  is 
shuffled  back  and  forth  between 
them  at  a  frequency  of  12  million 
cycles  per  second,  so  that  they 
take  turns  being  positive  and 
negative.  The  positively  charged 
beam  consists  of  short  spurts, 
each  timed  so  that  when  it  is 
in  one  of  the  segments  it  will 
be  pulled  forward  by  negative 
charge  on  the  cylinder  in  front 
of  it  and  pushed  forward  by  the 
positively  charged  one  behind. 
The  huge  currents  involved  would 
quickly  melt  any  metal  that  was 
not  superconducting,  but  in  a 
superconductor  they  produce  no 
heat  at  all. 
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superconductivity  in  metals  that  works  at  the  introductory  level. 

Finding  charge  given  current  example  8 

t>  In  the  segment  of  the  ATLAS  accelerator  shown  in  figure  g,  the 
current  flowing  back  and  forth  between  the  two  cylinders  is  given 
by  /  =  a  cos  bt.  What  is  the  charge  on  one  of  the  cylinders  as  a 
function  of  time?  >  We  are  given  the  current  and  want  to  find  the 
charge,  i.e. ,  we  are  given  the  derivative  and  we  want  to  find  the 
original  function  that  would  give  that  derivative.  This  means  we 
need  to  integrate: 


Idt 

a  cos  bt  df 


-  sin  bt  +  q0, 


where  q0  is  a  constant  of  integration. 

We  can  interpret  this  in  order  to  explain  why  a  superconductor 
needs  to  be  used.  The  constant  b  must  be  very  large,  since  the 
current  is  supposed  to  oscillate  back  and  forth  millions  of  times  a 
second.  Looking  at  the  final  result,  we  see  that  if  b  is  a  very  large 
number,  and  q  is  to  be  a  significant  amount  of  charge,  then  a  must 
be  a  very  large  number  as  well.  If  a  is  numerically  large,  then  the 
current  must  be  very  large,  so  it  would  heat  the  accelerator  too 
much  if  it  was  flowing  through  an  ordinary  conductor. 

Constant  voltage  throughout  a  conductor 

The  idea  of  a  superconductor  leads  us  to  the  question  of  how 
we  should  expect  an  object  to  behave  if  it  is  made  of  a  very  good 
conductor.  Superconductors  are  an  extreme  case,  but  often  a  metal 
wire  can  be  thought  of  as  a  perfect  conductor,  for  example  if  the 
parts  of  the  circuit  other  than  the  wire  are  made  of  much  less  con¬ 
ductive  materials.  What  happens  if  R  equals  zero  in  the  equation 
R  =  AV/I1  The  result  of  dividing  two  numbers  can  only  be  zero  if 
the  number  on  top  equals  zero.  This  tells  us  that  if  we  pick  any  two 
points  in  a  perfect  conductor,  the  voltage  difference  between  them 
must  be  zero.  In  other  words,  the  entire  conductor  must  be  at  the 
same  voltage. 
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Constant  voltage  means  that  no  work  would  be  done  on  a  charge 
as  it  moved  from  one  point  in  the  conductor  to  another.  If  zero  work 
was  done  only  along  a  certain  path  between  two  specific  points,  it 
might  mean  that  positive  work  was  done  along  part  of  the  path  and 
negative  work  along  the  rest,  resulting  in  a  cancellation.  But  there  is 
no  way  that  the  work  could  come  out  to  be  zero  for  all  possible  paths 
unless  the  electrical  force  on  a  charge  was  in  fact  zero  at  every  point. 
Suppose,  for  example,  that  you  build  up  a  static  charge  by  scuffing 
your  feet  on  a  carpet,  and  then  you  deposit  some  of  that  charge  onto 
a  doorknob,  which  is  a  good  conductor.  How  can  all  that  charge  be 
in  the  doorknob  without  creating  any  electrical  force  at  any  point 
inside  it?  The  only  possible  answer  is  that  the  charge  moves  around 
until  it  has  spread  itself  into  just  the  right  configuration  so  that  the 
forces  exerted  by  all  the  little  bits  of  excess  surface  charge  on  any 
charged  particle  within  the  doorknob  exactly  cancel  out. 

We  can  explain  this  behavior  if  we  assume  that  the  charge  placed 
on  the  doorknob  eventually  settles  down  into  a  stable  equilibrium. 
Since  the  doorknob  is  a  conductor,  the  charge  is  free  to  move  through 
it.  If  it  was  free  to  move  and  any  part  of  it  did  experience  a  nonzero 
total  force  from  the  rest  of  the  charge,  then  it  would  move,  and  we 
would  not  have  an  equilibrium. 

Excess  charge  placed  on  a  conductor,  once  it  reaches  its  equi¬ 
librium  configuration,  is  entirely  on  the  surface,  not  on  the  interior. 
This  should  be  intuitively  reasonable  in  figure  h,  for  example,  since 
the  charges  are  all  repelling  each  other.  A  proof  is  given  in  example 
36  on  p.  628. 

Since  wires  are  good  conductors,  constancy  of  voltage  through¬ 
out  a  conductor  provides  a  convenient  freedom  in  hooking  up  a 
voltmeter  to  a  circuit.  In  figure  i,  points  B  and  C  are  on  the  same 
piece  of  conducting  wire,  so  Vb  =  Vc-  Measuring  Vb  —  Va  gives  the 
same  result  as  measuring  Vc  —  Va- 


h  / 1 .  The  finger  deposits  charges 
on  the  solid,  spherical,  metal 
doorknob  and  is  then  withdrawn. 

2.  Almost  instantaneously,  the 
charges’  mutual  repulsion  makes 
them  redistribute  themselves  uni¬ 
formly  on  the  surface  of  the 
sphere.  The  only  excess  charge 
is  on  the  surface;  charges  do  ex¬ 
ist  in  the  atoms  that  form  the  in¬ 
terior  of  the  sphere,  but  they  are 
balanced.  Charges  on  the  interior 
feel  zero  total  electrical  force  from 
the  ones  at  the  surface.  Charges 
at  the  surface  experience  a  net 
outward  repulsion,  but  this  is  can¬ 
celed  out  by  the  force  that  keeps 
them  from  escaping  into  the  air. 

3.  A  voltmeter  shows  zero  dif¬ 
ference  in  voltage  between  any 
two  points  on  the  interior  or  sur¬ 
face  of  the  sphere.  If  the  volt¬ 
age  difference  wasn’t  zero,  then 
energy  could  be  released  by  the 
flow  of  charge  from  one  point  to 
the  other;  this  only  happens  be¬ 
fore  equilibrium  is  reached. 


i  /  The  voltmeter  doesn’t  care 
which  of  these  setups  you  use. 
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j  /  Example  9.  In  1  and  2, 
charges  that  are  visible  on  the 
front  surface  of  the  conductor  are 
shown  as  solid  dots;  the  others 
would  have  to  be  seen  through 
the  conductor,  which  we  imagine 
is  semi-transparent. 


k  /  Short-circuiting  a  battery. 
Warning:  you  can  burn  yourself 
this  way  or  start  a  fire!  If  you 
want  to  try  this,  try  making  the 
connection  only  very  briefly,  use 
a  low-voltage  battery,  and  avoid 
touching  the  battery  or  the  wire, 
both  of  which  will  get  hot. 


The  lightning  rod  example  9 

Suppose  you  have  a  pear-shaped  conductor  like  the  one  in  figure 
j/1 .  Since  the  pear  is  a  conductor,  there  are  free  charges  every¬ 
where  inside  it.  Panels  1  and  2  of  the  figure  show  a  computer  sim¬ 
ulation  with  100  identical  electric  charges.  In  1,  the  charges  are 
released  at  random  positions  inside  the  pear.  Repulsion  causes 
them  all  to  fly  outward  onto  the  surface  and  then  settle  down  into 
an  orderly  but  nonuniform  pattern. 

We  might  not  have  been  able  to  guess  the  pattern  in  advance,  but 
we  can  verify  that  some  of  its  features  make  sense.  For  example, 
charge  A  has  more  neighbors  on  the  right  than  on  the  left,  which 
would  tend  to  make  it  accelerate  off  to  the  left.  But  when  we 
look  at  the  picture  as  a  whole,  it  appears  reasonable  that  this  is 
prevented  by  the  larger  number  of  more  distant  charges  on  its  left 
than  on  its  right. 

There  also  seems  to  be  a  pattern  to  the  nonuniformity:  the  charges 
collect  more  densely  in  areas  like  B,  where  the  surface  is  strongly 
curved,  and  less  densely  in  flatter  areas  like  C. 

To  understand  the  reason  for  this  pattern,  consider  j/3.  Two  con¬ 
ducting  spheres  are  connected  by  a  conducting  wire.  Since  the 
whole  apparatus  is  conducting,  it  must  all  be  at  one  voltage.  As 
shown  in  problem  37  on  p.  556,  the  density  of  charge  is  greater 
on  the  smaller  sphere.  This  is  an  example  of  a  more  general  fact 
observed  in  j/2,  which  is  that  the  charge  on  a  conductor  packs 
itself  more  densely  in  areas  that  are  more  sharply  curved. 

Similar  reasoning  shows  why  Benjamin  Franklin  used  a  sharp  tip 
when  he  invented  the  lightning  rod.  The  charged  stormclouds  in¬ 
duce  positive  and  negative  charges  to  move  to  opposite  ends  of 
the  rod.  At  the  pointed  upper  end  of  the  rod,  the  charge  tends 
to  concentrate  at  the  point,  and  this  charge  attracts  the  light¬ 
ning.  The  same  effect  can  sometimes  be  seen  when  a  scrap 
of  aluminum  foil  is  inadvertently  put  in  a  microwave  oven.  Mod¬ 
ern  experiments  (Moore  etal.,  Journal  of  Applied  Meteorology  39 
(1999)  593)  show  that  although  a  sharp  tip  is  best  at  starting  a 
spark,  a  more  moderate  curve,  like  the  right-hand  tip  of  the  pear 
in  this  example,  is  better  at  successfully  sustaining  the  spark  for 
long  enough  to  connect  a  discharge  to  the  clouds. 

Short  circuits 

So  far  we  have  been  assuming  a  perfect  conductor.  What  if  it 
is  a  good  conductor,  but  not  a  perfect  one?  Then  we  can  solve  for 
Ay  =  I R.  An  ordinary-sized  current  will  make  a  very  small  result 
when  we  multiply  it  by  the  resistance  of  a  good  conductor  such  as 
a  metal  wire.  The  voltage  throughout  the  wire  will  then  be  nearly 
constant.  If,  on  the  other  hand,  the  current  is  extremely  large,  we 
can  have  a  significant  voltage  difference.  This  is  what  happens  in  a 
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short-circuit:  a  circuit  in  which  a  low- resistance  pathway  connects 
the  two  sides  of  a  voltage  source.  Note  that  this  is  much  more 
specific  than  the  popular  use  of  the  term  to  indicate  any  electrical 
malfunction  at  all.  If,  for  example,  you  short-circuit  a  9- volt  battery 
as  shown  in  figure  k,  you  will  produce  perhaps  a  thousand  amperes 
of  current,  leading  to  a  very  large  value  of  P  =  I AV.  The  wire  gets 
hot! 

self-check  A 

What  would  happen  to  the  battery  in  this  kind  of  short  circuit?  > 

Answer,  p.  965 

At  this  stage,  most  students  have  a  hard  time  understanding  why 
resistors  would  be  used  inside  a  radio  or  a  computer.  We  obviously 
want  a  lightbulb  or  an  electric  stove  to  have  a  circuit  element  that 
resists  the  flow  of  electricity  and  heats  up,  but  heating  is  undesirable 
in  radios  and  computers.  Without  going  too  far  afield,  let’s  use  a 
mechanical  analogy  to  get  a  general  idea  of  why  a  resistor  would  be 
used  in  a  radio. 

The  main  parts  of  a  radio  receiver  are  an  antenna,  a  tuner  for 
selecting  the  frequency,  and  an  amplifier  to  strengthen  the  signal  suf¬ 
ficiently  to  drive  a  speaker.  The  tuner  resonates  at  the  selected  fre¬ 
quency,  just  as  in  the  examples  of  mechanical  resonance  discussed  in 
3.  The  behavior  of  a  mechanical  resonator  depends  on  three  things: 
its  inertia,  its  stiffness,  and  the  amount  of  friction  or  damping.  The 
first  two  parameters  locate  the  peak  of  the  resonance  curve,  while 
the  damping  determines  the  width  of  the  resonance.  In  the  radio 
tuner  we  have  an  electrically  vibrating  system  that  resonates  at  a 
particular  frequency.  Instead  of  a  physical  object  moving  back  and 
forth,  these  vibrations  consist  of  electrical  currents  that  flow  first 
in  one  direction  and  then  in  the  other.  In  a  mechanical  system, 
damping  means  taking  energy  out  of  the  vibration  in  the  form  of 
heat,  and  exactly  the  same  idea  applies  to  an  electrical  system:  the 
resistor  supplies  the  damping,  and  therefore  controls  the  width  of 
the  resonance.  If  we  set  out  to  eliminate  all  resistance  in  the  tuner 
circuit,  by  not  building  in  a  resistor  and  by  somehow  getting  rid  of 
all  the  inherent  electrical  resistance  of  the  wires,  we  would  have  a 
useless  radio.  The  tuner’s  resonance  would  be  so  narrow  that  we 
could  never  get  close  enough  to  the  right  frequency  to  bring  in  the 
station.  The  roles  of  inertia  and  stiffness  are  played  by  other  circuit 
elements  we  have  not  discusses  (a  capacitor  and  a  coil). 

Resistors 

Inside  any  electronic  gadget  you  will  see  quite  a  few  little  circuit 
elements  like  the  one  shown  below.  These  resistors  are  simply  a 
cylinder  of  ohmic  material  with  wires  attached  to  the  end. 

Many  electrical  devices  are  based  on  electrical  resistance  and 
Ohm’s  law,  even  if  they  do  not  have  little  components  in  them  that 


color  meaning 
black  0 
brown  1 
red  2 
orange  3 
yellow  4 
green  5 
blue  6 
violet  7 
gray  8 
white  9 
silver  ±10% 
gold  ±5% 

I  /  Color  codes  used  on  resistors. 


silver 


blue 


1  brown 


1 


2  1  6  ±10% 
21x10  6  Q.  ±10% 


m  /  An  example  of  a  resistor 
with  a  color  code. 


— w — 

n/The  symbol  used  in  schemat¬ 
ics  to  represent  a  resistor. 
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look  like  the  usual  resistor.  The  following  are  some  examples. 
Lightbulb 

There  is  nothing  special  about  a  lightbulb  filament  —  you  can 
easily  make  a  lightbulb  by  cutting  a  narrow  waist  into  a  metallic 
gum  wrapper  and  connecting  the  wrapper  across  the  terminals  of 
a  9- volt  battery.  The  trouble  is  that  it  will  instantly  burn  out. 
Edison  solved  this  technical  challenge  by  encasing  the  filament  in 
an  evacuated  bulb,  which  prevented  burning,  since  burning  requires 
oxygen. 

Polygraph 

The  polygraph,  or  “lie  detector,”  is  really  just  a  set  of  meters 
for  recording  physical  measures  of  the  subject’s  psychological  stress, 
such  as  sweating  and  quickened  heartbeat.  The  real-time  sweat  mea¬ 
surement  works  on  the  principle  that  dry  skin  is  a  good  insulator, 
but  sweaty  skin  is  a  conductor.  Of  course  a  truthful  subject  may 
become  nervous  simply  because  of  the  situation,  and  a  practiced 
liar  may  not  even  break  a  sweat.  The  method’s  practitioners  claim 
that  they  can  tell  the  difference,  but  you  should  think  twice  before 
allowing  yourself  to  be  polygraph  tested.  Most  U.S.  courts  exclude 
all  polygraph  evidence,  but  some  employers  attempt  to  screen  out 
dishonest  employees  by  polygraph  testing  job  applicants,  an  abuse 
that  ranks  with  such  pseudoscience  as  handwriting  analysis. 

Fuse 

A  fuse  is  a  device  inserted  in  a  circuit  tollbooth-style  in  the  same 
manner  as  an  ammeter.  It  is  simply  a  piece  of  wire  made  of  metals 
having  a  relatively  low  melting  point.  If  too  much  current  passes 
through  the  fuse,  it  melts,  opening  the  circuit.  The  purpose  is  to 
make  sure  that  the  building’s  wires  do  not  carry  so  much  current 
that  they  themselves  will  get  hot  enough  to  start  a  fire.  Most  modern 
houses  use  circuit  breakers  instead  of  fuses,  although  fuses  are  still 
common  in  cars  and  small  devices.  A  circuit  breaker  is  a  switch 
operated  by  a  coiled-wire  magnet,  which  opens  the  circuit  when 
enough  current  flows.  The  advantage  is  that  once  you  turn  off  some 
of  the  appliances  that  were  sucking  up  too  much  current,  you  can 
immediately  flip  the  switch  closed.  In  the  days  of  fuses,  one  might 
get  caught  without  a  replacement  fuse,  or  even  be  tempted  to  stuff 
aluminum  foil  in  as  a  replacement,  defeating  the  safety  feature. 

Voltmeter 

A  voltmeter  is  nothing  more  than  an  ammeter  with  an  addi¬ 
tional  high-value  resistor  through  which  the  current  is  also  forced 
to  flow.  Ohm’s  law  states  that  the  current  through  the  resistor  is 
related  directly  to  the  voltage  difference  across  it,  so  the  meter  can 
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be  calibrated  in  units  of  volts  based  on  the  known  value  of  the  re¬ 
sistor.  The  voltmeter’s  two  probes  are  touched  to  the  two  locations 
in  a  circuit  between  which  we  wish  to  measure  the  voltage  differ¬ 
ence,  o/2.  Note  how  cumbersome  this  type  of  drawing  is,  and  how 
difficult  it  can  be  to  tell  what  is  connected  to  what.  This  is  why 
electrical  drawing  are  usually  shown  in  schematic  form.  Figure  o/3 
is  a  schematic  representation  of  figure  o/2. 

The  setups  for  measuring  current  and  voltage  are  different.  When 
we  are  measuring  current,  we  are  finding  “how  much  stuff  goes 
through,”  so  we  place  the  ammeter  where  all  the  current  is  forced 
to  go  through  it.  Voltage,  however,  is  not  “stuff  that  goes  through,” 
it  is  a  measure  of  electrical  energy.  If  an  ammeter  is  like  the  meter 
that  measures  your  water  use,  a  voltmeter  is  like  a  measuring  stick 
that  tells  you  how  high  a  waterfall  is,  so  that  you  can  determine  how 
much  energy  will  be  released  by  each  kilogram  of  falling  water.  We 
do  not  want  to  force  the  water  to  go  through  the  measuring  stick! 
The  arrangement  in  figure  o/3  is  a  parallel  circuit:  one  in  there  are 
“forks  in  the  road”  where  some  of  the  current  will  flow  one  way  and 
some  will  flow  the  other.  Figure  o/4  is  said  to  be  wired  in  series: 
all  the  current  will  visit  all  the  circuit  elements  one  after  the  other. 
We  will  deal  with  series  and  parallel  circuits  in  more  detail  in  the 
following  chapter. 

If  you  inserted  a  voltmeter  incorrectly,  in  series  with  the  bulb  and 
battery,  its  large  internal  resistance  would  cut  the  current  down  so 
low  that  the  bulb  would  go  out.  You  would  have  severely  disturbed 
the  behavior  of  the  circuit  by  trying  to  measure  something  about  it. 

Incorrectly  placing  an  ammeter  in  parallel  is  likely  to  be  even 
more  disconcerting.  The  ammeter  has  nothing  but  wire  inside  it  to 
provide  resistance,  so  given  the  choice,  most  of  the  current  will  flow 
through  it  rather  than  through  the  bulb.  So  much  current  will  flow 
through  the  ammeter,  in  fact,  that  there  is  a  danger  of  burning  out 
the  battery  or  the  meter  or  both!  For  this  reason,  most  ammeters 
have  fuses  or  circuit  breakers  inside.  Some  models  will  trip  their 
circuit  breakers  and  make  an  audible  alarm  in  this  situation,  while 
others  will  simply  blow  a  fuse  and  stop  working  until  you  replace  it. 

Discussion  Questions 

A  In  figure  o/l,  would  it  make  any  difference  in  the  voltage  measure¬ 
ment  if  we  touched  the  voltmeter’s  probes  to  different  points  along  the 
same  segments  of  wire? 

B  Explain  why  it  would  be  incorrect  to  define  resistance  as  the  amount 
of  charge  the  resistor  allows  to  flow. 


o/l.  A  simplified  diagram  of 
how  a  voltmeter  works.  2.  Mea¬ 
suring  the  voltage  difference 
across  a  lightbulb.  3.  The  same 
setup  drawn  in  schematic  form.  4. 
The  setup  for  measuring  current 
is  different. 
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9.1.5  Current-conducting  properties  of  materials 

Ohm’s  law  has  a  remarkable  property,  which  is  that  current  will 
flow  even  in  response  to  a  voltage  difference  that  is  as  small  as  we 
care  to  make  it.  In  the  analogy  of  pushing  a  crate  across  a  floor,  it 
is  as  though  even  a  flea  could  slide  the  crate  across  the  floor,  albeit 
at  some  very  low  speed.  The  flea  cannot  do  this  because  of  static 
friction,  which  we  can  think  of  as  an  effect  arising  from  the  tendency 
of  the  microscopic  bumps  and  valleys  in  the  crate  and  floor  to  lock 
together.  The  fact  that  Ohm’s  law  holds  for  nearly  all  solids  has 
an  interesting  interpretation:  at  least  some  of  the  electrons  are  not 
“locked  down”  at  all  to  any  specific  atom. 

More  generally  we  can  ask  how  charge  actually  flows  in  various 
solids,  liquids,  and  gases.  This  will  lead  us  to  the  explanations  of 
many  interesting  phenomena,  including  lightning,  the  bluish  crust 
that  builds  up  on  the  terminals  of  car  batteries,  and  the  need  for 
electrolytes  in  sports  drinks. 

Solids 

In  atomic  terms,  the  defining  characteristic  of  a  solid  is  that  its 
atoms  are  packed  together,  and  the  nuclei  cannot  move  very  far  from 
their  equilibrium  positions.  It  makes  sense,  then,  that  electrons,  not 
ions,  would  be  the  charge  carriers  when  currents  flow  in  solids.  This 
fact  was  established  experimentally  by  Tolrnan  and  Stewart,  in  an 
experiment  in  which  they  spun  a  large  coil  of  wire  and  then  abruptly 
stopped  it.  They  observed  a  current  in  the  wire  immediately  after 
the  coil  was  stopped,  which  indicated  that  charged  particles  that 
were  not  permanently  locked  to  a  specific  atom  had  continued  to 
move  because  of  their  own  inertia,  even  after  the  material  of  the 
wire  in  general  stopped.  The  direction  of  the  current  showed  that 
it  was  negatively  charged  particles  that  kept  moving.  The  current 
only  lasted  for  an  instant,  however;  as  the  negatively  charged  par¬ 
ticles  collected  at  the  downstream  end  of  the  wire,  farther  particles 
were  prevented  joining  them  due  to  their  electrical  repulsion,  as  well 
as  the  attraction  from  the  upstream  end,  which  was  left  with  a  net 
positive  charge.  Tolrnan  and  Stewart  were  even  able  to  determine 
the  mass-to-charge  ratio  of  the  particles.  We  need  not  go  into  the 
details  of  the  analysis  here,  but  particles  with  high  mass  would  be 
difficult  to  decelerate,  leading  to  a  stronger  and  longer  pulse  of  cur¬ 
rent,  while  particles  with  high  charge  would  feel  stronger  electrical 
forces  decelerating  them,  which  would  cause  a  weaker  and  shorter 
pulse.  The  mass-to-charge  ratio  thus  determined  was  consistent  with 
the  m/q  of  the  electron  to  within  the  accuracy  of  the  experiment, 
which  essentially  established  that  the  particles  were  electrons. 

The  fact  that  only  electrons  carry  current  in  solids,  not  ions,  has 
many  important  implications.  For  one  thing,  it  explains  why  wires 
don’t  fray  or  turn  to  dust  after  carrying  current  for  a  long  time. 
Electrons  are  very  small  (perhaps  even  pointlike),  and  it  is  easy  to 
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imagine  them  passing  between  the  cracks  among  the  atoms  without 
creating  holes  or  fractures  in  the  atomic  framework.  For  those  who 
know  a  little  chemistry,  it  also  explains  why  all  the  best  conductors 
are  on  the  left  side  of  the  periodic  table.  The  elements  in  that  area 
are  the  ones  that  have  only  a  very  loose  hold  on  their  outermost 
electrons. 

Gases 

The  molecules  in  a  gas  spend  most  of  their  time  separated  from 
each  other  by  significant  distances,  so  it  is  not  possible  for  them  to 
conduct  electricity  the  way  solids  do,  by  handing  off  electrons  from 
atom  to  atom.  It  is  therefore  not  surprising  that  gases  are  good 
insulators. 

Gases  are  also  usually  nonohmic.  As  opposite  charges  build  up 
on  a  stormcloud  and  the  ground  below,  the  voltage  difference  be¬ 
comes  greater  and  greater.  Zero  current  flows,  however,  until  finally 
the  voltage  reaches  a  certain  threshold  and  we  have  an  impressive 
example  of  what  is  known  as  a  spark  or  electrical  discharge.  If  air 
was  ohmic,  the  current  between  the  cloud  and  the  ground  would 
simply  increase  steadily  as  the  voltage  difference  increased,  rather 
than  being  zero  until  a  threshold  was  reached.  This  behavior  can  be 
explained  as  follows.  At  some  point,  the  electrical  forces  on  the  air 
electrons  and  nuclei  of  the  air  molecules  become  so  strong  that  elec¬ 
trons  are  ripped  right  off  of  some  of  the  molecules.  The  electrons 
then  accelerate  toward  either  the  cloud  or  the  ground,  whichever 
is  positively  charged,  and  the  positive  ions  accelerate  the  opposite 
way.  As  these  charge  carriers  accelerate,  they  strike  and  ionize  other 
molecules,  which  produces  a  rapidly  growing  cascade. 

Liquids 

Molecules  in  a  liquid  are  able  to  slide  past  each  other,  so  ions 
as  well  as  electrons  can  carry  currents.  Pure  water  is  a  poor  con¬ 
ductor  because  the  water  molecules  tend  to  hold  onto  their  electrons 
strongly,  and  there  are  therefore  not  many  electrons  or  ions  available 
to  move.  Water  can  become  quite  a  good  conductor,  however,  with 
the  addition  of  even  a  small  amount  of  certain  substances  called 
electrolytes,  which  are  typically  salts.  For  example,  if  we  add  table 
salt,  NaCl,  to  water,  the  NaCl  molecules  dissolve  into  Na+  and  Cl- 
ions,  which  can  then  move  and  create  currents.  This  is  why  elec¬ 
tric  currents  can  flow  among  the  cells  in  our  bodies:  cellular  fluid  is 
quite  salty.  When  we  sweat,  we  lose  not  just  water  but  electrolytes, 
so  dehydration  plays  havoc  with  our  cells’  electrical  systems.  It  is 
for  this  reason  that  electrolytes  are  included  in  sports  drinks  and 
formulas  for  rehydrating  infants  who  have  diarrhea. 

Since  current  flow  in  liquids  involves  entire  ions,  it  is  not  sur¬ 
prising  that  we  can  see  physical  evidence  when  it  has  occurred.  For 
example,  after  a  car  battery  has  been  in  use  for  a  while,  the  H2SO4 
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battery  acid  becomes  depleted  of  hydrogen  ions,  which  are  the  main 
charge  carriers  that  complete  the  circuit  on  the  inside  of  the  bat¬ 
tery.  The  leftover  SO4  then  forms  a  visible  blue  crust  on  the  battery 
posts. 

Speed  of  currents  and  electrical  signals 

When  I  talk  on  the  phone  to  my  mother  in  law  two  thousand 
miles  away,  I  do  not  notice  any  delay  while  the  signal  makes  its  way 
back  and  forth.  Electrical  signals  therefore  must  travel  very  quickly, 
but  how  fast  exactly?  The  answer  is  rather  subtle.  For  the  sake 
of  concreteness,  let’s  restrict  ourselves  to  currents  in  metals,  which 
consist  of  electrons. 

The  electrons  themselves  are  only  moving  at  speeds  of  perhaps 
a  few  thousand  miles  per  hour,  and  their  motion  is  mostly  random 
thermal  motion.  This  shows  that  the  electrons  in  my  phone  cannot 
possibly  be  zipping  back  and  forth  between  California  and  New  York 
fast  enough  to  carry  the  signals.  Even  if  their  thousand- mile-an- hour 
motion  was  organized  rather  than  random,  it  would  still  take  them 
many  minutes  to  get  there.  Realistically,  it  will  take  the  average 
electron  even  longer  than  that  to  make  the  trip.  The  current  in  the 
wire  consists  only  of  a  slow  overall  drift,  at  a  speed  on  the  order 
of  a  few  centimeters  per  second,  superimposed  011  the  more  rapid 
random  motion.  We  can  compare  this  with  the  slow  westward  drift 
in  the  population  of  the  U.S.  If  we  could  make  a  movie  of  the  motion 
of  all  the  people  in  the  U.S.  from  outer  space,  and  could  watch  it  at 
high  speed  so  that  the  people  appeared  to  be  scurrying  around  like 
ants,  we  would  think  that  the  motion  was  fairly  random,  and  we 
would  not  immediately  notice  the  westward  drift.  Only  after  many 
years  would  we  realize  that  the  number  of  people  heading  west  over 
the  Sierras  had  exceeded  the  number  going  east,  so  that  California 
increased  its  share  of  the  country’s  population. 

So  why  are  electrical  signals  so  fast  if  the  average  drift  speed  of 
electrons  is  so  slow?  The  answer  is  that  a  disturbance  in  an  electrical 
system  can  move  much  more  quickly  than  the  charges  themselves. 
It  is  as  though  we  filled  a  pipe  with  golf  balls  and  then  inserted  an 
extra  ball  at  one  end,  causing  a  ball  to  fall  out  at  the  other  end. 
The  force  propagated  to  the  other  end  in  a  fraction  of  a  second,  but 
the  balls  themselves  only  traveled  a  few  centimeters  in  that  time. 

Because  the  reality  of  current  conduction  is  so  complex,  we  often 
describe  things  using  mental  shortcuts  that  are  technically  incorrect. 
This  is  OK  as  long  as  we  know  that  they  are  just  shortcuts.  For 
example,  suppose  the  presidents  of  France  and  Russia  shake  hands, 
and  the  French  politician  has  inadvertently  picked  up  a  positive  elec¬ 
trical  charge,  which  shocks  the  Russian.  We  may  say  that  the  excess 
positively  charged  particles  in  the  French  leader’s  body,  which  all 
repel  each  other,  take  the  handshake  as  an  opportunity  to  get  far¬ 
ther  apart  by  spreading  out  into  two  bodies  rather  than  one.  In 
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reality,  it  would  be  a  matter  of  minutes  before  the  ions  in  one  per¬ 
son’s  body  could  actually  drift  deep  into  the  other’s.  What  really 
happens  is  that  throughout  the  body  of  the  recipient  of  the  shock 
there  are  already  various  positive  and  negative  ions  which  are  free 
to  move.  Even  before  the  perpetrator’s  charged  hand  touches  the 
victim’s  sweaty  palm,  the  charges  in  the  shocker’s  body  begin  to 
repel  the  positive  ions  and  attract  the  negative  ions  in  the  other 
person.  The  split-second  sensation  of  shock  is  caused  by  the  sudden 
jumping  of  the  victim’s  ions  by  distances  of  perhaps  a  micrometer, 
this  effect  occurring  simultaneously  throughout  the  whole  body,  al¬ 
though  more  violently  in  the  hand  and  arm,  which  are  closer  to  the 
other  person. 
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9.2  Parallel  and  Series  Circuits 


In  section  9.1,  we  limited  ourselves  to  relatively  simple  circuits,  es¬ 
sentially  nothing  more  than  a  battery  and  a  single  lightbulb.  The 
purpose  of  this  chapter  is  to  introduce  you  to  more  complex  circuits, 
containing  multiple  resistors  or  voltage  sources  in  series,  in  parallel, 
or  both. 

9.2.1  Schematics 

I  see  a  chess  position;  Kasparov  sees  an  interesting  Ruy  Lopez 
variation.  To  the  uninitiated  a  schematic  may  look  as  unintelligible 
as  Mayan  hieroglyphs,  but  even  a  little  bit  of  eye  training  can  go  a 
long  way  toward  making  its  meaning  leap  off  the  page.  A  schematic 
is  a  stylized  and  simplified  drawing  of  a  circuit.  The  purpose  is  to 
eliminate  as  many  irrelevant  features  as  possible,  so  that  the  relevant 
ones  are  easier  to  pick  out. 


a  / 1 .  Wrong:  The  shapes  of  the 
wires  are  irrelevant.  2.  Wrong: 
Right  angles  should  be  used.  3. 
Wrong:  A  simple  pattern  is  made 
to  look  unfamiliar  and  compli¬ 
cated.  4.  Right. 


3  r^-Sbri 


b  /  The  two  shaded  areas 
shaped  like  the  letter  “E”  are  both 
regions  of  constant  voltage. 


An  example  of  an  irrelevant  feature  is  the  physical  shape,  length, 
and  diameter  of  a  wire.  In  nearly  all  circuits,  it  is  a  good  approxi¬ 
mation  to  assume  that  the  wires  are  perfect  conductors,  so  that  any 
piece  of  wire  uninterrupted  by  other  components  has  constant  volt¬ 
age  throughout  it.  Changing  the  length  of  the  wire,  for  instance, 
does  not  change  this  fact.  (Of  course  if  we  used  miles  and  miles 
of  wire,  as  in  a  telephone  line,  the  wire’s  resistance  would  start  to 
add  up,  and  its  length  would  start  to  matter.)  The  shapes  of  the 
wires  are  likewise  irrelevant,  so  we  draw  them  with  standardized, 
stylized  shapes  made  only  of  vertical  and  horizontal  lines  with  right- 
angle  bends  in  them.  This  has  the  effect  of  making  similar  circuits 
look  more  alike  and  helping  us  to  recognize  familiar  patterns,  just 
as  words  in  a  newspaper  are  easier  to  recognize  than  handwritten 
ones.  Figure  a  shows  some  examples  of  these  concepts. 

The  most  important  first  step  in  learning  to  read  schematics  is 
to  learn  to  recognize  contiguous  pieces  of  wire  which  must  have  con¬ 
stant  voltage  throughout.  In  figure  b,  for  example,  the  two  shaded 
E-shaped  pieces  of  wire  must  each  have  constant  voltage.  This  fo¬ 
cuses  our  attention  on  two  of  the  main  unknowns  we’d  like  to  be 
able  to  predict:  the  voltage  of  the  left-hand  E  and  the  voltage  of 
the  one  on  the  right. 
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9.2.2  Parallel  resistances  and  the  junction  rule 

One  of  the  simplest  examples  to  analyze  is  the  parallel  resistance 
circuit,  of  which  figure  b  was  an  example.  In  general  we  may  have 
unequal  resistances  R\  and  i?2,  as  in  c/1.  Since  there  are  only  two 
constant- voltage  areas  in  the  circuit,  c/2,  all  three  components  have 
the  same  voltage  difference  across  them.  A  battery  normally  suc¬ 
ceeds  in  maintaining  the  voltage  differences  across  itself  for  which  it 
was  designed,  so  the  voltage  drops  A  Vi  and  AV2  across  the  resistors 
must  both  equal  the  voltage  of  the  battery: 

A I  1  =  AV2  =  A  Vbattery 

Each  resistance  thus  feels  the  same  voltage  difference  as  if  it  was 
the  only  one  in  the  circuit,  and  Ohm’s  law  tells  us  that  the  amount 
of  current  flowing  through  each  one  is  also  the  same  as  it  would 
have  been  in  a  one-resistor  circuit.  This  is  why  household  electrical 
circuits  are  wired  in  parallel.  We  want  every  appliance  to  work 
the  same,  regardless  of  whether  other  appliances  are  plugged  in  or 
unplugged,  turned  on  or  switched  off.  (The  electric  company  doesn’t 
use  batteries  of  course,  but  our  analysis  would  be  the  same  for  any 
device  that  maintains  a  constant  voltage.) 


1  Ri 

| — VW — | 

r2 

— vw — 


I 


— vw — 

— VW— 

Lll^ 


3 


— VW — 

R2 

— vw — 


c/1.  Two  resistors  in  parallel. 
2.  There  are  two  constant-voltage 
areas.  3.  The  current  that  comes 
out  of  the  battery  splits  between 
the  two  resistors,  and  later  re¬ 
unites.  4.  The  two  resistors  in 
parallel  can  be  treated  as  a  single 
resistor  with  a  smaller  resistance 
value. 


Of  course  the  electric  company  can  tell  when  we  turn  on  every 
light  in  the  house.  How  do  they  know?  The  answer  is  that  we  draw 
more  current.  Each  resistance  draws  a  certain  amount  of  current, 
and  the  amount  that  has  to  be  supplied  is  the  sum  of  the  two  indi¬ 
vidual  currents.  The  current  is  like  a  river  that  splits  in  half,  c/3, 
and  then  reunites.  The  total  current  is 


I  total  / 1  +  h- 


This  is  an  example  of  a  general  fact  called  the  junction  rule: 

In  any  circuit  that  is  not  storing  or  releasing  charge,  conservation 
of  charge  implies  that  the  total  current  flowing  out  of  any  junction 
must  be  the  same  as  the  total  flowing  in. 

Coming  back  to  the  analysis  of  our  circuit,  we  apply  Ohm’s  law 
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to  each  resistance,  resulting  in 


Itotal  =  AV/Ri  +  AV/R-2 


=  AV 


1  1 

R~i  +  R2 


As  far  as  the  electric  company  is  concerned,  your  whole  house 
is  just  one  resistor  with  some  resistance  R,  called  the  equivalent 
resistance.  They  would  write  Ohm’s  law  as 


R.otal  —  AV /  R, 


from  which  we  can  determine  the  equivalent  resistance  by  compari¬ 
son  with  the  previous  expression: 


l/R 


1  1 

Ri  +  R 2 


R  = 


ik+k) 


[equivalent  resistance  of  two  resistors  in  parallel] 

Two  resistors  in  parallel,  c/4,  are  equivalent  to  a  single  resistor  with 
a  value  given  by  the  above  equation. 

Two  lamps  on  the  same  household  circuit  example  1 0 

t>  You  turn  on  two  lamps  that  are  on  the  same  household  circuit. 
Each  one  has  a  resistance  of  1  ohm.  What  is  the  equivalent  re¬ 
sistance,  and  how  does  the  power  dissipation  compare  with  the 
case  of  a  single  lamp? 

>  The  equivalent  resistance  of  the  two  lamps  in  parallel  is 


_  ,  1  1 
R  =  (  —  +  — 
R i  R2 


1  1 
+ 


1  a  1  a 

1  a^1  + 1  rr 

-1 


-1 


=  (2  a-1 
=  0.5  a 


The  voltage  difference  across  the  whole  circuit  is  always  the  110 
V  set  by  the  electric  company  (it’s  alternating  current,  but  that’s 
irrelevant).  The  resistance  of  the  whole  circuit  has  been  cut  in 
half  by  turning  on  the  second  lamp,  so  a  fixed  amount  of  voltage 
will  produce  twice  as  much  current.  Twice  the  current  flowing 
across  the  same  voltage  difference  means  twice  as  much  power 
dissipation,  which  makes  sense. 
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The  cutting  in  half  of  the  resistance  surprises  many  students, 
since  we  are  “adding  more  resistance”  to  the  circuit  by  putting  in 
the  second  lamp.  Why  does  the  equivalent  resistance  come  out  to  be 
less  than  the  resistance  of  a  single  lamp?  This  is  a  case  where  purely 
verbal  reasoning  can  be  misleading.  A  resistive  circuit  element,  such 
as  the  filament  of  a  lightbulb,  is  neither  a  perfect  insulator  nor 
a  perfect  conductor.  Instead  of  analyzing  this  type  of  circuit  in 
terms  of  “resistors,”  i.e.,  partial  insulators,  we  could  have  spoken  of 
“conductors.”  This  example  would  then  seem  reasonable,  since  we 
“added  more  conductance,”  but  one  would  then  have  the  incorrect 
expectation  about  the  case  of  resistors  in  series,  discussed  in  the 
following  section. 

Perhaps  a  more  productive  way  of  thinking  about  it  is  to  use 
mechanical  intuition.  By  analogy,  your  nostrils  resist  the  flow  of 
air  through  them,  but  having  two  nostrils  makes  it  twice  as  easy  to 
breathe. 

Three  resistors  in  parallel  example  1 1 

>  What  happens  if  we  have  three  or  more  resistors  in  parallel? 

>  This  is  an  important  example,  because  the  solution  involves 
an  important  technique  for  understanding  circuits:  breaking  them 
down  into  smaller  parts  and  them  simplifying  those  parts.  In  the 
circuit  d/1 ,  with  three  resistors  in  parallel,  we  can  think  of  two  of 
the  resistors  as  forming  a  single  big  resistor,  d/2,  with  equivalent 
resistance 


We  can  then  simplify  the  circuit  as  shown  in  d/3,  so  that  it  con¬ 
tains  only  two  resistances.  The  equivalent  resistance  of  the  whole 
circuit  is  then  given  by 


Substituting  for  2  and  simplifying,  we  find  the  result 

fli23 = (i + i + i)  ■ 

which  you  probably  could  have  guessed.  The  interesting  point 
here  is  the  divide-and-conquer  concept,  not  the  mathematical  re¬ 
sult. 

An  arbitrary  number  of  identical  resistors  in  parallel  example  12 

>  What  is  the  resistance  of  N  identical  resistors  in  parallel? 

>  Generalizing  the  results  for  two  and  three  resistors,  we  have 
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e  /  Uniting  four  resistors  in 
parallel  is  equivalent  to  making 
a  single  resistor  with  the  same 
length  but  four  times  the  cross- 
sectional  area.  The  result  is  to 
make  a  resistor  with  one  quarter 
the  resistance. 


where  means  that  the  sum  includes  all  the  resistors.  If  all  the 
resistors  are  identical,  this  becomes 


R 

N 


Dependence  of  resistance  on  cross-sectional  area  example  13 
We  have  alluded  briefly  to  the  fact  that  an  object’s  electrical  re¬ 
sistance  depends  on  its  size  and  shape,  but  now  we  are  ready 
to  begin  making  more  mathematical  statements  about  it.  As  sug¬ 
gested  by  figure  e,  increasing  a  resistors’s  cross-sectional  area  is 
equivalent  to  adding  more  resistors  in  parallel,  which  will  lead  to 
an  overall  decrease  in  resistance.  Any  real  resistor  with  straight, 
parallel  sides  can  be  sliced  up  into  a  large  number  of  pieces,  each 
with  cross-sectional  area  of,  say,  1  pm2.  The  number,  N,  of  such 
slices  is  proportional  to  the  total  cross-sectional  area  of  the  resis¬ 
tor,  and  by  application  of  the  result  of  the  previous  example  we 
therefore  find  that  the  resistance  of  an  object  is  inversely  propor¬ 
tional  to  its  cross-sectional  area. 


f  /  A  fat  pipe  has  less  resistance 
than  a  skinny  pipe. 


-ww - (a)— 

- vw - 

R 


g  /  A  voltmeter  is  really  an 
ammeter  with  an  internal  resistor. 
When  we  measure  the  voltage 
difference  across  a  resistor,  1 ,  we 
are  really  constructing  a  parallel 
resistance  circuit,  2. 


An  analogous  relationship  holds  for  water  pipes,  which  is  why 
high-flow  trunk  lines  have  to  have  large  cross-sectional  areas.  To 
make  lots  of  water  (current)  flow  through  a  skinny  pipe,  we’d  need 
an  impractically  large  pressure  (voltage)  difference. 

Incorrect  readings  from  a  voltmeter  example  14 

A  voltmeter  is  really  just  an  ammeter  with  an  internal  resistor,  and 
we  use  a  voltmeter  in  parallel  with  the  thing  that  we’re  trying  to 
measure  the  voltage  difference  across.  This  means  that  any  time 
we  measure  the  voltage  drop  across  a  resistor,  we’re  essentially 
putting  two  resistors  in  parallel.  The  ammeter  inside  the  voltmeter 
can  be  ignored  for  the  purpose  of  analyzing  what  how  current 
flows  in  the  circuit,  since  it  is  essentially  just  some  coiled-up  wire 
with  a  very  low  resistance. 
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Now  if  we  are  carrying  out  this  measurement  on  a  resistor  that  is 
part  of  a  larger  circuit,  we  have  changed  the  behavior  of  the  cir¬ 
cuit  through  our  act  of  measuring.  It  is  as  though  we  had  modified 
the  circuit  by  replacing  the  resistance  Ft  with  the  smaller  equiva¬ 
lent  resistance  of  Ft  and  Ftv  in  parallel.  It  is  for  this  reason  that 
voltmeters  are  built  with  the  largest  possible  internal  resistance. 
As  a  numerical  example,  if  we  use  a  voltmeter  with  an  internal 
resistance  of  1  MO.  to  measure  the  voltage  drop  across  a  one- 
ohm  resistor,  the  equivalent  resistance  is  0.999999  O,  which  is 
not  different  enough  to  make  any  difference.  But  if  we  tried  to  use 
the  same  voltmeter  to  measure  the  voltage  drop  across  a  2  -  MD. 
resistor,  we  would  be  reducing  the  resistance  of  that  part  of  the 
circuit  by  a  factor  of  three,  which  would  produce  a  drastic  change 
in  the  behavior  of  the  whole  circuit. 

This  is  the  reason  why  you  can’t  use  a  voltmeter  to  measure  the 
voltage  difference  between  two  different  points  in  mid-air,  or  between 
the  ends  of  a  piece  of  wood.  This  is  by  no  means  a  stupid  thing  to 
want  to  do,  since  the  world  around  us  is  not  a  constant-voltage 
environment,  the  most  extreme  example  being  when  an  electrical 
storm  is  brewing.  But  it  will  not  work  with  an  ordinary  voltmeter 
because  the  resistance  of  the  air  or  the  wood  is  many  gigaohms.  The 
effect  of  waving  a  pair  of  voltmeter  probes  around  in  the  air  is  that 
we  provide  a  reuniting  path  for  the  positive  and  negative  charges 
that  have  been  separated  —  through  the  voltmeter  itself,  which  is 
a  good  conductor  compared  to  the  air.  This  reduces  to  zero  the 
voltage  difference  we  were  trying  to  measure. 

In  general,  a  voltmeter  that  has  been  set  up  with  an  open  circuit 
(or  a  very  large  resistance)  between  its  probes  is  said  to  be  “float¬ 
ing.”  An  old-fashioned  analog  voltmeter  of  the  type  described  here 
will  read  zero  when  left  floating,  the  same  as  when  it  was  sitting 
on  the  shelf.  A  floating  digital  voltmeter  usually  shows  an  error 
message. 

9.2.3  Series  resistances 

The  two  basic  circuit  layouts  are  parallel  and  series,  so  a  pair  of 
resistors  in  series,  h/1,  is  another  of  the  most  basic  circuits  we  can 
make.  By  conservation  of  charge,  all  the  current  that  flows  through 
one  resistor  must  also  flow  through  the  other  (as  well  as  through  the 
battery) : 

h  =  h- 

The  only  way  the  information  about  the  two  resistance  values  is 
going  to  be  useful  is  if  we  can  apply  Ohm’s  law,  which  will  relate  the 
resistance  of  each  resistor  to  the  current  flowing  through  it  and  the 
voltage  difference  across  it.  Figure  h/2  shows  the  three  constant- 
voltage  areas.  Voltage  differences  are  more  physically  significant 
than  voltages,  so  we  define  symbols  for  the  voltage  differences  across 


-AW - WA- 

- 1 1 - 

battery 


-AW - WA- 

- ll - 


av2 


i-AW - WA-i 


^  battery 


b - d  b - d 


vrW\A-LLAWM 


h/1.  A  battery  drives  current 
through  two  resistors  in  series.  2. 
There  are  three  constant-voltage 
regions.  3.  The  three  voltage 
differences  are  related.  4.  If 
the  meter  crab-walks  around  the 
circuit  without  flipping  over  or 
crossing  its  legs,  the  resulting 
voltages  have  plus  and  minus 
signs  that  make  them  add  up  to 
zero. 
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the  two  resistors  in  figure  h/3. 

We  have  three  constant-voltage  areas,  with  symbols  for  the  dif¬ 
ference  in  voltage  between  every  possible  pair  of  them.  These  three 
voltage  differences  must  be  related  to  each  other.  It  is  as  though  I 
tell  you  that  Fred  is  a  foot  taller  than  Ginger,  Ginger  is  a  foot  taller 
than  Sally,  and  Fred  is  two  feet  taller  than  Sally.  The  information 
is  redundant,  and  you  really  only  needed  two  of  the  three  pieces  of 
data  to  infer  the  third.  In  the  case  of  our  voltage  differences,  we 
have 

|AFi|  +  \AV2\  =  \AVbattery\- 

The  absolute  value  signs  are  because  of  the  ambiguity  in  how  we 
define  our  voltage  differences.  If  we  reversed  the  two  probes  of  the 
voltmeter,  we  would  get  a  result  with  the  opposite  sign.  Digital 
voltmeters  will  actually  provide  a  minus  sign  on  the  screen  if  the 
wire  connected  to  the  “V”  plug  is  lower  in  voltage  than  the  one 
connected  to  the  “COM”  plug.  Analog  voltmeters  pin  the  needle 
against  a  peg  if  you  try  to  use  them  to  measure  negative  voltages, 
so  you  have  to  fiddle  to  get  the  leads  connected  the  right  way,  and 
then  supply  any  necessary  minus  sign  yourself. 

Figure  h/4  shows  a  standard  way  of  taking  care  of  the  ambiguity 
in  signs.  For  each  of  the  three  voltage  measurements  around  the 
loop,  we  keep  the  same  probe  (the  darker  one)  on  the  clockwise 
side.  It  is  as  though  the  voltmeter  was  sidling  around  the  circuit 
like  a  crab,  without  ever  “crossing  its  legs.”  With  this  convention, 
the  relationship  among  the  voltage  drops  becomes 

AVi  +  AV2  =  —A  Vbattery, 

or,  in  more  symmetrical  form, 


AVl  +  AV2  +  AVbattery  —  0. 


More  generally,  this  is  known  as  the  loop  rule  for  analyzing  circuits: 

Assuming  the  standard  convention  for  plus  and  minus  signs,  the 
sum  of  the  voltage  drops  around  any  closed  loop  in  a  circuit  must 
be  zero. 

Looking  for  an  exception  to  the  loop  rule  would  be  like  asking 
for  a  hike  that  would  be  downhill  all  the  way  and  that  would  come 
back  to  its  starting  point! 

For  the  circuit  we  set  out  to  analyze,  the  equation 


AVl  +  AV2  +  A  Vbattery  —  0 
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can  now  be  rewritten  by  applying  Ohm’s  law  to  each  resistor: 

I\R\  +  I2R2  +  A Vbattery  =  0. 

The  currents  are  the  same,  so  we  can  factor  them  out: 

/  ( R\  +  R2)  +  A  Vbattery  =  0, 

and  this  is  the  same  result  we  would  have  gotten  if  we  had  been 
analyzing  a  one-resistor  circuit  with  resistance  R\  +  R.2-  Thus  the 
equivalent  resistance  of  resistors  in  series  equals  the  sum  of  their 
resistances. 

Two  lightbulbs  in  series  example  15 

>  If  two  identical  lightbulbs  are  placed  in  series,  how  do  their 
brightnesses  compare  with  the  brightness  of  a  single  bulb? 

>  Taken  as  a  whole,  the  pair  of  bulbs  act  like  a  doubled  resistance, 
so  they  will  draw  half  as  much  current  from  the  wall.  Each  bulb 
will  be  dimmer  than  a  single  bulb  would  have  been. 

The  total  power  dissipated  by  the  circuit  is  I A  V.  The  voltage  drop 
across  the  whole  circuit  is  the  same  as  before,  but  the  current  is 
halved,  so  the  two-bulb  circuit  draws  half  as  much  total  power  as 
the  one-bulb  circuit.  Each  bulb  draws  one-quarter  of  the  normal 
power. 

Roughly  speaking,  we  might  expect  this  to  result  in  one  quarter 
the  light  being  produced  by  each  bulb,  but  in  reality  lightbulbs 
waste  quite  a  high  percentage  of  their  power  in  the  form  of  heat 
and  wavelengths  of  light  that  are  not  visible  (infrared  and  ultravi¬ 
olet).  Less  light  will  be  produced,  but  it’s  hard  to  predict  exactly 
how  much  less,  since  the  efficiency  of  the  bulbs  will  be  changed 
by  operating  them  under  different  conditions. 

More  than  two  equal  resistances  in  series  example  16 

By  straightforward  application  of  the  divide-and-conquer  technique 
discussed  in  the  previous  section,  we  find  that  the  equivalent  re¬ 
sistance  of  N  identical  resistances  R  in  series  will  be  NR. 

Dependence  of  resistance  on  length  example  1 7 

In  the  previous  section,  we  proved  that  resistance  is  inversely 
proportional  to  cross-sectional  area.  By  equivalent  reason  about 
resistances  in  series,  we  find  that  resistance  is  proportional  to 
length.  Analogously,  it  is  harder  to  blow  through  a  long  straw  than 
through  a  short  one. 

Putting  the  two  arguments  together,  we  find  that  the  resistance 
of  an  object  with  straight,  parallel  sides  is  given  by 

R  =  (constant)  •  L/A 

The  proportionality  constant  is  called  the  resistivity,  and  it  depends 
only  on  the  substance  of  which  the  object  is  made.  A  resistivity 
measurement  could  be  used,  for  instance,  to  help  identify  a  sample 
of  an  unknown  substance. 


[ 


- t 

i  /  Example  15. 


j  /  Doubling  the  length  of  a 
resistor  is  like  putting  two  resis¬ 
tors  in  series.  The  resistance  is 
doubled. 
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Choice  of  high  voltage  for  power  lines  example  1 8 

Thomas  Edison  got  involved  in  a  famous  technological  contro¬ 
versy  over  the  voltage  difference  that  should  be  used  for  electrical 
power  lines.  At  this  time,  the  public  was  unfamiliar  with  electricity, 
and  easily  scared  by  it.  The  president  of  the  United  States,  for 
instance,  refused  to  have  electrical  lighting  in  the  White  House 
when  it  first  became  commercially  available  because  he  consid¬ 
ered  it  unsafe,  preferring  the  known  fire  hazard  of  oil  lamps  to 
the  mysterious  dangers  of  electricity.  Mainly  as  a  way  to  over¬ 
come  public  fear,  Edison  believed  that  power  should  be  transmit¬ 
ted  using  small  voltages,  and  he  publicized  his  opinion  by  giving 
demonstrations  at  which  a  dog  was  lured  into  position  to  be  killed 
by  a  large  voltage  difference  between  two  sheets  of  metal  on  the 
ground.  (Edison’s  opponents  also  advocated  alternating  current 
rather  than  direct  current,  and  AC  is  more  dangerous  than  DC  as 
well.  As  we  will  discuss  later,  AC  can  be  easily  stepped  up  and 
down  to  the  desired  voltage  level  using  a  device  called  a  trans¬ 
former.) 

Now  if  we  want  to  deliver  a  certain  amount  of  power  PL  to  a 
load  such  as  an  electric  lightbulb,  we  are  constrained  only  by  the 
equation  PL  =  IAVL.  We  can  deliver  any  amount  of  power  we 
wish,  even  with  a  low  voltage,  if  we  are  willing  to  use  large  cur¬ 
rents.  Modern  electrical  distribution  networks,  however,  use  dan¬ 
gerously  high  voltage  differences  of  tens  of  thousands  of  volts. 
Why  did  Edison  lose  the  debate? 

It  boils  down  to  money.  The  electric  company  must  deliver  the 
amount  of  power  PL  desired  by  the  customer  through  a  transmis¬ 
sion  line  whose  resistance  RT  is  fixed  by  economics  and  geogra¬ 
phy.  The  same  current  flows  through  both  the  load  and  the  trans¬ 
mission  line,  dissipating  power  usefully  in  the  former  and  waste- 
fully  in  the  latter.  The  efficiency  of  the  system  is 

.  power  paid  for  by  the  customer 

efficiency  =  - - — .  ,  -  - — , - - 

power  paid  for  by  the  utility 

Pl 

Pl  +  Pt 

1 

"  1  +  Pt/Pl 

Putting  ourselves  in  the  shoes  of  the  electric  company,  we  wish 
to  get  rid  of  the  variable  PT,  since  it  is  something  we  control  only 
indirectly  by  our  choice  of  AVT  and  /.  Substituting  PT  =  IAVT,  we 
find 

efficiency  =  jAVj 

1  +  Pl 

We  assume  the  transmission  line  (but  not  necessarily  the  load)  is 
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ohmic,  so  substituting  AVT  =  IRT  gives 

efficiency  =  — 

1  +  Ht 

'  +  Pl 

This  quantity  can  clearly  be  maximized  by  making  /  as  small  as 
possible,  since  we  will  then  be  dividing  by  the  smallest  possible 
quantity  on  the  bottom  of  the  fraction.  A  low-current  circuit  can 
only  deliver  significant  amounts  of  power  if  it  uses  high  voltages, 
which  is  why  electrical  transmission  systems  use  dangerous  high 
voltages. 

Getting  killed  by  your  ammeter  example  1 9 

As  with  a  voltmeter,  an  ammeter  can  give  erroneous  readings  if  it 
is  used  in  such  a  way  that  it  changes  the  behavior  the  circuit.  An 
ammeter  is  used  in  series,  so  if  it  is  used  to  measure  the  current 
through  a  resistor,  the  resistor’s  value  will  effectively  be  changed 
to  R+  Ra,  where  Ra  is  the  resistance  of  the  ammeter.  Ammeters 
are  designed  with  very  low  resistances  in  order  to  make  it  unlikely 
that  R+  Ra  will  be  significantly  different  from  R. 

In  fact,  the  real  hazard  is  death,  not  a  wrong  reading!  Virtually 
the  only  circuits  whose  resistances  are  significantly  less  than  that 
of  an  ammeter  are  those  designed  to  carry  huge  currents.  An 
ammeter  inserted  in  such  a  circuit  can  easily  melt.  When  I  was 
working  at  a  laboratory  funded  by  the  Department  of  Energy,  we 
got  periodic  bulletins  from  the  DOE  safety  office  about  serious  ac¬ 
cidents  at  other  sites,  and  they  held  a  certain  ghoulish  fascination. 
One  of  these  was  about  a  DOE  worker  who  was  completely  in¬ 
cinerated  by  the  explosion  created  when  he  inserted  an  ordinary 
Radio  Shack  ammeter  into  a  high-current  circuit.  Later  estimates 
showed  that  the  heat  was  probably  so  intense  that  the  explosion 
was  a  ball  of  plasma  —  a  gas  so  hot  that  its  atoms  have  been 
ionized. 
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A  complicated  circuit  example  20 

t>  All  seven  resistors  in  the  left-hand  panel  of  figure  k  are  identi¬ 
cal.  Initially,  the  switch  S  is  open  as  shown  in  the  figure,  and  the 
current  through  resistor  A  is  l0.  The  switch  is  then  closed.  Find 
the  current  through  resistor  B,  after  the  switch  is  closed,  in  terms 
of  /0. 

>  The  second  panel  shows  the  circuit  redrawn  for  simplicity,  in  the 
initial  condition  with  the  switch  open.  When  the  switch  is  open,  no 
current  can  flow  through  the  central  resistor,  so  we  may  as  well 
ignore  it.  I’ve  also  redrawn  the  junctions,  without  changing  what’s 
connected  to  what.  This  is  the  kind  of  mental  rearranging  that 
you’ll  eventually  learn  to  do  automatically  from  experience  with 
analyzing  circuits.  The  redrawn  version  makes  it  easier  to  see 
what’s  happening  with  the  current.  Charge  is  conserved,  so  any 
charge  that  flows  past  point  1  in  the  circuit  must  also  flow  past 
points  2  and  3.  This  would  have  been  harder  to  reason  about  by 
applying  the  junction  rule  to  the  original  version,  which  appears 
to  have  nine  separate  junctions. 

In  the  new  version,  it’s  also  clear  that  the  circuit  has  a  great  deal 
of  symmetry.  We  could  flip  over  each  parallel  pair  of  identical  re¬ 
sistors  without  changing  what’s  connected  to  what,  so  that  makes 
it  clear  that  the  voltage  drops  and  currents  must  be  equal  for  the 
members  of  each  pair.  We  can  also  prove  this  by  using  the  loop 
rule.  The  loop  rule  says  that  the  two  voltage  drops  in  loop  4  must 
be  equal,  and  similarly  for  loops  5  and  6.  Since  the  resistors  obey 
Ohm’s  law,  equal  voltage  drops  across  them  also  imply  equal  cur¬ 
rents.  That  means  that  when  the  current  at  point  1  comes  to  the 
top  junction,  exactly  half  of  it  goes  through  each  resistor.  Then 
the  current  reunites  at  2,  splits  between  the  next  pair,  and  so  on. 
We  conclude  that  each  of  the  six  resistors  in  the  circuit  experi¬ 
ences  the  same  voltage  drop  and  the  same  current.  Applying  the 
loop  rule  to  loop  7,  we  find  that  the  sum  of  the  three  voltage  drops 
across  the  three  left-hand  resistors  equals  the  battery’s  voltage, 
V,  so  each  resistor  in  the  circuit  experiences  a  voltage  drop  V/3. 
Letting  R  stand  for  the  resistance  of  one  of  the  resistors,  we  find 
that  the  current  through  resistor  B,  which  is  the  same  as  the  cur¬ 
rents  through  all  the  others,  is  given  by  l0  =  V/3 R. 

We  now  pass  to  the  case  where  the  switch  is  closed,  as  shown 
in  the  third  panel.  The  battery’s  voltage  is  the  same  as  before, 
and  each  resistor’s  resistance  is  the  same,  so  we  can  still  use  the 
same  symbols  V  and  R  for  them.  It  is  no  longer  true,  however, 
that  each  resistor  feels  a  voltage  drop  V/3.  The  equivalent  resis¬ 
tance  of  the  whole  circuit  is  R/2  +  R/3  +  R/2  =  4R/3,  so  the  total 
current  drawn  from  the  battery  is  3V/4R.  In  the  middle  group 
of  resistors,  this  current  is  split  three  ways,  so  the  new  current 
through  B  is  (1  /3)(3V/4R)  =  V/4R  =  3l0/4. 
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Interpreting  this  result,  we  see  that  it  comes  from  two  effects  that 
partially  cancel.  Closing  the  switch  reduces  the  equivalent  re¬ 
sistance  of  the  circuit  by  giving  charge  another  way  to  flow,  and 
increases  the  amount  of  current  drawn  from  the  battery.  Resis¬ 
tor  B,  however,  only  gets  a  1/3  share  of  this  greater  current,  not 
1/2.  The  second  effect  turns  out  to  be  bigger  than  the  first,  and 
therefore  the  current  through  resistor  B  is  lessened  over  all. 

Discussion  Question 

A  We  have  stated  the  loop  rule  in  a  symmetric  form  where  a  series 
of  voltage  drops  adds  up  to  zero.  To  do  this,  we  had  to  define  a  standard 
way  of  connecting  the  voltmeter  to  the  circuit  so  that  the  plus  and  minus 
signs  would  come  out  right.  Suppose  we  wish  to  restate  the  junction  rule 
in  a  similar  symmetric  way,  so  that  instead  of  equating  the  current  coming 
in  to  the  current  going  out,  it  simply  states  that  a  certain  sum  of  currents  at 
a  junction  adds  up  to  zero.  What  standard  way  of  inserting  the  ammeter 
would  we  have  to  use  to  make  this  work? 

This  chapter  is  summarized  on  page  999.  Notation  and  terminology 
are  tabulated  on  pages  984-985. 
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The  symbols  \/,  ^ ,  etc.  are  explained  on  page  557. 

1  In  a  wire  carrying  a  current  of  1.0  pA,  how  long  do  you  have 
to  wait,  on  the  average,  for  the  next  electron  to  pass  a  given  point? 
Express  your  answer  in  units  of  microseconds. 

>  Solution,  p.  979  - 

2  Referring  back  to  our  old  friend  the  neuron  from  problem  1 
on  page  510,  let’s  now  consider  what  happens  when  the  nerve  is 
stimulated  to  transmit  information.  When  the  blob  at  the  top  (the 
cell  body)  is  stimulated,  it  causes  Na+  ions  to  rush  into  the  top  of 
the  tail  (axon) .  This  electrical  pulse  will  then  travel  down  the  axon, 
like  a  flame  burning  down  from  the  end  of  a  fuse,  with  the  Na+  ions 
at  each  point  first  going  out  and  then  coming  back  in.  If  1010  Na+ 
ions  cross  the  cell  membrane  in  0.5  ms,  what  amount  of  current  is 
created? 

V  ■ 

3  If  a  typical  light  bulb  draws  about  900  nr  A  from  a  110-V 

household  circuit,  what  is  its  resistance?  (Don’t  worry  about  the 
fact  that  it’s  alternating  current.)  V 

4  (a)  Express  the  power  dissipated  by  a  resistor  in  terms  of  R 

and  AH  only,  eliminating  I.  V 

(b)  Electrical  receptacles  in  your  home  are  mostly  110  V,  but  cir¬ 
cuits  for  electric  stoves,  air  conditioners,  and  washers  and  driers  are 
usually  220  V.  The  two  types  of  circuits  have  differently  shaped  re¬ 
ceptacles.  Suppose  you  rewire  the  plug  of  a  drier  so  that  it  can  be 
plugged  in  to  a  110  V  receptacle.  The  resistor  that  forms  the  heat¬ 
ing  element  of  the  drier  would  normally  draw  200  W.  How  much 
power  does  it  actually  draw  now?  V  ^ 

5  Lightning  discharges  a  cloud  during  an  electrical  storm.  Sup¬ 
pose  that  the  current  in  the  lightning  bolt  varies  with  time  as  I  =  bt, 
where  b  is  a  constant.  Find  the  cloud’s  charge  as  a  function  of  time. 

V 


6  A  resistor  has  a  voltage  difference  AH  across  it,  causing  a 
current  /  to  flow. 

(a)  Find  an  equation  for  the  power  it  dissipates  as  heat  in  terms  of 

the  variables  I  and  R  only,  eliminating  AH.  V 

(b)  If  an  electrical  line  coming  to  your  house  is  to  carry  a  given 
amount  of  current,  interpret  your  equation  from  part  a  to  explain 
whether  the  wire’s  resistance  should  be  small,  or  large. 

7  In  AM  (amplitude- modulated)  radio,  an  audio  signal  f(t)  is 
multiplied  by  a  sine  wave  sin  cut  in  the  megahertz  frequency  range. 
For  simplicity,  let’s  imagine  that  the  transmitting  antenna  is  a  whip, 
and  that  charge  goes  back  and  forth  between  the  top  and  bottom. 
Suppose  that,  during  a  certain  time  interval,  the  audio  signal  varies 
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linearly  with  time,  giving  a  charge  q  =  (a  +  bt)  sin  cut  at  the  top  of 
the  whip  and  —q  at  the  bottom.  Find  the  current  as  a  function  of 
time.  V  ^ 

8  Problem  8  has  been  deleted.  B 

9  Use  the  result  of  problem  38  on  page  556  to  find  an  equation 
for  the  voltage  at  a  point  in  space  at  a  distance  r  from  a  point  charge 
Q.  (Take  your  V  =  0  distance  to  be  anywhere  you  like.)  V 

10  Today,  even  a  big  luxury  car  like  a  Cadillac  can  have  an 
electrical  system  that  is  relatively  low  in  power,  since  it  doesn’t 
need  to  do  much  more  than  run  headlights,  power  windows,  etc. 
In  the  near  future,  however,  manufacturers  plan  to  start  making 
cars  with  electrical  systems  about  five  times  more  powerful.  This 
will  allow  certain  energy-wasting  parts  like  the  water  pump  to  be 
run  on  electrical  motors  and  turned  off  when  they’re  not  needed 

—  currently  they’re  run  directly  on  shafts  from  the  motor,  so  they 
can’t  be  shut  off.  It  may  even  be  possible  to  make  an  engine  that 
can  shut  off  at  a  stoplight  and  then  turn  back  on  again  without 
cranking,  since  the  valves  can  be  electrically  powered.  Current  cars’ 
electrical  systems  have  12-volt  batteries  (with  14- volt  chargers),  but 
the  new  systems  will  have  36-volt  batteries  (with  42- volt  chargers). 

(a)  Suppose  the  battery  in  a  new  car  is  used  to  run  a  device  that 
requires  the  same  amount  of  power  as  the  corresponding  device  in 
the  old  car.  Based  on  the  sample  figures  above,  how  would  the 
currents  handled  by  the  wires  in  one  of  the  new  cars  compare  with 
the  currents  in  the  old  ones? 

(b)  The  real  purpose  of  the  greater  voltage  is  to  handle  devices  that 

need  more  power.  Can  you  guess  why  they  decided  to  change  to  36- 
volt  batteries  rather  than  increasing  the  power  without  increasing 
the  voltage?  V  ^ 

11  We  have  referred  to  resistors  dissipating  heat,  i.e.,  we  have 

assumed  that  P  =  IAV  is  always  greater  than  zero.  Could  IAV 
come  out  to  be  negative  for  a  resistor?  If  so,  could  one  make  a 
refrigerator  by  hooking  up  a  resistor  in  such  a  way  that  it  absorbed 
heat  instead  of  dissipating  it?  ■ 

12  What  resistance  values  can  be  created  by  combining  a  1  kfl 

resistor  and  a  10  kfl  resistor?  >  Solution,  p.  979 

13  The  figure  shows  a  circuit  containing  five  lightbulbs  con¬ 
nected  to  a  battery.  Suppose  you’re  going  to  connect  one  probe  of  a 
voltmeter  to  the  circuit  at  the  point  marked  with  a  dot.  How  many 
unique,  nonzero  voltage  differences  could  you  measure  by  connecting 
the  other  probe  to  other  wires  in  the  circuit? 

14  The  lightbulbs  in  the  figure  are  all  identical.  If  you  were 
inserting  an  ammeter  at  various  places  in  the  circuit,  how  many 
unique  currents  could  you  measure?  If  you  know  that  the  current 
measurement  will  give  the  same  number  in  more  than  one  place, 


Problem  8. 


Problems  13  and  14. 


Problems 


551 


Problem  16. 


only  count  that  as  one  unique  current. 

15  (a)  You  take  an  LP  record  out  of  its  sleeve,  and  it  acquires  a 

static  charge  of  1  nC.  You  play  it  at  the  normal  speed  of  33^  r.p.rn., 
and  the  charge  moving  in  a  circle  creates  an  electric  current.  What 
is  the  current,  in  amperes?  V 

(b)  Although  the  planetary  model  of  the  atom  can  be  made  to  work 
with  any  value  for  the  radius  of  the  electrons’  orbits,  more  advanced 
models  that  we  will  study  later  in  this  course  predict  definite  radii. 
If  the  electron  is  imagined  as  circling  around  the  proton  at  a  speed 
of  2.2  x  106  m/s,  in  an  orbit  with  a  radius  of  0.05  nm,  what  electric 
current  is  created?  V  ^ 

16  The  figure  shows  a  simplified  diagram  of  a  device  called  a 
tandem  accelerator,  used  for  accelerating  beams  of  ions  up  to  speeds 
on  the  order  of  1%  of  the  speed  of  light.  The  nuclei  of  these  ions 
collide  with  the  nuclei  of  atoms  in  a  target,  producing  nuclear  re¬ 
actions  for  experiments  studying  the  structure  of  nuclei.  The  outer 
shell  of  the  accelerator  is  a  conductor  at  zero  voltage  (i.e. ,  the  same 
voltage  as  the  Earth).  The  electrode  at  the  center,  known  as  the 
“terminal,”  is  at  a  high  positive  voltage,  perhaps  millions  of  volts. 
Negative  ions  with  a  charge  of  —1  unit  (i.e.,  atoms  with  one  extra 
electron)  are  produced  offstage  on  the  right,  typically  by  chemical 
reactions  with  cesium,  which  is  a  chemical  element  that  has  a  strong 
tendency  to  give  away  electrons.  Relatively  weak  electric  and  mag¬ 
netic  forces  are  used  to  transport  these  —1  ions  into  the  accelerator, 
where  they  are  attracted  to  the  terminal.  Although  the  center  of  the 
terminal  has  a  hole  in  it  to  let  the  ions  pass  through,  there  is  a  very 
thin  carbon  foil  there  that  they  must  physically  penetrate.  Passing 
through  the  foil  strips  off  some  number  of  electrons,  changing  the 
atom  into  a  positive  ion,  with  a  charge  of  +n  times  the  fundamental 
charge.  Now  that  the  atom  is  positive,  it  is  repelled  by  the  terminal, 
and  accelerates  some  more  on  its  way  out  of  the  accelerator. 

(a)  Find  the  velocity,  v,  of  the  emerging  beam  of  positive  ions,  in 
terms  of  n,  their  mass  m,  the  terminal  voltage  V .  and  fundamental 
constants.  Neglect  the  small  change  in  mass  caused  by  the  loss  of 
electrons  in  the  stripper  foil. 

(b)  To  fuse  protons  with  protons,  a  minimum  beam  velocity  of  about 

11%  of  the  speed  of  light  is  required.  What  terminal  voltage  would 
be  needed  in  this  case?  % 

(c)  In  the  setup  described  in  part  b,  we  need  a  target  containing 

atoms  whose  nuclei  are  single  protons,  i.e.,  a  target  made  of  hydro¬ 
gen.  Since  hydrogen  is  a  gas,  and  we  want  a  foil  for  our  target,  we 
have  to  use  a  hydrogen  compound,  such  as  a  plastic.  Discuss  what 
effect  this  would  have  on  the  experiment.  %  ^ 

17  Wire  is  sold  in  a  series  of  standard  diameters,  called  “gauges.” 
The  difference  in  diameter  between  one  gauge  and  the  next  in  the 
series  is  about  20%.  How  would  the  resistance  of  a  given  length  of 
wire  compare  with  the  resistance  of  the  same  length  of  wire  in  the 
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next  gauge  in  the  series?  - 

18  In  the  figure,  the  battery  is  9  V. 

(a)  What  are  the  voltage  differences  across  each  light  bulb?  V 

(b)  What  current  flows  through  each  of  the  three  components  of  the 

circuit?  V 

(c)  If  a  new  wire  is  added  to  connect  points  A  and  B,  how  will  the 
appearances  of  the  bulbs  change?  What  will  be  the  new  voltages 
and  currents? 

(d)  Suppose  no  wire  is  connected  from  A  to  B,  but  the  two  bulbs 

are  switched.  How  will  the  results  compare  with  the  results  from 
the  original  setup  as  drawn?  ^ 

19  A  student  in  a  biology  lab  is  given  the  following  instruc¬ 
tions:  “Connect  the  cerebral  eraser  (C.E.)  and  the  neural  depo¬ 
larizer  (N.D.)  in  parallel  with  the  power  supply  (P.S.).  (Under  no 
circumstances  should  you  ever  allow  the  cerebral  eraser  to  come 
within  20  cm  of  your  head.)  Connect  a  voltmeter  to  measure  the 
voltage  across  the  cerebral  eraser,  and  also  insert  an  ammeter  in 
the  circuit  so  that  you  can  make  sure  you  don’t  put  more  than  100 
mA  through  the  neural  depolarizer.”  The  diagrams  show  two  lab 
groups’  attempts  to  follow  the  instructions. 

(a)  Translate  diagram  1  into  a  standard-style  schematic.  What  is 
correct  and  incorrect  about  this  group’s  setup? 

(b)  Do  the  same  for  diagram  2.  ^ 

20  Referring  back  to  problem  15  on  page  512  about  the  sodium 
chloride  crystal,  suppose  the  lithium  ion  is  going  to  jump  from  the 
gap  it  is  occupying  to  one  of  the  four  closest  neighboring  gaps. 
Which  one  will  it  jump  to,  and  if  it  starts  from  rest,  how  fast  will 
it  be  going  by  the  time  it  gets  there?  (It  will  keep  on  moving  and 
accelerating  after  that,  but  that  does  not  concern  us.)  [Hint:  The 
approach  is  similar  to  the  one  used  for  the  other  problem,  but  you 
want  to  work  with  voltage  and  electrical  energy  rather  than  force.] 

V  ^ 

21  A  1.0  n  toaster  and  a  2.0  H  lamp  are  connected  in  parallel 
with  the  110-V  supply  of  your  house.  (Ignore  the  fact  that  the 
voltage  is  AC  rather  than  DC.) 

(a)  Draw  a  schematic  of  the  circuit. 

(b)  For  each  of  the  three  components  in  the  circuit,  find  the  current 

passing  through  it  and  the  voltage  drop  across  it.  V 

(c)  Suppose  they  were  instead  hooked  up  in  series.  Draw  a  schematic 

and  calculate  the  same  things.  V  ^ 

22  The  heating  element  of  an  electric  stove  is  connected  in  series 
with  a  switch  that  opens  and  closes  many  times  per  second.  When 
you  turn  the  knob  up  for  more  power,  the  fraction  of  the  time  that 
the  switch  is  closed  increases.  Suppose  someone  suggests  a  simpler 
alternative  for  controlling  the  power  by  putting  the  heating  element 
in  series  with  a  variable  resistor  controlled  by  the  knob.  (With  the 
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knob  turned  all  the  way  clockwise,  the  variable  resistor’s  resistance  is 
nearly  zero,  and  when  it’s  all  the  way  counterclockwise,  its  resistance 
is  essentially  infinite.)  (a)  Draw  schematics,  (b)  Why  would  the 
simpler  design  be  undesirable?  ^ 

23  You  have  a  circuit  consisting  of  two  unknown  resistors  in 
series,  and  a  second  circuit  consisting  of  two  unknown  resistors  in 
parallel. 

(a)  What,  if  anything,  would  you  learn  about  the  resistors  in  the 
series  circuit  by  finding  that  the  currents  through  them  were  equal? 

(b)  What  if  you  found  out  the  voltage  differences  across  the  resistors 
in  the  series  circuit  were  equal? 

(c)  What  would  you  learn  about  the  resistors  in  the  parallel  circuit 
from  knowing  that  the  currents  were  equal? 

(d)  What  if  the  voltages  in  the  parallel  circuit  were  equal?  - 

24  How  many  different  resistance  values  can  be  created  by  com¬ 

bining  three  unequal  resistors?  (Don’t  count  possibilities  in  which 
not  all  the  resistors  are  used,  i.e.,  ones  in  which  there  is  zero  current 
in  one  or  more  of  them.)  ■ 

25  Suppose  six  identical  resistors,  each  with  resistance  R ,  are 
connected  so  that  they  form  the  edges  of  a  tetrahedron  (a  pyramid 
with  three  sides  in  addition  to  the  base,  i.e.,  one  less  side  than  an 
Egyptian  pyramid) .  What  resistance  value  or  values  can  be  obtained 
by  making  connections  onto  any  two  points  on  this  arrangement? 

>  Solution,  p.  979  s& 

26  A  person  in  a  rural  area  who  has  no  electricity  runs  an 
extremely  long  extension  cord  to  a  friend’s  house  down  the  road  so 
she  can  run  an  electric  light.  The  cord  is  so  long  that  its  resistance, 
x,  is  not  negligible.  Show  that  the  lamp’s  brightness  is  greatest  if 
its  resistance,  y,  is  equal  to  x.  Explain  physically  why  the  lamp  is 
dim  for  values  of  y  that  are  too  small  or  too  large. 

27  All  three  resistors  have  the  same  resistance,  R.  Find  the 

three  unknown  currents  in  terms  of  Vi,  V2,  and  R.  V  ^ 


Problem  27. 
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28  You  are  given  a  battery,  a  flashlight  bulb,  and  a  single  piece 

of  wire.  Draw  at  least  two  configurations  of  these  items  that  would 
result  in  lighting  up  the  bulb,  and  at  least  two  that  would  not  light 
it.  (Don’t  draw  schematics.)  If  you’re  not  sure  what’s  going  on, 
borrow  the  materials  from  your  instructor  and  try  it.  Note  that  the 
bulb  has  two  electrical  contacts:  one  is  the  threaded  metal  jacket, 
and  the  other  is  the  tip  (at  the  bottom  in  the  figure).  [Problem  by 
Arnold  Arons.]  ^ 

29  The  figure  shows  a  simplified  diagram  of  an  electron  gun  such 
as  the  one  that  creates  the  electron  beam  in  a  TV  tube.  Electrons 
that  spontaneously  emerge  from  the  negative  electrode  (cathode) 
are  then  accelerated  to  the  positive  electrode,  which  has  a  hole  in  it. 
(Once  they  emerge  through  the  hole,  they  will  slow  down.  However, 
if  the  two  electrodes  are  fairly  close  together,  this  slowing  down  is  a 
small  effect,  because  the  attractive  and  repulsive  forces  experienced 
by  the  electron  tend  to  cancel.) 

(a)  If  the  voltage  difference  between  the  electrodes  is  AV,  what  is 

the  velocity  of  an  electron  as  it  emerges  at  B?  Assume  that  its  initial 
velocity,  at  A,  is  negligible,  and  that  the  velocity  is  nonrelativistic. 
(If  you  haven’t  read  ch.  7  yet,  don’t  worry  about  the  remark  about 
relativity.)  v 

(b)  Evaluate  your  expression  numerically  for  the  case  where  AV=10 

kV,  and  compare  to  the  speed  of  light.  If  you’ve  read  ch.  7  already, 
comment  on  whether  the  assumption  of  nonrelativistic  motion  was 
justified.  >  Solution,  p.  980  V  ^ 

30  (a)  Many  battery-operated  devices  take  more  than  one  bat¬ 
tery.  If  you  look  closely  in  the  battery  compartment,  you  will  see 
that  the  batteries  are  wired  in  series.  Consider  a  flashlight  circuit. 
What  does  the  loop  rule  tell  you  about  the  effect  of  putting  several 
batteries  in  series  in  this  way? 

(b)  The  cells  of  an  electric  eel’s  nervous  system  are  not  that  differ¬ 
ent  from  ours  —  each  cell  can  develop  a  voltage  difference  across 
it  of  somewhere  on  the  order  of  one  volt.  How,  then,  do  you  think 
an  electric  eel  can  create  voltages  of  thousands  of  volts  between 
different  parts  of  its  body?  ■ 

31  The  figure  shows  two  possible  ways  of  wiring  a  flashlight  with 
a  switch.  Both  will  serve  to  turn  the  bulb  on  and  off,  although  the 
switch  functions  in  the  opposite  sense.  Why  is  method  (1)  prefer¬ 
able? 


1  2 
Problem  31. 
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A  printed  circuit  board,  like 
the  kind  referred  to  in  problem 
32. 
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Problem  35. 


32  You  have  to  do  different  things  with  a  circuit  to  measure 
current  than  to  measure  a  voltage  difference.  Which  would  be  more 
practical  for  a  printed  circuit  board,  in  which  the  wires  are  actually 
strips  of  metal  embedded  inside  the  board? 

>  Solution,  p.  980  - 

33  The  bulbs  are  all  identical.  Which  one  doesn’t  light  up?  ■ 

34  Each  bulb  has  a  resistance  of  one  ohm.  How  much  power  is 

drawn  from  the  one-volt  battery?  V  ■ 

35  The  bulbs  all  have  unequal  resistances.  Given  the  three 

currents  shown  in  the  figure,  find  the  currents  through  bulbs  A,  B, 
C,  and  D.  ^ 

36  A  silk  thread  is  uniformly  charged  by  rubbing  it  with  llama 

fur.  The  thread  is  then  dangled  vertically  above  a  metal  plate  and 
released.  As  each  part  of  the  thread  makes  contact  with  the  conduct¬ 
ing  plate,  its  charge  is  deposited  onto  the  plate.  Since  the  thread  is 
accelerating  due  to  gravity,  the  rate  of  charge  deposition  increases 
with  time,  and  by  time  t  the  cumulative  amount  of  charge  is  q  =  ct2, 
where  c  is  a  constant,  (a)  Find  the  current  flowing  onto  the  plate.  V 
(b)  Suppose  that  the  charge  is  immediately  carried  away  through  a 
resistance  R.  Find  the  power  dissipated  as  heat.  V 

■ 

37  In  example  9  on  p.  530,  suppose  that  the  larger  sphere  has 

radius  a,  the  smaller  one  b.  (a)  Use  the  result  of  problem  9  to  show 
that  the  ratio  of  the  charges  on  the  two  spheres  is  qa/%  =  o,/b.  (b) 
Show  that  the  density  of  charge  (charge  per  unit  area)  is  the  other 
way  around:  the  charge  density  on  the  smaller  sphere  is  greater  than 
that  on  the  larger  sphere  in  the  ratio  a/b.  ■ 

38  (a)  Recall  that  the  gravitational  energy  of  two  gravitation¬ 
ally  interacting  spheres  is  given  by  PE  =  —Gm\m2/r,  where  r  is 
the  center-to-center  distance.  Sketch  a  graph  of  PE  as  a  function 
of  r,  making  sure  that  your  graph  behaves  properly  at  small  values 
of  r,  where  you’re  dividing  by  a  small  number,  and  at  large  ones, 
where  you’re  dividing  by  a  large  one.  Check  that  your  graph  be¬ 
haves  properly  when  a  rock  is  dropped  from  a  larger  r  to  a  smaller 
one;  the  rock  should  lose  potential  energy  as  it  gains  kinetic  energy, 
(b)  Electrical  forces  are  closely  analogous  to  gravitational  ones,  since 
both  depend  on  1/r2.  Since  the  forces  are  analogous,  the  potential 
energies  should  also  behave  analogously.  Using  this  analogy,  write 
down  the  expression  for  the  electrical  potential  energy  of  two  inter¬ 
acting  charged  particles.  The  main  uncertainty  here  is  the  sign  out 
in  front.  Like  masses  attract,  but  like  charges  repel.  To  figure  out 
whether  you  have  the  right  sign  in  your  equation,  sketch  graphs  in 
the  case  where  both  charges  are  positive,  and  also  in  the  case  where 
one  is  positive  and  one  negative;  make  sure  that  in  both  cases,  when 
the  charges  are  released  near  one  another,  their  motion  causes  them 
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to  lose  PE  while  gaining  KE. 

39  Find  the  current  drawn  from  the  battery. 


V 


Problem  39. 


Key  to  symbols: 

easy  -  typical  S  challenging  )i  difficult  ■  very  difficult 
V  An  answer  check  is  available  at  www.lightandmatter.com. 
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Exercise  9A:  Voltage  and  Current 

1.  How  many  different  currents  could  you  measure  in  this  circuit?  Make  a  prediction,  and  then 
try  it. 

- i - 

- WW — 

What  do  you  notice?  How  does  this  make  sense  in  terms  of  the  roller  coaster  metaphor  intro¬ 
duced  in  discussion  question  9.1.3A  on  page  525? 

What  is  being  used  up  in  the  resistor? 

2.  By  connecting  probes  to  these  points,  how  many  ways  could  you  measure  a  voltage?  How 
many  of  them  would  be  different  numbers?  Make  a  prediction,  and  then  do  it. 


A 

B 

■  - 

- -  A  / 

1  • 

i  A  - - 

*■ — \A/W - 

C  v  v  v  v  D 


What  do  you  notice?  Interpret  this  using  the  roller  coaster  metaphor,  and  color  in  parts  of  the 
circuit  that  represent  constant  voltages. 

3.  The  resistors  are  unequal.  How  many  different  voltages  and  currents  can  you  measure?  Make 
a  prediction,  and  then  try  it. 


WWV — \AM r1 

What  do  you  notice?  Interpret  this  using  the  roller  coaster  metaphor,  and  color  in  parts  of  the 
circuit  that  represent  constant  voltages. 
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Exercise  9B:  The  Loop  and  Junction  Rules 

Apparatus: 

DC  power  supply 

multimeter 

resistors 

1.  The  junction  rule 

Construct  a  circuit  like  this  one,  using  the  power  supply  as  your  voltage  source.  To  make  things 
more  interesting,  don’t  use  equal  resistors.  Use  nice  big  resistors  (say  100  kCl  to  1  MCI)  - 
this  will  ensure  that  you  don’t  burn  up  the  resistors,  and  that  the  multimeter’s  small  internal 
resistance  when  used  as  an  ammeter  is  negligible  in  comparison. 


I — vw — ' 


Insert  your  multimeter  in  the  circuit  to  measure  all  three  currents  that  you  need  in  order  to  test 
the  junction  rule. 

2.  The  loop  rule 

Now  come  up  with  a  circuit  to  test  the  loop  rule.  Since  the  loop  rule  is  always  supposed  to  be 
true,  it’s  hard  to  go  wrong  here!  Make  sure  you  have  at  least  three  resistors  in  a  loop,  and  make 
sure  you  hook  in  the  power  supply  in  a  way  that  creates  non-zero  voltage  differences  across  all 
the  resistors.  Measure  the  voltage  differences  you  need  to  measure  to  test  the  loop  rule.  Here 
it  is  best  to  use  fairly  small  resistances,  so  that  the  multimeter’s  large  internal  resistance  when 
used  in  parallel  as  a  voltmeter  will  not  significantly  reduce  the  resistance  of  the  circuit.  Do  not 
use  resistances  of  less  than  about  100  Cl,  however,  or  you  may  blow  a  fuse  or  burn  up  a  resistor. 
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Exercise  9C:  Reasoning  About  Circuits 

The  questions  in  this  exercise  can  all  be  solved  using  some  combination  of  the  following  ap¬ 
proaches: 

a)  There  is  constant  voltage  throughout  any  conductor. 

b)  Ohm’s  law  can  be  applied  to  any  part  of  a  circuit. 

c)  Apply  the  loop  rule. 

d)  Apply  the  junction  rule. 

In  each  case,  discuss  the  question,  decide  what  you  think  is  the  right  answer,  and  then  try  the 
experiment. 

1.  A  wire  is  added  in  parallel  with  one  bulb. 


JSlxSh, 


2/ 

I 

<  t 

1 

1 

Which  reasoning  is  correct? 


•  Each  bulb  still  has  1.2  V  across  it,  so  both  bulbs  are  still  lit  up. 

•  All  parts  of  a  wire  are  at  the  same  voltage,  and  there  is  now  a  wire  connection  from  one 
side  of  the  right-hand  bulb  to  the  other.  The  right-hand  bulb  has  no  voltage  difference 
across  it,  so  it  goes  out. 


2.  The  series  circuit  is  changed  as  shown. 

AD— SA 


2.4  V 

Which  reasoning  is  correct? 


•  Each  bulb  now  has  its  sides  connected  to  the  two  terminals  of  the  battery,  so  each  now  has 
2-4  V  across  it  instead  of  1.2  V.  They  get  brighter. 


•  Just  as  in  the  original  circuit,  the  current  goes  through  one  bulb,  then  the  other.  It’s  just 
that  now  the  current  goes  in  a  figure-8  pattern.  The  bulbs  glow  the  same  as  before. 


560 


Chapter  9  Circuits 


3.  A  wire  is  added  as  shown  to  the  original  circuit. 


What  is  wrong  with  the  following  reasoning? 

The  top  right  bulb  will  go  out,  because  its  two  sides  are  now  connected  with  wire,  so  there  will 
be  no  voltage  difference  across  it.  The  other  three  bidbs  will  not  be  affected. 

4.  A  wire  is  added  as  shown  to  the  original  circuit. 


What  is  wrong  with  the  following  reasoning? 

The  current  flows  out  of  the  right  side  of  the  battery.  When  it  hits  the  first  junction,  some  of 
it  will  go  left  and  some  will  keep  going  up  The  part  that  goes  up  lights  the  top  right  bulb.  The 
part  that  turns  left  then  follows  the  path  of  least  resistance,  going  through  the  new  wire  instead 
of  the  bottom  bulb.  The  top  bulb  stays  lit,  the  bottom  one  goes  out,  and  others  stay  the  same. 

5.  What  happens  when  one  bulb  is  unscrewed,  leaving  an  air  gap? 
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Fields 


“Okay.  Your  duties  are  as  follows:  Get  Breen.  I  don’t  care  how  you  get 
him,  but  get  him  soon.  That  faker!  He  posed  for  twenty  years  as  a  scientist 
without  ever  being  apprehended.  Well,  I’m  going  to  do  some  apprehend¬ 
ing  that’ll  make  all  previous  apprehending  look  like  no  apprehension  at  all. 
You  with  me?” 

“Yes,”  said  Battle,  very  much  confused.  “What’s  that  thing  you  have?” 

“Piggy-back  heat-ray.  You  transpose  the  air  in  its  path  into  an  unstable 
isotope  which  tends  to  carry  all  energy  as  heat.  Then  you  shoot  your  juice 
light,  or  whatever  along  the  isotopic  path  and  you  burn  whatever’s  on  the 
receiving  end.  You  want  a  few?” 

“No,”  said  Battle.  “I  have  my  gats.  What  else  have  you  got  for  offense 
and  defense?”  Underbottam  opened  a  cabinet  and  proudly  waved  an 
arm.  “Everything,”  he  said. 

“Disintegrates,  heat-rays,  bombs  of  every  type.  And  impenetrable 
shields  of  energy,  massive  and  portable.  What  more  do  I  need?” 

From  THE  REVERSIBLE  REVOLUTIONS  by  Cecil  Corwin,  Cosmic 
Stories,  March  1941.  Art  by  Morey,  Bok,  Kyle,  Hunt,  Forte.  Copyright 
expired. 


10.1  Fields  of  Force 

Cutting-edge  science  readily  infiltrates  popular  culture,  though  some¬ 
times  in  garbled  form.  The  Newtonian  imagination  populated  the 
universe  mostly  with  that  nice  solid  stuff  called  matter,  which  was 
made  of  little  hard  balls  called  atoms.  In  the  early  twentieth  cen¬ 
tury,  consumers  of  pulp  fiction  and  popularized  science  began  to 
hear  of  a  new  image  of  the  universe,  full  of  x-rays,  N-rays,  and 
Hertzian  waves.  What  they  were  beginning  to  soak  up  through 
their  skins  was  a  drastic  revision  of  Newton’s  concept  of  a  universe 
made  of  chunks  of  matter  which  happened  to  interact  via  forces.  In 
the  newly  emerging  picture,  the  universe  was  made  of  force,  or,  to 
be  more  technically  accurate,  of  ripples  in  universal  fields  of  force. 
Unlike  the  average  reader  of  Cosmic  Stories  in  1941,  you  now  pos¬ 
sess  enough  technical  background  to  understand  what  a  “force  field” 
really  is. 

10.1.1  Why  fields? 
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a  /  A  bar  magnet’s  atoms  are 
(partially)  aligned. 


b  /  A  bar  magnet  interacts 
with  our  magnetic  planet. 


c  /  Magnets  aligned  north-south. 


d  /  The  second  magnet  is  re¬ 
versed. 


e  /  Both  magnets  are  reversed. 


Time  delays  in  forces  exerted  at  a  distance 

What  convinced  physicists  that  they  needed  this  new  concept  of 
a  field  of  force?  Although  we  have  been  dealing  mostly  with  elec¬ 
trical  forces,  let’s  start  with  a  magnetic  example.  (In  fact  the  main 
reason  I’ve  delayed  a  detailed  discussion  of  magnetism  for  so  long 
is  that  mathematical  calculations  of  magnetic  effects  are  handled 
much  more  easily  with  the  concept  of  a  field  of  force.)  First  a  little 
background  leading  up  to  our  example.  A  bar  magnet,  a,  has  an  axis 
about  which  many  of  the  electrons’  orbits  are  oriented.  The  earth 
itself  is  also  a  magnet,  although  not  a  bar-shaped  one.  The  interac¬ 
tion  between  the  earth-magnet  and  the  bar  magnet,  b,  makes  them 
want  to  line  up  their  axes  in  opposing  directions  (in  other  words 
such  that  their  electrons  rotate  in  parallel  planes,  but  with  one  set 
rotating  clockwise  and  the  other  counterclockwise  as  seen  looking 
along  the  axes).  On  a  smaller  scale,  any  two  bar  magnets  placed 
near  each  other  will  try  to  align  themselves  head-to-tail,  c. 

Now  we  get  to  the  relevant  example.  It  is  clear  that  two  people 
separated  by  a  paper-thin  wall  could  use  a  pair  of  bar  magnets  to 
signal  to  each  other.  Each  person  would  feel  her  own  magnet  trying 
to  twist  around  in  response  to  any  rotation  performed  by  the  other 
person’s  magnet.  The  practical  range  of  communication  would  be 
very  short  for  this  setup,  but  a  sensitive  electrical  apparatus  could 
pick  up  magnetic  signals  from  much  farther  away.  In  fact,  this  is 
not  so  different  from  what  a  radio  does:  the  electrons  racing  up 
and  down  the  transmitting  antenna  create  forces  on  the  electrons 
in  the  distant  receiving  antenna.  (Both  magnetic  and  electric  forces 
are  involved  in  real  radio  signals,  but  we  don’t  need  to  worry  about 
that  yet.) 

A  question  now  naturally  arises  as  to  whether  there  is  any  time 
delay  in  this  kind  of  communication  via  magnetic  (and  electric) 
forces.  Newton  would  have  thought  not,  since  he  conceived  of 
physics  in  terms  of  instantaneous  action  at  a  distance.  We  now 
know,  however,  that  there  is  such  a  time  delay.  If  you  make  a 
long-distance  phone  call  that  is  routed  through  a  communications 
satellite,  you  should  easily  be  able  to  detect  a  delay  of  about  half  a 
second  over  the  signal’s  round  trip  of  50,000  miles.  Modern  measure¬ 
ments  have  shown  that  electric,  magnetic,  and  gravitational  forces 
all  travel  at  the  speed  of  light,  3  x  10s  m/s.  (In  fact,  we  will  soon 
discuss  how  light  itself  is  made  of  electricity  and  magnetism.) 

If  it  takes  some  time  for  forces  to  be  transmitted  through  space, 
then  apparently  there  is  some  thing  that  travels  through  space.  The 
fact  that  the  phenomenon  travels  outward  at  the  same  speed  in  all 
directions  strongly  evokes  wave  metaphors  such  as  ripples  on  a  pond. 
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More  evidence  that  fields  of  force  are  real:  they  carry  energy. 

The  smoking-gun  argument  for  this  strange  notion  of  traveling 
force  ripples  comes  from  the  fact  that  they  carry  energy. 

First  suppose  that  the  person  holding  the  bar  magnet  on  the 
right  decides  to  reverse  hers,  resulting  in  configuration  d.  She  had 
to  do  mechanical  work  to  twist  it,  and  if  she  releases  the  magnet, 
energy  will  be  released  as  it  flips  back  to  c.  She  has  apparently  stored 
energy  by  going  from  c  to  d.  So  far  everything  is  easily  explained 
without  the  concept  of  a  field  of  force. 

But  now  imagine  that  the  two  people  start  in  position  c  and 
then  simultaneously  flip  their  magnets  extremely  quickly  to  position 
e,  keeping  them  lined  up  with  each  other  the  whole  time.  Imagine, 
for  the  sake  of  argument,  that  they  can  do  this  so  quickly  that 
each  magnet  is  reversed  while  the  force  signal  from  the  other  is 
still  in  transit.  (For  a  more  realistic  example,  we’d  have  to  have 
two  radio  antennas,  not  two  magnets,  but  the  magnets  are  easier 
to  visualize.)  During  the  flipping,  each  magnet  is  still  feeling  the 
forces  arising  from  the  way  the  other  magnet  used  to  be  oriented. 
Even  though  the  two  magnets  stay  aligned  during  the  flip,  the  time 
delay  causes  each  person  to  feel  resistance  as  she  twists  her  magnet 
around.  How  can  this  be?  Both  of  them  are  apparently  doing 
mechanical  work,  so  they  must  be  storing  magnetic  energy  somehow. 
But  in  the  traditional  Newtonian  conception  of  matter  interacting 
via  instantaneous  forces  at  a  distance,  interaction  energy  arises  from 
the  relative  positions  of  objects  that  are  interacting  via  forces.  If 
the  magnets  never  changed  their  orientations  relative  to  each  other, 
how  can  any  magnetic  energy  have  been  stored? 

The  only  possible  answer  is  that  the  energy  must  have  gone 
into  the  magnetic  force  ripples  crisscrossing  the  space  between  the 
magnets.  Fields  of  force  apparently  carry  energy  across  space,  which 
is  strong  evidence  that  they  are  real  things. 

This  is  perhaps  not  as  radical  an  idea  to  us  as  it  was  to  our 
ancestors.  We  are  used  to  the  idea  that  a  radio  transmitting  antenna 
consumes  a  great  deal  of  power,  and  somehow  spews  it  out  into  the 
universe.  A  person  working  around  such  an  antenna  needs  to  be 
careful  not  to  get  too  close  to  it,  since  all  that  energy  can  easily 
cook  flesh  (a  painful  phenomenon  known  as  an  “RF  burn”). 

1 0.1 .2  The  gravitational  field 

Given  that  fields  of  force  are  real,  how  do  we  define,  measure, 
and  calculate  them?  A  fruitful  metaphor  will  be  the  wind  patterns 
experienced  by  a  sailing  ship.  Wherever  the  ship  goes,  it  will  feel  a 
certain  amount  of  force  from  the  wind,  and  that  force  will  be  in  a 
certain  direction.  The  weather  is  ever-changing,  of  course,  but  for 
now  let’s  just  imagine  steady  wind  patterns.  Definitions  in  physics 
are  operational,  i.e.,  they  describe  how  to  measure  the  thing  being 
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f  /  The  wind  patterns  in  a 
certain  area  of  the  ocean  could 
be  charted  in  a  “sea  of  arrows” 
representation  like  this.  Each 
arrow  represents  both  the  wind’s 
strength  and  its  direction  at  a 
certain  location. 
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defined.  The  ship’s  captain  can  measure  the  wind’s  “field  of  force” 
by  going  to  the  location  of  interest  and  determining  both  the  direc¬ 
tion  of  the  wind  and  the  strength  with  which  it  is  blowing.  Charting 
all  these  measurements  on  a  map  leads  to  a  depiction  of  the  field  of 
wind  force  like  the  one  shown  in  the  figure.  This  is  known  as  the 
“sea  of  arrows”  method  of  visualizing  a  field. 

Now  let’s  see  how  these  concepts  are  applied  to  the  fundamental 
force  fields  of  the  universe.  We’ll  start  with  the  gravitational  field, 
which  is  the  easiest  to  understand.  As  with  the  wind  patterns, 
we’ll  start  by  imagining  gravity  as  a  static  field,  even  though  the 
existence  of  the  tides  proves  that  there  are  continual  changes  in  the 
gravity  field  in  our  region  of  space.  When  the  gravitational  field  was 
introduced  in  chapter  2,  I  avoided  discussing  its  direction  explicitly, 
but  defining  it  is  easy  enough:  we  simply  go  to  the  location  of 
interest  and  measure  the  direction  of  the  gravitational  force  on  an 
object,  such  as  a  weight  tied  to  the  end  of  a  string. 

In  chapter  2,  I  defined  the  gravitational  field  in  terms  of  the  en¬ 
ergy  required  to  raise  a  unit  mass  through  a  unit  distance.  However, 
I’m  going  to  give  a  different  definition  now,  using  an  approach  that 
will  be  more  easily  adapted  to  electric  and  magnetic  fields.  This 
approach  is  based  on  force  rather  than  energy.  We  couldn’t  carry 
out  the  energy-based  definition  without  dividing  by  the  mass  of  the 
object  involved,  and  the  same  is  true  for  the  force-based  definition. 
For  example,  gravitational  forces  are  weaker  on  the  moon  than  on 
the  earth,  but  we  cannot  specify  the  strength  of  gravity  simply  by 
giving  a  certain  number  of  newtons.  The  number  of  newtons  of 
gravitational  force  depends  not  just  on  the  strength  of  the  local 
gravitational  field  but  also  on  the  mass  of  the  object  on  which  we’re 
testing  gravity,  our  “test  mass.”  A  boulder  on  the  moon  feels  a 
stronger  gravitational  force  than  a  pebble  on  the  earth.  We  can  get 
around  this  problem  by  defining  the  strength  of  the  gravitational 
field  as  the  force  acting  on  an  object,  divided,  by  the  object’s  mass: 

The  gravitational  field  vector,  g,  at  any  location  in  space  is  found 
by  placing  a  test  mass  nit  at  that  point.  The  field  vector  is  then 
given  by  g  =  F /mt,  where  F  is  the  gravitational  force  on  the  test 
mass. 

We  now  have  three  ways  of  representing  a  gravitational  field. 
The  magnitude  of  the  gravitational  field  near  the  surface  of  the 
earth,  for  instance,  could  be  written  as  9.8  N/kg,  9.8  J/kg  •  nr,  or 
9.8  m/s2.  If  we  already  had  two  names  for  it,  why  invent  a  third? 
The  main  reason  is  that  it  prepares  us  with  the  right  approach  for 
defining  other  fields. 

The  most  subtle  point  about  all  this  is  that  the  gravitational 
field  tells  us  about  what  forces  would  be  exerted  on  a  test  mass  by 
the  earth,  sun,  moon,  and  the  rest  of  the  universe,  if  we  inserted  a 
test  mass  at  the  point  in  question.  The  field  still  exists  at  all  the 
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places  where  we  didn’t  measure  it. 

Gravitational  field  of  the  earth  example  1 

o  What  is  the  magnitude  of  the  earth’s  gravitational  field,  in  terms 
of  its  mass,  M,  and  the  distance  r  from  its  center? 

>  Substituting  |F|  =  GMmt/r 2  into  the  definition  of  the  gravitational 
field,  we  find  jgj  =  GM/r2.  This  expression  could  be  used  for 
the  field  of  any  spherically  symmetric  mass  distribution,  since  the 
equation  we  assumed  for  the  gravitational  force  would  apply  in 
any  such  case. 

Sources  and  sinks 

If  we  make  a  sea-of-arrows  picture  of  the  gravitational  fields 
surrounding  the  earth,  g,  the  result  is  evocative  of  water  going  down 
a  drain.  For  this  reason,  anything  that  creates  an  inward-pointing 
field  around  itself  is  called  a  sink.  The  earth  is  a  gravitational  sink. 
The  term  “source”  can  refer  specifically  to  things  that  make  outward 
fields,  or  it  can  be  used  as  a  more  general  term  for  both  “outies” 
and  “innies.”  However  confusing  the  terminology,  we  know  that 
gravitational  fields  are  only  attractive,  so  we  will  never  find  a  region 
of  space  with  an  outward-pointing  field  pattern. 

Knowledge  of  the  field  is  interchangeable  with  knowledge  of  its 
sources  (at  least  in  the  case  of  a  static,  unchanging  field).  If  aliens 
saw  the  earth’s  gravitational  held  pattern  they  could  immediately 
infer  the  existence  of  the  planet,  and  conversely  if  they  knew  the 
mass  of  the  earth  they  could  predict  its  influence  on  the  surrounding 
gravitational  held. 

Superposition  of  fields 

A  very  important  fact  about  all  helds  of  force  is  that  when  there 
is  more  than  one  source  (or  sink),  the  helds  add  according  to  the 
rules  of  vector  addition.  The  gravitational  held  certainly  will  have 
this  property,  since  it  is  dehned  in  terms  of  the  force  on  a  test 
mass,  and  forces  add  like  vectors.  Superposition  is  an  important 
characteristic  of  waves,  so  the  superposition  property  of  helds  is 
consistent  with  the  idea  that  disturbances  can  propagate  outward 
as  waves  in  a  held. 


T 
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g/The  gravitational  field  sur¬ 
rounding  a  clump  of  mass  such 
as  the  earth. 


->  o 


h/The  gravitational  fields  of 
the  earth  and  moon  superpose. 
Note  how  the  fields  cancel  at  one 
point,  and  how  there  is  no  bound¬ 
ary  between  the  interpenetrating 
fields  surrounding  the  two  bodies. 
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Reduction  in  gravity  on  lo  due  to  Jupiter’s  gravity  example  2 

>  The  average  gravitational  field  on  Jupiter’s  moon  lo  is  1 .81  N/kg. 
By  how  much  is  this  reduced  when  Jupiter  is  directly  overhead? 
lo’s  orbit  has  a  radius  of  4.22  x  108  m,  and  Jupiter’s  mass  is 
1.899  x  1027  kg. 

>  By  the  shell  theorem,  we  can  treat  the  Jupiter  as  if  its  mass  was 
all  concentrated  at  its  center,  and  likewise  for  lo.  If  we  visit  lo  and 
land  at  the  point  where  Jupiter  is  overhead,  we  are  on  the  same 
line  as  these  two  centers,  so  the  whole  problem  can  be  treated 
one-dimensionally,  and  vector  addition  is  just  like  scalar  addition. 
Let’s  use  positive  numbers  for  downward  fields  (toward  the  center 
of  lo)  and  negative  for  upward  ones.  Plugging  the  appropriate 
data  into  the  expression  derived  in  example  1 ,  we  find  that  the 
Jupiter’s  contribution  to  the  field  is  -0.71  N/kg.  Superposition 
says  that  we  can  find  the  actual  gravitational  field  by  adding  up 
the  fields  created  by  lo  and  Jupiter:  1 .81  -  0.71  N/kg  =  1.1  N/kg. 
You  might  think  that  this  reduction  would  create  some  spectacular 
effects,  and  make  lo  an  exciting  tourist  destination.  Actually  you 
would  not  detect  any  difference  if  you  flew  from  one  side  of  lo 
to  the  other.  This  is  because  your  body  and  lo  both  experience 
Jupiter’s  gravity,  so  you  follow  the  same  orbital  curve  through  the 
space  around  Jupiter. 


i/The  part  of  the  LIGO  grav¬ 
ity  wave  detector  at  Hanford  Nu¬ 
clear  Reservation,  near  Richland, 
Washington.  The  other  half  of  the 
detector  is  in  Louisiana. 


Gravitational  waves 

A  source  that  sits  still  will  create  a  static  field  pattern,  like  a 
steel  ball  sitting  peacefully  on  a  sheet  of  rubber.  A  moving  source 
will  create  a  spreading  wave  pattern  in  the  held,  like  a  bug  thrash- 
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ing  on  the  surface  of  a  pond.  Although  we  have  started  with  the 
gravitational  field  as  the  simplest  example  of  a  static  field,  stars 
and  planets  do  more  stately  gliding  than  thrashing,  so  gravitational 
waves  are  not  easy  to  detect.  Newton’s  theory  of  gravity  does  not 
describe  gravitational  waves,  but  they  are  predicted  by  Einstein’s 
general  theory  of  relativity. 

A  Caltech-MIT  collaboration  has  built  a  pair  of  gravitational 
wave  detectors  called  LIGO  to  search  for  direct  evidence  of  gravita¬ 
tional  waves.  Since  they  are  essentially  the  most  sensitive  vibration 
detectors  ever  made,  they  are  located  in  quiet  rural  areas,  and  sig¬ 
nals  are  compared  between  them  to  make  sure  that  they  were  not 
due  to  passing  trucks.  The  signature  of  a  gravitational  wave  is  if  the 
same  wiggle  is  seen  in  both  detectors  within  a  short  time.  The  de¬ 
tectors  are  able  to  sense  a  vibration  that  causes  a  change  of  10~18  m 
in  the  distance  between  the  mirrors  at  the  ends  of  the  4-krn  vacuum 
tunnels.  This  is  a  thousand  times  less  than  the  size  of  an  atomic 
nucleus!  In  2016,  the  collaboration  announced  the  first  detection  of 
a  gravitational  wave,  which  is  believed  to  have  originated  from  the 
collision  of  two  black  holes.  Propagation  of  gravitational  waves  at 
c  was  verified  through  multiple  methods  both  by  study  of  the  2016 
event  and  through  an  event  in  2017,  interpreted  as  a  collision  of  two 
neutron  stars,  in  which  both  gravitational  waves  and  electromag¬ 
netic  waves  were  detected  simultaneously. 

10.1.3  The  electric  field 

Definition 

The  definition  of  the  electric  field  is  directly  analogous  to,  and 
has  the  same  motivation  as,  the  definition  of  the  gravitational  field: 

The  electric  field  vector,  E,  at  any  location  in  space  is  found  by 
placing  a  test  charge  qt  at  that  point.  The  electric  field  vector  is 
then  given  by  E  =  F /qt,  where  F  is  the  electric  force  on  the  test 
charge. 

Charges  are  what  create  electric  fields.  Unlike  gravity,  which  is 
always  attractive,  electricity  displays  both  attraction  and  repulsion. 
A  positive  charge  is  a  source  of  electric  fields,  and  a  negative  one  is 
a  sink. 

The  most  difficult  point  about  the  definition  of  the  electric  field 
is  that  the  force  on  a  negative  charge  is  in  the  opposite  direction 
compared  to  the  field.  This  follows  from  the  definition,  since  di¬ 
viding  a  vector  by  a  negative  number  reverses  its  direction.  It’s  as 
though  we  had  some  objects  that  fell  upward  instead  of  down. 

seif-check  A 

Find  an  equation  for  the  magnitude  of  the  field  of  a  single  point  charge 

Q.  >  Answer,  p.  965 

Superposition  of  electric  fields  example  3 
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>  Charges  q  and  -q  are  at  a  distance  b  from  each  other,  as 
shown  in  the  figure.  What  is  the  electric  field  at  the  point  P,  which 
lies  at  a  third  corner  of  the  square? 

>  The  field  at  P  is  the  vector  sum  of  the  fields  that  would  have 
been  created  by  the  two  charges  independently.  Let  positive  x  be 
to  the  right  and  let  positive  y  be  up. 

Negative  charges  have  fields  that  point  at  them,  so  the  charge 
-q  makes  a  field  that  points  to  the  right,  i.e.,  has  a  positive  x 
component.  Using  the  answer  to  the  self-check,  we  have 


E-q,y  -  0. 

Note  that  if  we  had  blindly  ignored  the  absolute  value  signs  and 
plugged  in  -q  to  the  equation,  we  would  have  incorrectly  con¬ 
cluded  that  the  field  went  to  the  left. 

By  the  Pythagorean  theorem,  the  positive  charge  is  at  a  distance 
\/2b  from  P,  so  the  magnitude  of  its  contribution  to  the  field  is 
E  =  kq/2b2.  Positive  charges  have  fields  that  point  away  from 
them,  so  the  field  vector  is  at  an  angle  of  135°counterclockwise 
from  the  x  axis. 


-q,x 


-q,y 


kq_ 
2  b2 


cos  135° 
kq 


23/262 

kq 


2  b2 


sin  135° 


kq 

23/2^2 


k/A  dipole  field.  Electric  fields 
diverge  from  a  positive  charge 
and  converge  on  a  negative 
charge. 


The  total  field  is 


p  _  kq 

Cy  2 3/2  62 


Dipoles 

The  simplest  set  of  sources  that  can  occur  with  electricity  but 
not  with  gravity  is  the  dipole ,  consisting  of  a  positive  charge  and  a 
negative  charge  with  equal  magnitudes.  More  generally,  an  electric 
dipole  can  be  any  object  with  an  imbalance  of  positive  charge  on 
one  side  and  negative  on  the  other.  A  water  molecule,  1,  is  a  dipole 
because  the  electrons  tend  to  shift  away  from  the  hydrogen  atoms 
and  onto  the  oxygen  atom. 
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m  / 1 .  A  uniform  electric  field  created  by  some  charges  “off-stage.” 
2.  A  dipole  is  placed  in  the  field.  3.  The  dipole  aligns  with  the  field. 

Your  microwave  oven  acts  on  water  molecules  with  electric  fields. 
Let  us  imagine  what  happens  if  we  start  with  a  uniform  electric  field, 
m/1,  made  by  some  external  charges,  and  then  insert  a  dipole,  m/2, 
consisting  of  two  charges  connected  by  a  rigid  rod.  The  dipole  dis¬ 
turbs  the  field  pattern,  but  more  important  for  our  present  purposes 
is  that  it  experiences  a  torque.  In  this  example,  the  positive  charge 
feels  an  upward  force,  but  the  negative  charge  is  pulled  down.  The 
result  is  that  the  dipole  wants  to  align  itself  with  the  field,  m/3.  The 
microwave  oven  heats  food  with  electrical  (and  magnetic)  waves. 
The  alternation  of  the  torque  causes  the  molecules  to  wiggle  and  in¬ 
crease  the  amount  of  random  motion.  The  slightly  vague  definition 
of  a  dipole  given  above  can  be  improved  by  saying  that  a  dipole  is 
any  object  that  experiences  a  torque  in  an  electric  field. 

What  determines  the  torque  on  a  dipole  placed  in  an  externally 
created  held?  Torque  depends  on  the  force,  the  distance  from  the 
axis  at  which  the  force  is  applied,  and  the  angle  between  the  force 
and  the  line  from  the  axis  to  the  point  of  application.  Let  a  dipole 
consisting  of  charges  +q  and  —q  separated  by  a  distance  i  be  placed 
in  an  external  held  of  magnitude  |E|,  at  an  angle  9  with  respect  to 
the  held.  The  total  torque  on  the  dipole  is 


£g|E|  sin#. 


(Note  that  even  though  the  two  forces  are  in  opposite  directions, 
the  torques  do  not  cancel,  because  they  are  both  trying  to  twist 
the  dipole  in  the  same  direction.)  The  quantity  is  called  the  dipole 
moment,  notated  D.  (More  complex  dipoles  can  also  be  assigned 
a  dipole  moment  —  they  are  dehned  as  having  the  same  dipole 
moment  as  the  two-charge  dipole  that  would  experience  the  same 
torque.) 

Employing  a  little  more  mathematical  elegance,  we  can  define  a 
dipole  moment  vector , 
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q  -q 

n  /  Example  5. 


where  r *  is  the  position  vector  of  the  charge  labeled  by  the  index 
i.  We  can  then  write  the  torque  in  terms  of  a  vector  cross  product 
(page  281), 

r  =  D  x  E. 

No  matter  how  we  notate  it,  the  definition  of  the  dipole  moment 
requires  that  we  choose  a  point  from  which  we  measure  all  the  posi¬ 
tion  vectors  of  the  charges.  However,  in  the  commonly  encountered 
special  case  where  the  total  charge  of  the  object  is  zero,  the  dipole 
moment  is  the  same  regardless  of  this  choice. 

Dipole  moment  of  a  molecule  of  NaC!  gas  example  4 

>  In  a  molecule  of  NaCI  gas,  the  center-to-center  distance  be¬ 
tween  the  two  atoms  is  about  0.6  nm.  Assuming  that  the  chlo¬ 
rine  completely  steals  one  of  the  sodium’s  electrons,  compute  the 
magnitude  of  this  molecule’s  dipole  moment. 

>  The  total  charge  is  zero,  so  it  doesn’t  matter  where  we  choose 
the  origin  of  our  coordinate  system.  For  convenience,  let’s  choose 
it  to  be  at  one  of  the  atoms,  so  that  the  charge  on  that  atom 
doesn’t  contribute  to  the  dipole  moment.  The  magnitude  of  the 
dipole  moment  is  then 

D  =  { 6  x  1(T10  m)(e) 

=  (6  x  1(T10  m)(1 .6  x  1(T19  C) 

=  1  x  1CT28  C  •  m 


Dipole  moments  as  vectors  example  5 

t>  The  horizontal  and  vertical  spacing  between  the  charges  in  the 
figure  is  b.  Find  the  dipole  moment. 

>  Let  the  origin  of  the  coordinate  system  be  at  the  leftmost  charge. 

D  =  X>r/ 

=  (<7)0 0)  +  (~q){bt)  +  ( q){b*  +  by)  +  {-q){2bx) 

=  -2  bqii  +  bqy 


Alternative  definition  of  the  electric  field 

The  behavior  of  a  dipole  in  an  externally  created  field  leads  us 
to  an  alternative  definition  of  the  electric  field: 

The  electric  field  vector,  E,  at  any  location  in  space  is  defined 
by  observing  the  torque  exerted  on  a  test  dipole  Dt  placed  there. 
The  direction  of  the  field  is  the  direction  in  which  the  held  tends 
to  align  a  dipole  (from  —  to  +),  and  the  field’s  magnitude  is  |E|  = 
t/ Dt  sin  0.  In  other  words,  the  held  vector  is  the  vector  that  satishes 
the  equation  r  =  Df  x  E  for  any  test  dipole  placed  at  that  point 
in  space. 
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The  main  reason  for  introducing  a  second  definition  for  the  same 
concept  is  that  the  magnetic  field  is  most  easily  defined  using  a 
similar  approach. 

Energy  of  a  dipole  in  a  field 

A  dipole  in  an  external  field  has  a  stable  equilibrium  orientation 
in  which  D  is  parallel  to  E.  Since  the  vector  cross  product  vanishes 
for  parallel  vectors,  the  field’s  torque  on  the  dipole  is  zero.  (There 
is  also  an  unstable  equilibrium  with  D  and  E  antiparallel.)  This 
fact  is  probably  more  familiar  from  the  magnetic  context,  where  a 
magnetic  dipole  such  as  a  compass  needle  tends  to  align  itself  with 
an  external  magnetic  field.  We  will  encounter  magnetic  fields  and 
dipoles  later,  and  they  have  many  properties  analogous  to  those  of 
their  electric  counterparts.  Depending  on  its  orientation,  the  dipole 
will  have  some  interaction  energy  U  when  it  interacts  with  the  field. 
If  the  physical  size  of  the  dipole  is  small,  the  electric  field  can  be 
approximated  as  having  a  single  value  throughout.  Since  energy  is 
a  scalar,  and  the  dot  product  is  the  only  way  (up  to  a  multiplicative 
constant)  to  multiply  two  vectors  to  get  a  scalar  (sec.  3.4.5,  p.  212), 
we  must  have  U  oc  D  E.  Because  the  energy  is  minimized  when  D 
and  E  are  parallel,  the  constant  of  proportionality  must  be  negative, 
and  one  can  easily  show  by  considering  a  concrete  example  that  this 
constant  equals  —1.  Therefore  we  have 

U  =  —  D  E.  [energy  of  a  pointlike  dipole  in  a  field] 


Force  on  a  dipole  in  a  nonuniform  field  example  6 

If  a  dipole  with  zero  total  charge  is  placed  in  a  uniform  field,  it 
may  experience  a  torque,  but  it  will  not  experience  any  force.  The 
situation  changes  if  the  field  is  nonuniform.  The  force  can  be 
nonzero,  and,  perhaps  more  surprisingly,  the  force  depends  only 
on  the  dipole  moment,  not  on  the  details  of  the  arrangement  of 
the  charges  inside  the  dipole,  provided  that  the  dipole  is  small 
enough.  This  can  be  shown  either  by  a  brute-force  calculation 
(problem  59,  648)  or  by  the  following  slightly  slicker  technique. 
The  force  in  the  x  direction  is 


where  the  symbol  d  indicates  a  derivative  with  respect  to  the  x 
coordinate  only,  holding  y  and  z  constant.  (This  is  referred  to  as 
a  partial  derivative.)  We  then  have 


_  d  <9E 

F*-^adD'E)-D'av 

which  depends  on  the  dipole’s  properties  only  through  D.  Similar 
expressions  apply  for  the  y  and  z  components. 


This  principle  can  be  used  as  a  way  of  measuring  the  unknown 
dipole  moments  of  a  beam  of  particles,  as  in  figure  o. 


Section  10.1 


fixed  fixed 


<sf 


®f© 


o/An  electric  dipole  spec¬ 
trometer.  A  beam  of  randomly 
oriented  dipoles  is  shot  through  a 
“croquet  hoop”  consisting  of  two 
fixed  positive  charges.  Although 
the  field  along  the  central  axis  of 
symmetry  equals  zero,  the  field 
is  nonuniform,  and  therefore  the 
dipoles  feel  a  nonvanishing  force, 
and  are  sorted  out  according  to 
their  orientations.  A  magnetic 
version  of  this  device,  described 
on  p.  722,  was  used  in  the  his¬ 
toric  Stern-Gerlach  experiment, 
sec.  13.5.1,  p.  905. 
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p  /  Discussion 


+  2 
+  + 


A  In  the  definition  of  the  electric  field,  does  the  test  charge  need  to  be 
1  coulomb?  Does  it  need  to  be  positive? 

B  Does  a  charged  particle  such  as  an  electron  or  proton  feel  a  force 
from  its  own  electric  field? 

C  Is  there  an  electric  field  surrounding  a  wall  socket  that  has  nothing 
plugged  into  it,  or  a  battery  that  is  just  sitting  on  a  table? 

D  In  a  flashlight  powered  by  a  battery,  which  way  do  the  electric  fields 
point?  What  would  the  fields  be  like  inside  the  wires?  Inside  the  filament 
of  the  bulb? 

E  Criticize  the  following  statement:  “An  electric  field  can  be  represented 
by  a  sea  of  arrows  showing  how  current  is  flowing.” 

F  The  field  of  a  point  charge,  |E|  =  kQ/r2,  was  derived  in  a  self-check. 
How  would  the  field  pattern  of  a  uniformly  charged  sphere  compare  with 
the  field  of  a  point  charge? 

G  The  interior  of  a  perfect  electrical  conductor  in  equilibrium  must 
have  zero  electric  field,  since  otherwise  the  free  charges  within  it  would 
be  drifting  in  response  to  the  field,  and  it  would  not  be  in  equilibrium.  What 
about  the  field  right  at  the  surface  of  a  perfect  conductor?  Consider  the 
possibility  of  a  field  perpendicular  to  the  surface  or  parallel  to  it. 

H  Compare  the  dipole  moments  of  the  molecules  and  molecular  ions 
shown  in  the  figure. 

I  Small  pieces  of  paper  that  have  not  been  electrically  prepared  in 
any  way  can  be  picked  up  with  a  charged  object  such  as  a  charged  piece 
of  tape.  In  our  new  terminology,  we  could  describe  the  tape’s  charge  as 
inducing  a  dipole  moment  in  the  paper.  Can  a  similar  technique  be  used 
to  induce  not  just  a  dipole  moment  but  a  charge? 


10.2  Voltage  Related  To  Field 

10.2.1  One  dimension 

Voltage  is  electrical  energy  per  unit  charge,  and  electric  field  is 
force  per  unit  charge.  For  a  particle  moving  in  one  dimension,  along 
the  x  axis,  we  can  therefore  relate  voltage  and  field  if  we  start  from 
the  relationship  between  interaction  energy  and  force, 


dlJ  =  —Fx  dx, 


and  divide  by  charge, 


giving 

or 


d  U  Fx 

-  = - dx, 

q  q 


dV  =  —Ex  dx, 
dV 


dx 


=  —Er,.. 
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The  interpretation  is  that  a  strong  electric  field  occurs  in  a  region 
of  space  where  the  voltage  is  rapidly  changing.  By  analogy,  a  steep 
hillside  is  a  place  on  the  map  where  the  altitude  is  rapidly  changing. 

Field  generated  by  an  electric  eel  example  7 

>  Suppose  an  electric  eel  is  1  m  long,  and  generates  a  voltage 
difference  of  1000  volts  between  its  head  and  tail.  What  is  the 
electric  field  in  the  water  around  it? 

o  We  are  only  calculating  the  amount  of  field,  not  its  direction,  so 
we  ignore  positive  and  negative  signs.  Subject  to  the  possibly  in¬ 
accurate  assumption  of  a  constant  field  parallel  to  the  eel’s  body, 
we  have 


AV 

Ax 


[assumption  of  constant  field] 


=  1000  V/m. 


Relating  the  units  of  electric  field  and  voltage  example  8 

From  our  original  definition  of  the  electric  field,  we  expect  it  to 
have  units  of  newtons  per  coulomb,  N/C.  The  example  above, 
however,  came  out  in  volts  per  meter,  V/m.  Are  these  inconsis¬ 
tent?  Let’s  reassure  ourselves  that  this  all  works.  In  this  kind 
of  situation,  the  best  strategy  is  usually  to  simplify  the  more  com¬ 
plex  units  so  that  they  involve  only  mks  units  and  coulombs.  Since 
voltage  is  defined  as  electrical  energy  per  unit  charge,  it  has  units 
of  J/C: 


V  _  J/C 
m  m 
_  J 
_  C  •  m ' 


To  connect  joules  to  newtons,  we  recall  that  work  equals  force 
times  distance,  so  J  =  N  •  m,  so 

V  _  N  •  m 
m  Cm 

_  N 
“  C 

As  with  other  such  difficulties  with  electrical  units,  one  quickly 
begins  to  recognize  frequently  occurring  combinations. 
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Voltage  associated  with  a  point  charge  example  9 

>  What  is  the  voltage  associated  with  a  point  charge? 

>  As  derived  previously  in  self-check  A  on  page  569,  the  field  is 


E 


kQ 

7^ 


The  difference  in  voltage  between  two  points  on  the  same  radius 
line  is 


a  /  A  topographical  map  of 
Shelburne  Falls,  Mass.  (USGS) 


AV  = 


1 


6V 


Ex  dx 


In  the  general  discussion  above,  x  was  just  a  generic  name  for 
distance  traveled  along  the  line  from  one  point  to  the  other,  so  in 
this  case  x  really  means  r. 


AV  =  — 


Erdr 


" r '2  kQ  ^ 
-^rdr 


kQV 2 

71 

kQ  kQ 


F2 


The  standard  convention  is  to  use  p  =  oo  as  a  reference  point,  so 
that  the  voltage  at  any  distance  r  from  the  charge  is 


V  = 


kQ 

r 


The  interpretation  is  that  if  you  bring  a  positive  test  charge  closer 
to  a  positive  charge,  its  electrical  energy  is  increased;  if  it  was 
released,  it  would  spring  away,  releasing  this  as  kinetic  energy. 

self-check  B 

Show  that  you  can  recover  the  expression  for  the  field  of  a  point  charge 
by  evaluating  the  derivative  Ex  =  -  d  V /  dx.  >  Answer,  p.  965 


10.2.2  Two  or  three  dimensions 


b  /  The  constant-voltage  curves 
surrounding  a  point  charge.  Near 
the  charge,  the  curves  are  so 
closely  spaced  that  they  blend 
together  on  this  drawing  due  to 
the  finite  width  with  which  they 
were  drawn.  Some  electric  fields 
are  shown  as  arrows. 


The  topographical  map  in  figure  a  suggests  a  good  way  to  visu¬ 
alize  the  relationship  between  field  and  voltage  in  two  dimensions. 
Each  contour  on  the  map  is  a  line  of  constant  height;  some  of  these 
are  labeled  with  their  elevations  in  units  of  feet.  Height  is  related 
to  gravitational  energy,  so  in  a  gravitational  analogy,  we  can  think 
of  height  as  representing  voltage.  Where  the  contour  lines  are  far 
apart,  as  in  the  town,  the  slope  is  gentle.  Lines  close  together  indi¬ 
cate  a  steep  slope. 
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If  we  walk  along  a  straight  line,  say  straight  east  from  the  town, 
then  height  (voltage)  is  a  function  of  the  east- west  coordinate  x. 
Using  the  usual  mathematical  definition  of  the  slope,  and  writing 
V  for  the  height  in  order  to  remind  us  of  the  electrical  analogy,  the 
slope  along  such  a  line  is  dU/dx  (the  rise  over  the  run). 

What  if  everything  isn’t  confined  to  a  straight  line?  Water  flows 
downhill.  Notice  how  the  streams  on  the  map  cut  perpendicularly 
through  the  lines  of  constant  height. 

It  is  possible  to  map  voltages  in  the  same  way,  as  shown  in 
figure  b.  The  electric  field  is  strongest  where  the  constant-voltage 
curves  are  closest  together,  and  the  electric  field  vectors  always  point 
perpendicular  to  the  constant-voltage  curves. 

The  one-dimensional  relationship  E  =  —  dU /  dx  generalizes  to 
three  dimensions  as  follows: 

dx 

f-~E 

Ay 


This  can  be  notated  as  a  gradient  (page  215), 

E  =  VU, 


and  if  we  know  the  field  and  want  to  find  the  voltage,  we  can  use  a 
line  integral, 

AV  =  I  E-dr, 

Jc 

where  the  quantity  inside  the  integral  is  a  vector  dot  product. 

self-check  C 

Imagine  that  figure  a  represents  voltage  rather  than  height,  (a)  Con¬ 
sider  the  stream  the  starts  near  the  center  of  the  map.  Determine  the 
positive  and  negative  signs  of  dV / dx  and  dV / d y,  and  relate  these  to 
the  direction  of  the  force  that  is  pushing  the  current  forward  against  the 
resistance  of  friction,  (b)  If  you  wanted  to  find  a  lot  of  electric  charge  on 
this  map,  where  would  you  look?  o  Answer,  p.  965 

Figure  c  shows  some  examples  of  ways  to  visualize  held  and 
voltage  patterns. 
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c  /  Two-dimensional  field  and  volt¬ 
age  patterns.  Top:  A  uni¬ 
formly  charged  rod.  Bottom:  A 
dipole.  In  each  case,  the  diagram 
on  the  left  shows  the  field  vec¬ 
tors  and  constant-voltage  curves, 
while  the  one  on  the  right  shows 
the  voltage  (up-down  coordinate) 
as  a  function  of  x  and  y.  Inter¬ 
preting  the  field  diagrams:  Each 
arrow  represents  the  field  at  the 
point  where  its  tail  has  been  po¬ 
sitioned.  For  clarity,  some  of  the 
arrows  in  regions  of  very  strong 
field  strength  are  not  shown  — 
they  would  be  too  long  to  show. 
Interpreting  the  constant-voltage 
curves:  In  regions  of  very  strong 
fields,  the  curves  are  not  shown 
because  they  would  merge  to¬ 
gether  to  make  solid  black  re¬ 
gions.  Interpreting  the  perspec¬ 
tive  plots:  Keep  in  mind  that  even 
though  we’re  visualizing  things  in 
three  dimensions,  these  are  really 
two-dimensional  voltage  patterns 
being  represented.  The  third  (up- 
down)  dimension  represents  volt¬ 
age,  not  position. 


L 

< - ► 


-< - ► 

d 

a  /  Example  10. 


10.3  Fields  by  Superposition 

10.3.1  Electric  field  of  a  continuous  charge  distribution 

Charge  really  comes  in  discrete  chunks,  but  often  it  is  mathe¬ 
matically  convenient  to  treat  a  set  of  charges  as  if  they  were  like  a 
continuous  fluid  spread  throughout  a  region  of  space.  For  example, 
a  charged  metal  ball  will  have  charge  spread  nearly  uniformly  all 
over  its  surface,  and  for  most  purposes  it  will  make  sense  to  ignore 
the  fact  that  this  uniformity  is  broken  at  the  atomic  level.  The 
electric  field  made  by  such  a  continuous  charge  distribution  is  the 
sum  of  the  fields  created  by  every  part  of  it.  If  we  let  the  “parts” 
become  infinitesimally  small,  we  have  a  sum  of  an  infinitely  many 
infinitesimal  numbers:  an  integral.  If  it  was  a  discrete  sum,  as  in 
example  3  on  page  570,  we  would  have  a  total  electric  field  in  the  x 
direction  that  was  the  sum  of  all  the  x  components  of  the  individual 
fields,  and  similarly  we’d  have  sums  for  the  y  and  z  components.  In 
the  continuous  case,  we  have  three  integrals.  Let’s  keep  it  simple  by 
starting  with  a  one-dimensional  example. 

Field  of  a  uniformly  charged  rod  example  1 0 

>  A  rod  of  length  L  has  charge  Q  spread  uniformly  along  it.  Find 
the  electric  field  at  a  point  a  distance  d  from  the  center  of  the  rod, 
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along  the  rod’s  axis. 


o  This  is  a  one-dimensional  situation,  so  we  really  only  need  to 
do  a  single  integral  representing  the  total  field  along  the  axis.  We 
imagine  breaking  the  rod  down  into  short  pieces  of  length  dz, 
each  with  charge  d q.  Since  charge  is  uniformly  spread  along  the 
rod,  we  have  d q  =  Adz,  where  A  =  Q/L  (Greek  lambda)  is  the 
charge  per  unit  length,  in  units  of  coulombs  per  meter.  Since 
the  pieces  are  infinitesimally  short,  we  can  treat  them  as  point 
charges  and  use  the  expression  kdq/r2  for  their  contributions  to 
the  field,  where  r  =  d  -  z  is  the  distance  from  the  charge  at  z  to 
the  point  in  which  we  are  interested. 


Ez 


1 


kdq 

~r ^ 


r+/-/2  AAdz 


J-L/2  r 

r+L/ 2 

AA 


dz 


—L/2  ( d-zy 


The  integral  can  be  looked  up  in  a  table,  or  reduced  to  an  ele¬ 
mentary  form  by  substituting  a  new  variable  for  d  -  z.  The  result 
is 


Ez  =  AA 


d-z 


+L/2 

-L/2 


kQ  {  1  1  A 

L  \d  —  L/2  d+L/2j ' 


For  large  values  of  d,  this  expression  gets  smaller  for  two  rea¬ 
sons:  (1)  the  denominators  of  the  fractions  become  large,  and 
(2)  the  two  fractions  become  nearly  the  same,  and  tend  to  can¬ 
cel  out.  This  makes  sense,  since  the  field  should  get  weaker  as 
we  get  farther  away  from  the  charge.  In  fact,  the  field  at  large 
distances  must  approach  kQ/d 2  (homework  problem  2). 


It’s  also  interesting  to  note  that  the  field  becomes  infinite  at  the 
ends  of  the  rod,  but  is  not  infinite  on  the  interior  of  the  rod.  Can 
you  explain  physically  why  this  happens? 


Example  10  was  one-dimensional.  In  the  general  three-dimensional 
case,  we  might  have  to  integrate  all  three  components  of  the  field. 
However,  there  is  a  trick  that  lets  us  avoid  this  much  complication. 
The  voltage  is  a  scalar,  so  we  can  find  the  voltage  by  doing  just  a 
single  integral,  then  use  the  voltage  to  find  the  field. 

Voltage,  then  field  example  1 1 

>  A  rod  of  length  L  is  uniformly  charged  with  charge  Q.  Find  the 
field  at  a  point  lying  in  the  midplane  of  the  rod  at  a  distance  R. 


4 


4 


R 


o  By  symmetry,  the  field  has  only  a  radial  component,  ER,  point-  b  /  Example  1 1 . 
ing  directly  away  from  the  rod  (or  toward  it  for  Q  <  0).  The 
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brute-force  approach,  then,  would  be  to  evaluate  the  integral  E  = 
f  |  dE|cos  0,  where  dE  is  the  contribution  to  the  field  from  a  charge 
d q  at  some  point  along  the  rod,  and  0  is  the  angle  dE  makes  with 
the  radial  line. 

It’s  easier,  however,  to  find  the  voltage  first,  and  then  find  the 
field  from  the  voltage.  Since  the  voltage  is  a  scalar,  we  simply 
integrate  the  contribution  6V  from  each  charge  d q,  without  even 
worrying  about  angles  and  directions.  Let  z  be  the  coordinate 
that  measures  distance  up  and  down  along  the  rod,  with  z  =  0  at 
the  center  of  the  rod.  Then  the  distance  between  a  point  z  on  the 
rod  and  the  point  of  interest  is  r  =  Vz2  +  R2,  and  we  have 


V  = 


1 


kdq 

r 


=  kk 

=  kk 


r+L/2  d Iz 

t-L/2  r 

rL/2  dz 


l-L/2  Vz2  +  R2 


The  integral  can  be  looked  up  in  a  table,  or  evaluated  using  com¬ 
puter  software: 


V  =  kk  In 


(z  +  Vz2  +  R2^j 


+L/2 

-L/2 


f  L/2  +  y/L2/4  +  R2  \ 
\-L/2  +  ^L2/4  +  R2) 


The  expression  inside  the  parentheses  can  be  simplified  a  little. 
Leaving  out  some  tedious  algebra,  the  result  is 


V-2kMn  iW1  +  4^ 


This  can  readily  be  differentiated  to  find  the  field: 
dV 

£r  = - 


d  R 
(— 2/cA) 


-L/2R2  +  (1  /2)(1  +  L2/4R2)~y2{-L2/2R3) 


L/2R  +  (1  +  L2/4R2y/2 

or,  after  some  simplification, 

_  kkL 
R  ~  R2sJ  1  +L2/4W 
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For  large  values  of  R,  the  square  root  approaches  one,  and  we 
have  simply  ER  «  kkL/R2  =  kQ/R2.  In  other  words,  the  field 
very  far  away  is  the  same  regardless  of  whether  the  charge  is  a 
point  charge  or  some  other  shape  like  a  rod.  This  is  intuitively 
appealing,  and  doing  this  kind  of  check  also  helps  to  reassure 
one  that  the  final  result  is  correct. 

The  preceding  example,  although  it  involved  some  messy  alge¬ 
bra,  required  only  straightforward  calculus,  and  no  vector  operations 
at  all,  because  we  only  had  to  integrate  a  scalar  function  to  find  the 
voltage.  The  next  example  is  one  in  which  we  can  integrate  either 
the  field  or  the  voltage  without  too  much  complication. 


On-axis  field  of  a  ring  of  charge  example  12 

>  Find  the  voltage  and  field  along  the  axis  of  a  uniformly  charged 
ring. 

>  Integrating  the  voltage  is  straightforward. 

V  =  J^ 

=  k  [  dc? 

J  Vb2  +  z2 

Vb2  +  z2  J 
kQ 

Vb2  +  z2’ 

where  Q  is  the  total  charge  of  the  ring.  This  result  could  have 
been  derived  without  calculus,  since  the  distance  r  is  the  same 
for  every  point  around  the  ring,  i.e. ,  the  integrand  is  a  constant. 
It  would  also  be  straightforward  to  find  the  field  by  differentiating 
this  expression  with  respect  to  z  (homework  problem  10). 


Instead,  let’s  see  how  to  find  the  field  by  direct  integration.  By 
symmetry,  the  field  at  the  point  of  interest  can  have  only  a  com¬ 
ponent  along  the  axis  of  symmetry,  the  z  axis: 

Ex  =  0 


Ey=  0 

To  find  the  field  in  the  z  direction,  we  integrate  the  z  components 
contributed  to  the  field  by  each  infinitesimal  part  of  the  ring. 

E7=  /  d  E7 


=  y  |dE|  cos  0, 

where  0  is  the  angle  shown  in  the  figure. 

kdq 


E7  = 


cos  0 


=  k 


d  q 


b2  +  z2 


cos  0 


« - - Z - ► 


c  /  Example  12. 
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Everything  inside  the  integral  is  a  constant,  so  we  have 


d  /  Example  13:  geometry. 


e  /  Example  13:  the  field  on 
both  sides  (for  cr  >  0). 


E2  E2 


f/A  capacitor  consisting  of 
two  disks  with  opposite  charges. 


b2  +  z2 
kQ 

b2  +  z2 
kQ 

b2  +  z2 
kQz 


cos  0  /  d q 


cos  0 

z 

r 


(. b 2  +  z2 


,3/2 


In  all  the  examples  presented  so  far,  the  charge  has  been  con¬ 
fined  to  a  one- dimensional  line  or  curve.  Although  it  is  possible,  for 
example,  to  put  charge  on  a  piece  of  wire,  it  is  more  common  to 
encounter  practical  devices  in  which  the  charge  is  distributed  over  a 
two-dimensional  surface,  as  in  the  flat  metal  plates  used  in  Thom¬ 
son’s  experiments.  Mathematically,  we  can  approach  this  type  of 
calculation  with  the  divide-and-conquer  technique:  slice  the  surface 
into  lines  or  curves  whose  fields  we  know  how  to  calculate,  and  then 
add  up  the  contributions  to  the  field  from  all  these  slices.  In  the 
limit  where  the  slices  are  imagined  to  be  infinitesimally  thin,  we 
have  an  integral. 

Field  of  a  uniformly  charged  disk  example  13 

>  A  circular  disk  is  uniformly  charged.  (The  disk  must  be  an  in¬ 
sulator;  if  it  was  a  conductor,  then  the  repulsion  of  all  the  charge 
would  cause  it  to  collect  more  densely  near  the  edge.)  Find  the 
field  at  a  point  on  the  axis,  at  a  distance  z  from  the  plane  of  the 
disk. 

>  We’re  given  that  every  part  of  the  disk  has  the  same  charge  per 
unit  area,  so  rather  than  working  with  Q,  the  total  charge,  it  will 
be  easier  to  use  the  charge  per  unit  area,  conventionally  notated 
a  (Greek  sigma),  a  =  Q/nb2. 

Since  we  already  know  the  field  due  to  a  ring  of  charge,  we  can 
solve  the  problem  by  slicing  the  disk  into  rings,  with  each  ring 
extending  from  r  to  r  +  dr.  The  area  of  such  a  ring  equals  its 
circumference  multiplied  by  its  width,  i.e. ,  2nrdr,  so  its  charge  is 
dq  =  2 7t or  dr,  and  from  the  result  of  example  12,  its  contribution 
to  the  field  is 


d  Ez  = 


kzdq 


(r2  +  z2)3/2 
2nokzrdr 
(r2  +  z2)312 
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The  total  field  is 


d  E7 


=  2nukz 

1  ~ 

rdr 

Jo  ( 

[r2  +  z2)' 

= 2nokz 

- 

1 

r= b 

+  z2 

r=0 

=  2n<jk  ( 

1  - 

z 

3/2 


\fW 


+ 


The  result  of  example  13  has  some  interesting  properties.  First, 
we  note  that  it  was  derived  on  the  unspoken  assumption  of  z  >  0. 
By  symmetry,  the  field  on  the  other  side  of  the  disk  must  be  equally 
strong,  but  in  the  opposite  direction,  as  shown  in  figures  e  and  g. 
Thus  there  is  a  discontinuity  in  the  field  at  z  =  0.  In  reality,  the 
disk  will  have  some  finite  thickness,  and  the  switching  over  of  the 
field  will  be  rapid,  but  not  discontinuous. 

At  large  values  of  z,  i.e. ,  z  ^$>  b,  the  field  rapidly  approaches  the 
1/r2  variation  that  we  expect  when  we  are  so  far  from  the  disk  that 
the  disk’s  size  and  shape  cannot  matter  (homework  problem  2). 


z/b 

g  /  Example  13:  variation  of  the  field  (a  >  0). 

A  practical  application  is  the  case  of  a  capacitor,  f,  having  two 
parallel  circular  plates  very  close  together.  In  normal  operation,  the 
charges  on  the  plates  are  opposite,  so  one  plate  has  fields  pointing 
into  it  and  the  other  one  has  fields  pointing  out.  In  a  real  capacitor, 
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h  /  Close  to  the  surface,  the 
relationship  between  E  and  a 
is  a  fixed  one,  regardless  of 
the  geometry.  The  flea  can’t 
determine  the  size  or  shape  of 
her  world  by  comparing  E  and  cr. 


the  plates  are  a  metal  conductor,  not  an  insulator,  so  the  charge 
will  tend  to  arrange  itself  more  densely  near  the  edges,  rather  than 
spreading  itself  uniformly  on  each  plate.  Furthermore,  we  have  only 
calculated  the  on-axis  field  in  example  13;  in  the  off-axis  region, 
each  disk’s  contribution  to  the  field  will  be  weaker,  and  it  will  also 
point  away  from  the  axis  a  little.  But  if  we  are  willing  to  ignore 
these  complications  for  the  sake  of  a  rough  analysis,  then  the  fields 
superimpose  as  shown  in  figure  f:  the  fields  cancel  the  outside  of 
the  capacitor,  but  between  the  plates  its  value  is  double  that  con¬ 
tributed  by  a  single  plate.  This  cancellation  on  the  outside  is  a  very 
useful  property  for  a  practical  capacitor.  For  instance,  if  you  look  at 
the  printed  circuit  board  in  a  typical  piece  of  consumer  electronics, 
there  are  many  capacitors,  often  placed  fairly  close  together.  If  their 
exterior  fields  didn’t  cancel  out  nicely,  then  each  capacitor  would  in¬ 
teract  with  its  neighbors  in  a  complicated  way,  and  the  behavior  of 
the  circuit  would  depend  on  the  exact  physical  layout,  since  the 
interaction  would  be  stronger  or  weaker  depending  on  distance.  In 
reality,  a  capacitor  does  create  weak  external  electric  fields,  but  their 
effects  are  often  negligible,  and  we  can  then  use  the  lumped- circuit 
approximation ,  which  states  that  each  component’s  behavior  de¬ 
pends  only  on  the  currents  that  flow  in  and  out  of  it,  not  on  the 
interaction  of  its  fields  with  the  other  components. 

10.3.2  The  field  near  a  charged  surface 

From  a  theoretical  point  of  view,  there  is  something  even  more 
intriguing  about  example  13:  the  magnitude  of  the  field  for  small 
values  of  z  (z  <C  b)  is  E  =  2-Kko,  which  doesn’t  depend  on  b  at 
all  for  a  fixed  value  of  a.  If  we  made  a  disk  with  twice  the  radius, 
and  covered  it  with  the  same  number  of  coulombs  per  square  meter 
(resulting  in  a  total  charge  four  times  as  great),  the  field  close  to 
the  disk  would  be  unchanged!  That  is,  a  flea  living  near  the  center 
of  the  disk,  h,  would  have  no  way  of  determining  the  size  of  her  flat 
“planet”  by  measuring  the  local  field  and  charge  density.  (Only  by 
leaping  off  the  surface  into  outer  space  would  she  be  able  to  measure 
fields  that  were  dependent  on  b.  If  she  traveled  very  far,  to  z  i$>  b, 
she  would  be  in  the  region  where  the  field  is  well  approximated  by 
| E |  ~  kQ/z2  =  kirb2a/z2,  which  she  could  solve  for  b .) 

What  is  the  reason  for  this  surprisingly  simple  behavior  of  the 
held?  Is  it  a  piece  of  mathematical  trivia,  true  only  in  this  particular 
case?  What  if  the  shape  was  a  square  rather  than  a  circle?  In  other 
words,  the  flea  gets  no  information  about  the  size  of  the  disk  from 
measuring  E,  since  E  =  2irka,  independent  of  b,  but  what  if  she 
didn’t  know  the  shape ,  either?  If  the  result  for  a  square  had  some 
other  geometrical  factor  in  front  instead  of  27t,  then  she  could  tell 
which  shape  it  was  by  measuring  E.  The  surprising  mathematical 
fact,  however,  is  that  the  result  for  a  square,  indeed  for  any  shape 
whatsoever,  is  E  =  2nak.  It  doesn’t  even  matter  whether  the  sur- 
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face  is  flat  or  warped,  or  whether  the  density  of  charge  is  different 
at  parts  of  the  surface  which  are  far  away  compared  to  the  flea’s 
distance  above  the  surface. 

This  universal  E±  =  2irkcr  held  perpendicular  to  a  charged 
surface  can  be  proved  mathematically  based  on  Gauss’s  law1  (sec¬ 
tion  10.6),  but  we  can  understand  what’s  happening  on  qualitative 
grounds.  Suppose  on  night,  while  the  flea  is  asleep,  someone  adds 
more  surface  area,  also  positively  charged,  around  the  outside  edge 
of  her  disk-shaped  world,  doubling  its  radius.  The  added  charge, 
however,  has  very  little  effect  on  the  held  in  her  environment,  as 
long  as  she  stays  at  low  altitudes  above  the  surface.  As  shown  in 
hgure  i,  the  new  charge  to  her  west  contributes  a  held,  T,  that  is 
almost  purely  “horizontal”  (i.e.,  parallel  to  the  surface)  and  to  the 
east.  It  has  a  negligible  upward  component,  since  the  angle  is  so 
shallow.  This  new  eastward  contribution  to  the  held  is  exactly  can¬ 
celed  out  by  the  westward  held,  S,  created  by  the  new  charge  to 
her  east.  There  is  likewise  almost  perfect  cancellation  between  any 
other  pair  of  opposite  compass  directions. 

A  similar  argument  can  be  made  as  to  the  shape-independence  of 
the  result,  as  long  as  the  shape  is  symmetric.  For  example,  suppose 
that  the  next  night,  the  tricky  real  estate  developers  decide  to  add 
corners  to  the  disk  and  transform  it  into  a  square.  Each  corner’s 
contribution  to  the  held  measured  at  the  center  is  canceled  by  the 
held  due  to  the  corner  diagonally  across  from  it. 

What  if  the  hea  goes  on  a  trip  away  from  the  center  of  the  disk? 
The  perfect  cancellation  of  the  “horizontal”  helds  contributed  by 
distant  charges  will  no  longer  occur,  but  the  “vertical”  held  (i.e.,  the 
held  perpendicular  to  the  surface)  will  still  be  E_ j_  =  2irka,  where  a  is 
the  local  charge  density,  since  the  distant  charges  can’t  contribute  to 
the  vertical  held.  The  same  result  applies  if  the  shape  of  the  surface 
is  asymmetric,  and  doesn’t  even  have  any  well-defined  geometric 
center:  the  component  perpendicular  to  the  surface  is  E±  =  2irka, 
but  we  may  have  E\\  ^  0.  All  of  the  above  arguments  can  be  made 
more  rigorous  by  discussing  mathematical  limits  rather  than  using 
words  like  “very  small.”  There  is  not  much  point  in  giving  a  rigorous 
proof  here,  however,  since  we  will  be  able  to  demonstrate  this  fact 
as  a  corollary  of  Gauss’  Law  in  section  10.6.  The  result  is  as  follows: 

At  a  point  lying  a  distance  z  from  a  charged  surface,  the  com¬ 
ponent  of  the  electric  held  perpendicular  to  the  surface  obeys 

lim  E  i  =  2nk(J, 

0 

where  a  is  the  charge  per  unit  area.  This  is  true  regardless  of  the 
shape  or  size  of  the  surface. 


1  rhymes  with  “mouse” 


P 


i  /  Fields  contributed  by  nearby 
parts  of  the  surface,  P,  Q,  and  R, 
contribute  to  E±.  Fields  due  to 
distant  charges,  S,  and  T,  have 
very  small  contributions  to  E± 
because  of  their  shallow  angles. 


Eocd"2 


d"1 


d0 


j  /  Example  14. 
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The  field  near  a  point,  line,  or  surface  charge  example  14 

t>  Compare  the  variation  of  the  electric  field  with  distance,  d,  for 
small  values  of  d  in  the  case  of  a  point  charge,  an  infinite  line  of 
charge,  and  an  infinite  charged  surface. 

>  For  a  point  charge,  we  have  already  found  E  oc  d  2  for  the 
magnitude  of  the  field,  where  we  are  now  using  d  for  the  quantity 
we  would  ordinarily  notate  as  r.  This  is  true  for  all  values  of  d, 
not  just  for  small  d  —  it  has  to  be  that  way,  because  the  point 
charge  has  no  size,  so  if  E  behaved  differently  for  small  and  large 
d,  there  would  be  no  way  to  decide  what  d  had  to  be  small  or 
large  relative  to. 

For  a  line  of  charge,  the  result  of  example  1 1  is 
F  -  /cAL 

~  c/V1  +  L2/4cfi 

In  the  limit  of  d  <  L,  the  quantity  inside  the  square  root  is  domi¬ 
nated  by  the  second  term,  and  we  have  E  oc  d~\ 

Finally,  in  the  case  of  a  charged  surface,  the  result  is  simply  E  = 
2nok,  or  E  oc  d°. 

Notice  the  lovely  simplicity  of  the  pattern,  as  shown  in  figure  j.  A 
point  is  zero-dimensional:  it  has  no  length,  width,  or  breadth.  A 
line  is  one-dimensional,  and  a  surface  is  two-dimensional.  As  the 
dimensionality  of  the  charged  object  changes  from  0  to  1,  and 
then  to  2,  the  exponent  in  the  near-field  expression  goes  from  2 
to  1  to  0. 
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10.4  Energy  In  Fields 

10.4.1  Electric  field  energy 

Fields  possess  energy,  as  argued  on  page  565,  but  how  much 
energy?  The  answer  can  be  found  using  the  following  elegant  ap¬ 
proach.  We  assume  that  the  electric  energy  contained  in  an  infinites¬ 
imal  volume  of  space  dv  is  given  by  d Ue  =  /(E)  dv,  where  /  is  some 
function,  which  we  wish  to  determine,  of  the  Held  E.  It  might  seem 
that  we  would  have  no  easy  way  to  determine  the  function  /,  but 
many  of  the  functions  we  could  cook  up  would  violate  the  symme¬ 
try  of  space.  For  instance,  we  could  imagine  /(E)  =  aEy ,  where  a 
is  some  constant  with  the  appropriate  units.  However,  this  would 
violate  the  symmetry  of  space,  because  it  would  give  the  y  axis  a 
different  status  from  x  and  z.  As  discussed  on  page  212,  if  we  wish 
to  calculate  a  scalar  based  on  some  vectors,  the  dot  product  is  the 
only  way  to  do  it  that  has  the  correct  symmetry  properties.  If  all 
we  have  is  one  vector,  E,  then  the  only  scalar  we  can  form  is  E  •  E, 
which  is  the  square  of  the  magnitude  of  the  electric  field  vector. 

In  principle,  the  energy  function  we  are  seeking  could  be  propor¬ 
tional  to  E  •  E,  or  to  any  function  computed  from  it,  such  as  VE  •  E 
or  (E  •  E)' .  On  physical  grounds,  however,  the  only  possibility  that 
works  is  E  •  E.  Suppose,  for  instance,  that  we  pull  apart  two  oppo¬ 
sitely  charged  capacitor  plates,  as  shown  in  figure  a.  We  are  doing 
work  by  pulling  them  apart  against  the  force  of  their  electrical  at¬ 
traction,  and  this  quantity  of  mechanical  work  equals  the  increase 
in  electrical  energy,  XJe.  Using  our  previous  approach  to  energy,  we 
would  have  thought  of  Ue  as  a  quantity  which  depended  on  the  dis¬ 
tance  of  the  positive  and  negative  charges  from  each  other,  but  now 
we’re  going  to  imagine  Ue  as  being  stored  within  the  electric  field 
that  exists  in  the  space  between  and  around  the  charges.  When  the 
plates  are  touching,  their  fields  cancel  everywhere,  and  there  is  zero 
electrical  energy.  When  they  are  separated,  there  is  still  approxi¬ 
mately  zero  field  on  the  outside,  but  the  field  between  the  plates  is 
nonzero,  and  holds  some  energy. 

Now  suppose  we  carry  out  the  whole  process,  but  with  the  plates 
carrying  double  their  previous  charges.  Since  Coulomb’s  law  in¬ 
volves  the  product  q\q2  of  two  charges,  we  have  quadrupled  the  force 
between  any  given  pair  of  charged  particles,  and  the  total  attractive 
force  is  therefore  also  four  times  greater  than  before.  This  means 
that  the  work  done  in  separating  the  plates  is  four  times  greater,  and 
so  is  the  energy  Ue  stored  in  the  field.  The  field,  however,  has  merely 
been  doubled  at  any  given  location:  the  electric  field  E+  due  to  the 
positively  charged  plate  is  doubled,  and  similarly  for  the  contribu¬ 
tion  E_  from  the  negative  one,  so  the  total  electric  field  E+  +  E 
is  also  doubled.  Thus  doubling  the  field  results  in  an  electrical  en¬ 
ergy  which  is  four  times  greater,  i.e.,  the  energy  density  must  be 
proportional  to  the  square  of  the  field,  dUe  oc  (E  •  E)  du.  For  ease 
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a  /  Two  oppositely  charged 
capacitor  plates  are  pulled  apart. 
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of  notation,  we  write  this  as  d Ue  oc  E2  dv,  or  d Ue  =  aE 2  du,  where 
a  is  a  constant  of  proportionality.  Note  that  we  never  really  made 
use  of  any  of  the  details  of  the  geometry  of  figure  a,  so  the  reasoning 
is  of  general  validity.  In  other  words,  not  only  is  d Ue  =  aE 2  du  the 
function  that  works  in  this  particular  case,  but  there  is  every  reason 
to  believe  that  it  would  work  in  other  cases  as  well. 

It  now  remains  only  to  find  a.  Since  the  constant  must  be  the 
same  in  all  situations,  we  only  need  to  find  one  example  in  which 
we  can  compute  the  field  and  the  energy,  and  then  we  can  deter¬ 
mine  a.  The  situation  shown  in  figure  a  is  just  about  the  easiest 
example  to  analyze.  We  let  the  square  capacitor  plates  be  uni¬ 
formly  covered  with  charge  densities  +cr  and  — a ,  and  we  write  b 
for  the  lengths  of  their  sides.  Let  h  be  the  gap  between  the  plates 
after  they  have  been  separated.  We  choose  h  <C  6,  so  that  the 
held  experienced  by  the  negative  plate  due  to  the  positive  plate 
is  E+  =  2nka.  The  charge  of  the  negative  plate  is  —o b2,  so  the 
magnitude  of  the  force  attracting  it  back  toward  the  positive  plate 
is  (force)  =  (charge)  (held)  =  2nka2b2.  The  amount  of  work  done 
in  separating  the  plates  is  (work)  =  (force) (distance)  =  2nkcr2b2h. 
This  is  the  amount  of  energy  that  has  been  stored  in  the  held  be¬ 
tween  the  two  plates,  Ue  =  2n ka2b2h  =  2nka2v ,  where  v  is  the 
volume  of  the  region  between  the  plates. 

We  want  to  equate  this  to  XJe  =  aE2v.  (We  can  write  Ue  and 
v  rather  than  d Ue  and  du,  since  the  held  is  constant  in  the  region 
between  the  plates.)  The  held  between  the  plates  has  contributions 
from  both  plates,  E  =  E+  +  E-  =  Anker.  (We  only  used  half  this 
value  in  the  computation  of  the  work  done  on  the  moving  plate, 
since  the  moving  plate  can’t  make  a  force  on  itself.  Mathematically, 
each  plate  is  in  a  region  where  its  own  held  is  reversing  directions, 
so  we  can  think  of  its  own  contribution  to  the  held  as  being  zero 
within  itself.)  We  then  have  aE2v  =  a  ■  I6n2k2a2  ■  v ,  and  setting 
this  equal  to  Ue  =  2nka2v  from  the  result  of  the  work  computation, 
we  find  a  =  l/8nk.  Our  hnal  result  is  as  follows: 

The  electric  energy  possessed  by  an  electric  held  E  occupying  an 
inhnitesimal  volume  of  space  du  is  given  by 


d  Ue 


1 

8nk 


E2  dv, 


where  E2  =  E  •  E  is  the  square  of  the  magnitude  of  the  electric  held. 


This  is  reminiscent  of  how  waves  behave:  the  energy  content  of 
a  wave  is  typically  proportional  to  the  square  of  its  amplitude. 
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self-check  D 

We  can  think  of  the  quantity  6Ue/  tiv  as  the  energy  density  due  to  the 
electric  field,  i.e.,  the  number  of  joules  per  cubic  meter  needed  in  order 
to  create  that  field,  (a)  How  does  this  quantity  depend  on  the  compo¬ 
nents  of  the  field  vector,  Ex,  Ey,  and  Ez?  (b)  Suppose  we  have  a  field 
with  Ex  ^  0,  Ey= 0,  and  Ez= 0.  What  would  happen  to  the  energy  density 
if  we  reversed  the  sign  of  Ex?  >  Answer,  p.  965 

A  numerical  example  example  15 

>  A  capacitor  has  plates  whose  areas  are  10~4  m2,  separated  by 
a  gap  of  10“5  m.  A  1.5-volt  battery  is  connected  across  it.  How 
much  energy  is  sucked  out  of  the  battery  and  stored  in  the  elec¬ 
tric  field  between  the  plates?  (A  real  capacitor  typically  has  an 
insulating  material  between  the  plates  whose  molecules  interact 
electrically  with  the  charge  in  the  plates.  For  this  example,  we’ll 
assume  that  there  is  just  a  vacuum  in  between  the  plates.  The 
plates  are  also  typically  rolled  up  rather  than  flat.) 

>  To  connect  this  with  our  previous  calculations,  we  need  to  find 
the  charge  density  on  the  plates  in  terms  of  the  voltage  we  were 
given.  Our  previous  examples  were  based  on  the  assumption  that 
the  gap  between  the  plates  was  small  compared  to  the  size  of  the 
plates.  Is  this  valid  here?  Well,  if  the  plates  were  square,  then 
the  area  of  10~4  m2  would  imply  that  their  sides  were  10~2  m  in 
length.  This  is  indeed  very  large  compared  to  the  gap  of  10~5 
m,  so  this  assumption  appears  to  be  valid  (unless,  perhaps,  the 
plates  have  some  very  strange,  long  and  skinny  shape). 

Based  on  this  assumption,  the  field  is  relatively  uniform  in  the 
whole  volume  between  the  plates,  so  we  can  use  a  single  symbol, 
E,  to  represent  its  magnitude,  and  the  relation  E  =  dl//dx  is 
equivalent  to  E  =  AV /Ax  =  (1 .5  V)/(gap)  =  1 .5  x  105  V/m. 

Since  the  field  is  uniform,  we  can  dispense  with  the  calculus,  and 
replace  d Ue  =  (1  /8nk)E2bv  with  Ue  =  (1  /8nk)E2v.  The  volume 
equals  the  area  multiplied  by  the  gap,  so  we  have 

Ue  =  (1/87t/c)E2(area)(gap) 

= - - - *(1.5  x  105  V/m)2(1CT4  m2)(1CT5  m) 

87t  x  9  x  109  N-m2/C2 

=  1  x  1CT10  J 


self-check  E 

Show  that  the  units  in  the  preceding  example  really  do  work  out  to  be 
joules.  >  Answer,  p.  966 

Why  k  is  on  the  bottom  example  16 

It  may  also  seem  strange  that  the  constant  k  is  in  the  denomina¬ 
tor  of  the  equation  d Ue  =  (1  /8nk)E2dv.  The  Coulomb  constant  k 
tells  us  how  strong  electric  forces  are,  so  shouldn’t  it  be  on  top? 
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b  /  Example  17. 


c  /  Example  B.  Part  of  the 
outside  sphere  has  been  drawn 
as  if  it  is  transparent,  in  order  to 
show  the  inside  sphere. 


No.  Consider,  for  instance,  an  alternative  universe  in  which  elec¬ 
tric  forces  are  twice  as  strong  as  in  ours.  The  numerical  value  of 
k  is  doubled.  Because  k  is  doubled,  all  the  electric  field  strengths 
are  doubled  as  well,  which  quadruples  the  quantity  E2.  In  the  ex¬ 
pression  Ez/8nk,  we’ve  quadrupled  something  on  top  and  dou¬ 
bled  something  on  the  bottom,  which  makes  the  energy  twice  as 
big.  That  makes  perfect  sense. 

Potential  energy  of  a  pair  of  opposite  charges  example  1 7 
Imagine  taking  two  opposite  charges,  b,  that  were  initially  far 
apart  and  allowing  them  to  come  together  under  the  influence 
of  their  electrical  attraction. 

According  to  our  old  approach,  electrical  energy  is  lost  because 
the  electric  force  did  positive  work  as  it  brought  the  charges  to¬ 
gether.  (This  makes  sense  because  as  they  come  together  and 
accelerate  it  is  their  electrical  energy  that  is  being  lost  and  con¬ 
verted  to  kinetic  energy.) 

By  the  new  method,  we  must  ask  how  the  energy  stored  in  the 
electric  field  has  changed.  In  the  region  indicated  approximately 
by  the  shading  in  the  figure,  the  superposing  fields  of  the  two 
charges  undergo  partial  cancellation  because  they  are  in  oppos¬ 
ing  directions.  The  energy  in  the  shaded  region  is  reduced  by 
this  effect.  In  the  unshaded  region,  the  fields  reinforce,  and  the 
energy  is  increased. 

It  would  be  quite  a  project  to  do  an  actual  numerical  calculation  of 
the  energy  gained  and  lost  in  the  two  regions  (this  is  a  case  where 
the  old  method  of  finding  energy  gives  greater  ease  of  computa¬ 
tion),  but  it  is  fairly  easy  to  convince  oneself  that  the  energy  is 
less  when  the  charges  are  closer.  This  is  because  bringing  the 
charges  together  shrinks  the  high-energy  unshaded  region  and 
enlarges  the  low-energy  shaded  region. 


A  spherical  capacitor  example  1 8 

>  A  spherical  capacitor,  c,  consists  of  two  concentric  spheres  of 
radii  a  and  b.  Find  the  energy  required  to  charge  up  the  capacitor 
so  that  the  plates  hold  charges  +q  and  -q. 


>  On  page  102,  I  proved  that  for  gravitational  forces,  the  inter¬ 
action  of  a  spherical  shell  of  mass  with  other  masses  outside  it 
is  the  same  as  if  the  shell’s  mass  was  concentrated  at  its  cen¬ 
ter.  On  the  interior  of  such  a  shell,  the  forces  cancel  out  exactly. 
Since  gravity  and  the  electric  force  both  vary  as  1  /r2,  the  same 
proof  carries  over  immediately  to  electrical  forces.  The  magnitude 
of  the  outward  electric  field  contributed  by  the  charge  +q  of  the 
central  sphere  is  therefore 


(  0,  r  <  a 
\  kq/r2,  r  >  a 


where  r  is  the  distance  from  the  center.  Similarly,  the  magnitude 
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of  the  inward  field  contributed  by  the  outside  sphere  is 

ip  ,  _  /  °>  r  <  b 

11  \  kq/r2,  r  >  b 

In  the  region  outside  the  whole  capacitor,  the  two  fields  are  equal 
in  magnitude,  but  opposite  in  direction,  so  they  cancel.  We  then 
have  for  the  total  field 

(  0,  r  <  a 

|E|  =  <  kq/r2,  a  <  r  <  b  , 

I  0,  r  >  b 


so  to  calculate  the  energy,  we  only  need  to  worry  about  the  region 
a<  r  <  b.  The  energy  density  in  this  region  is 

dUe  ^  2 


=  kq2r~\ 

87T 

This  expression  only  depends  on  r,  so  the  energy  density  is  con¬ 
stant  across  any  sphere  of  radius  r.  We  can  slice  the  region 
a  <  r  <  b  into  concentric  spherical  layers,  like  an  onion,  and 
the  energy  within  one  such  layer,  extending  from  r  to  r  +  dr  is 


dUe 


dUe 

dv 

dUe 

dv 


dv 


(area  of  shell)(thickness  of  shell) 


=  (^r-4)(47tr2)(dr) 


8n 


dr. 


Integrating  over  all  the  layers  to  find  the  total  energy,  we  have 


Ue  =  J  dUe 

dr 


2  U  b) 
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(1)  (2) 

d  /  Discussion  question  A. 


Discussion  Questions 

A  The  figure  shows  a  positive  charge  in  the  gap  between  two  capacitor 
plates.  Compare  the  energy  of  the  electric  fields  in  the  two  cases.  Does 
this  agree  with  what  you  would  have  expected  based  on  your  knowledge 
of  electrical  forces? 


B  The  figure  shows  a  spherical  capacitor.  In  the  text,  the  energy  stored 
in  its  electric  field  is  shown  to  be 


What  happens  if  the  difference  between  b  and  a  is  very  small?  Does  this 
make  sense  in  terms  of  the  mechanical  work  needed  in  order  to  separate 
the  charges?  Does  it  make  sense  in  terms  of  the  energy  stored  in  the 
electric  field?  Should  these  two  energies  be  added  together? 

Similarly,  discuss  the  cases  of  b  ->•  oo  and  a  ->•  0. 

C  Criticize  the  following  statement:  “A  solenoid  makes  a  charge  in  the 
space  surrounding  it,  which  dissipates  when  you  release  the  energy.” 

D  In  example  17  on  page  590,  I  argued  that  for  the  charges  shown 
in  the  figure,  the  fields  contain  less  energy  when  the  charges  are  closer 
together,  because  the  region  of  cancellation  expanded,  while  the  region 
of  reinforcing  fields  shrank.  Perhaps  a  simpler  approach  is  to  consider 
the  two  extreme  possibilities:  the  case  where  the  charges  are  infinitely 
far  apart,  and  the  one  in  which  they  are  at  zero  distance  from  each  other, 
i.e.,  right  on  top  of  each  other.  Carry  out  this  reasoning  for  the  case  of 
(1)  a  positive  charge  and  a  negative  charge  of  equal  magnitude,  (2)  two 
positive  charges  of  equal  magnitude,  (3)  the  gravitational  energy  of  two 
equal  masses. 


10.4.2  Gravitational  field  energy 

Example  B  depended  on  the  close  analogy  between  electric  and 
gravitational  forces.  In  fact,  every  argument,  proof,  and  example 
discussed  so  far  in  this  section  is  equally  valid  as  a  gravitational 
example,  provided  we  take  into  account  one  fact:  only  positive  mass 
exists,  and  the  gravitational  force  between  two  masses  is  attractive. 
This  is  the  opposite  of  what  happens  with  electrical  forces,  which 
are  repulsive  in  the  case  of  two  positive  charges.  As  a  consequence  of 
this,  we  need  to  assign  a  negative  energy  density  to  the  gravitational 
field!  For  a  gravitational  field,  we  have 


where  g2  =  g  •  g  is  the  square  of  the  magnitude  of  the  gravitational 
field. 


10.4.3  Magnetic  field  energy 

So  far  we’ve  only  touched  in  passing  on  the  topic  of  magnetic 
fields,  which  will  deal  with  in  detail  in  chapter  11.  Magnetism  is 
an  interaction  between  moving  charge  and  moving  charge,  i.e.,  be¬ 
tween  currents  and  currents.  Since  a  current  has  a  direction  in 
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space,2  while  charge  doesn’t,  we  can  anticipate  that  the  mathemat¬ 
ical  rule  connecting  a  magnetic  field  to  its  source-currents  will  have 
to  be  completely  different  from  the  one  relating  the  electric  field  to 
its  source-charges.  However,  if  you  look  carefully  at  the  argument 
leading  to  the  relation  d Ue/ dv  =  E2/8irk,  you’ll  see  that  these 
mathematical  details  were  only  necessary  to  the  part  of  the  argu¬ 
ment  in  which  we  fixed  the  constant  of  proportionality.  To  establish 
dUe/  du  oc  E2,  we  only  had  to  use  three  simple  facts: 

•  The  field  is  proportional  to  the  source. 

•  Forces  are  proportional  to  fields. 

•  Field  contributed  by  multiple  sources  add  like  vectors. 

All  three  of  these  statements  are  true  for  the  magnetic  field  as  well, 
so  without  knowing  anything  more  specific  about  magnetic  fields  — 
not  even  what  units  are  used  to  measure  them!  —  we  can  state 
with  certainty  that  the  energy  density  in  the  magnetic  field  is  pro¬ 
portional  to  the  square  of  the  magnitude  of  the  magnetic  field.  The 
constant  of  proportionality  is  given  on  p.  671. 


2Current  is  a  scalar,  since  the  definition  /  =  d q/  df  is  the  derivative  of  a 
scalar.  However,  there  is  a  closely  related  quantity  called  the  current  density,  J, 
which  is  a  vector,  and  J  is  in  fact  the  more  fundamentally  important  quantity. 
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The  long  road  leading  from  the  light  bulb  to  the  computer  started 
with  one  very  important  step:  the  introduction  of  feedback  into 
electronic  circuits.  Although  the  principle  of  feedback  has  been  un¬ 
derstood  and  and  applied  to  mechanical  systems  for  centuries,  and 
to  electrical  ones  since  the  early  twentieth  century,  for  most  of  us 
the  word  evokes  an  image  of  Jirni  Hendrix  (or  some  more  recent 
guitar  hero)  intentionally  creating  earsplitting  screeches,  or  of  the 
school  principal  doing  the  same  inadvertently  in  the  auditorium.  In 
the  guitar  example,  the  musician  stands  in  front  of  the  amp  and 
turns  it  up  so  high  that  the  sound  waves  coming  from  the  speaker 
come  back  to  the  guitar  string  and  make  it  shake  harder.  This  is 
an  example  of  positive  feedback:  the  harder  the  string  vibrates,  the 
stronger  the  sound  waves,  and  the  stronger  the  sound  waves,  the 
harder  the  string  vibrates.  The  only  limit  is  the  power-handling 
ability  of  the  amplifier. 

Negative  feedback  is  equally  important.  Your  thermostat,  for 
example,  provides  negative  feedback  by  kicking  the  heater  off  when 
the  house  gets  warm  enough,  and  by  firing  it  up  again  when  it 
gets  too  cold.  This  causes  the  house’s  temperature  to  oscillate  back 
and  forth  within  a  certain  range.  Just  as  out-of-control  exponential 
freak-outs  are  a  characteristic  behavior  of  positive-feedback  systems, 
oscillation  is  typical  in  cases  of  negative  feedback.  You  have  already 
studied  negative  feedback  extensively  in  section  3.3  in  the  case  of  a 
mechanical  system,  although  we  didn’t  call  it  that. 

10.5.1  Capacitance  and  inductance 

In  a  mechanical  oscillation,  energy  is  exchanged  repetitively  be¬ 
tween  potential  and  kinetic  forms,  and  may  also  be  siphoned  off  in 
the  form  of  heat  dissipated  by  friction.  In  an  electrical  circuit,  re¬ 
sistors  are  the  circuit  elements  that  dissipate  heat.  What  are  the 
electrical  analogs  of  storing  and  releasing  the  potential  and  kinetic 
energy  of  a  vibrating  object?  When  you  think  of  energy  storage  in 
an  electrical  circuit,  you  are  likely  to  imagine  a  battery,  but  even 
rechargeable  batteries  can  only  go  through  10  or  100  cycles  before 
they  wear  out.  In  addition,  batteries  are  not  able  to  exchange  en¬ 
ergy  on  a  short  enough  time  scale  for  most  applications.  The  circuit 
in  a  musical  synthesizer  may  be  called  upon  to  oscillate  thousands 
of  times  a  second,  and  your  microwave  oven  operates  at  gigahertz 
frequencies.  Instead  of  batteries,  we  generally  use  capacitors  and 
inductors  to  store  energy  in  oscillating  circuits.  Capacitors,  which 
you’ve  already  encountered,  store  energy  in  electric  fields.  An  in¬ 
ductor  does  the  same  with  magnetic  fields. 

Capacitors 

A  capacitor’s  energy  exists  in  its  surrounding  electric  fields.  It  is 
proportional  to  the  square  of  the  field  strength,  which  is  proportional 
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to  the  charges  on  the  plates.  If  we  assume  the  plates  carry  charges 
that  are  the  same  in  magnitude,  +q  and  —q,  then  the  energy  stored 
in  the  capacitor  must  be  proportional  to  q2.  For  historical  reasons, 
we  write  the  constant  of  proportionality  as  1/2 C, 


U°  -  2 C*  ' 


The  constant  C  is  a  geometrical  property  of  the  capacitor,  called  its 
capacitance. 


Based  on  this  definition,  the  units  of  capacitance  must  be  coulombs 
squared  per  joule,  and  this  combination  is  more  conveniently  abbre¬ 
viated  as  the  farad,  1  F  =  1  C2/J.  “Condenser”  is  a  less  formal 
term  for  a  capacitor.  Note  that  the  labels  printed  on  capacitors 
often  use  MF  to  mean  pF,  even  though  MF  should  really  be  the 
symbol  for  megafarads,  not  microfarads.  Confusion  doesn’t  result 
from  this  nonstandard  notation,  since  picofarad  and  microfarad  val¬ 
ues  are  the  most  common,  and  it  wasn’t  until  the  1990’s  that  even 
millifarad  and  farad  values  became  available  in  practical  physical 
sizes.  Figure  a  shows  the  symbol  used  in  schematics  to  represent  a 
capacitor. 

A  parallel-plate  capacitor  example  19 

>  Suppose  a  capacitor  consists  of  two  parallel  metal  plates  with 
area  A,  and  the  gap  between  them  is  h.  The  gap  is  small  com¬ 
pared  to  the  dimensions  of  the  plates.  What  is  the  capacitance? 

o  Since  the  plates  are  metal,  the  charges  on  each  plate  are  free 
to  move,  and  will  tend  to  cluster  themselves  more  densely  near 
the  edges  due  to  the  mutual  repulsion  of  the  other  charges  in  the 
same  plate.  However,  it  turns  out  that  if  the  gap  is  small,  this  is 
a  small  effect,  so  we  can  get  away  with  assuming  uniform  charge 
density  on  each  plate.  The  result  of  example  1 5  then  applies,  and 
for  the  region  between  the  plates,  we  have  E  =  4nko  =  4nkq/A 
and  Ue  =  (1  /8nk)E2Ah.  Substituting  the  first  expression  into  the 
second,  we  find  Ue  =  2nkq2h/A.  Comparing  this  to  the  definition 
of  capacitance,  we  end  up  with  C  =  A/4nkh. 


Inductors 

Any  current  will  create  a  magnetic  field,  so  in  fact  every  current- 
carrying  wire  in  a  circuit  acts  as  an  inductor!  However,  this  type 
of  “stray”  inductance  is  typically  negligible,  just  as  we  can  usually 
ignore  the  stray  resistance  of  our  wires  and  only  take  into  account 
the  actual  resistors.  To  store  any  appreciable  amount  of  magnetic 
energy,  one  usually  uses  a  coil  of  wire  designed  specifically  to  be 
an  inductor.  All  the  loops’  contribution  to  the  magnetic  field  add 
together  to  make  a  stronger  held.  Unlike  capacitors  and  resistors, 
practical  inductors  are  easy  to  make  by  hand.  One  can  for  instance 
spool  some  wire  around  a  short  wooden  dowel.  An  inductor  like 
this,  in  the  form  cylindrical  coil  of  wire,  is  called  a  solenoid,  c,  and 
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a  /  The  symbol  for  a  capaci¬ 
tor. 


b  /  Some  capacitors. 


c  /  Two  common  geometries 
for  inductors.  The  cylindrical 
shape  on  the  left  is  called  a 
solenoid. 


d/The  symbol  for  an  induc¬ 
tor. 


e  /  Some  inductors. 


Section  10.5  LRC  Circuits 


595 


a  stylized  solenoid,  d,  is  the  symbol  used  to  represent  an  inductor 
in  a  circuit  regardless  of  its  actual  geometry. 


How  much  energy  does  an  inductor  store?  The  energy  density  is 
proportional  to  the  square  of  the  magnetic  field  strength,  which  is 
in  turn  proportional  to  the  current  flowing  through  the  coiled  wire, 
so  the  energy  stored  in  the  inductor  must  be  proportional  to  1 2 .  We 
write  L/ 2  for  the  constant  of  proportionality,  giving 


UL  = 


l=l1+l2 


f  /  Inductances  in  series  add. 


C— C  ji-C 


2 


g  /  Capacitances  in  parallel 
add. 


As  in  the  definition  of  capacitance,  we  have  a  factor  of  1/2, 
which  is  purely  a  matter  of  definition.  The  quantity  L  is  called  the 
inductance  of  the  inductor,  and  we  see  that  its  units  must  be  joules 
per  ampere  squared.  This  clumsy  combination  of  units  is  more 
commonly  abbreviated  as  the  henry,  1  henry  =  1  J/A2.  Rather 
than  memorizing  this  definition,  it  makes  more  sense  to  derive  it 
when  needed  from  the  definition  of  inductance.  Many  people  know 
inductors  simply  as  “coils,”  or  “chokes,”  and  will  not  understand 
you  if  you  refer  to  an  “inductor,”  but  they  will  still  refer  to  L  as  the 
“inductance,”  not  the  “coilance”  or  “chokeance!” 

There  is  a  lumped  circuit  approximation  for  inductors,  just  like 
the  one  for  capacitors  (p.  584).  For  a  capacitor,  this  means  assuming 
that  the  electric  fields  are  completely  internal,  so  that  components 
only  interact  via  currents  that  flow  through  wires,  not  due  to  the 
physical  overlapping  of  their  fields  in  space.  Similarly  for  an  induc¬ 
tor,  the  lumped  circuit  approximation  is  the  assumption  that  the 
magnetic  fields  are  completely  internal. 

Identical  inductances  in  series  example  20 

If  two  inductors  are  placed  in  series,  any  current  that  passes 
through  the  combined  double  inductor  must  pass  through  both 
its  parts.  If  we  assume  the  lumped  circuit  approximation,  the 
two  inductors’  fields  don’t  interfere  with  each  other,  so  the  energy 
is  doubled  for  a  given  current.  Thus  by  the  definition  of  induc¬ 
tance,  the  inductance  is  doubled  as  well.  In  general,  inductances 
in  series  add,  just  like  resistances.  The  same  kind  of  reason¬ 
ing  also  shows  that  the  inductance  of  a  solenoid  is  approximately 
proportional  to  its  length,  assuming  the  number  of  turns  per  unit 
length  is  kept  constant.  (This  is  only  approximately  true,  because 
putting  two  solenoids  end-to-end  causes  the  fields  just  outside 
their  mouths  to  overlap  and  add  together  in  a  complicated  man¬ 
ner.  In  other  words,  the  lumped-circuit  approximation  may  not  be 
very  good.) 

Identical  capacitances  in  parallel  example  21 

When  two  identical  capacitances  are  placed  in  parallel,  any  charge 
deposited  at  the  terminals  of  the  combined  double  capacitor  will 
divide  itself  evenly  between  the  two  parts.  The  electric  fields  sur- 
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rounding  each  capacitor  will  be  half  the  intensity,  and  therefore 
store  one  quarter  the  energy.  Two  capacitors,  each  storing  one 
quarter  the  energy,  give  half  the  total  energy  storage.  Since  ca¬ 
pacitance  is  inversely  related  to  energy  storage,  this  implies  that 
identical  capacitances  in  parallel  give  double  the  capacitance.  In 
general,  capacitances  in  parallel  add.  This  is  unlike  the  behav¬ 
ior  of  inductors  and  resistors,  for  which  series  configurations  give 
addition. 

This  is  consistent  with  the  result  of  example  19,  which  had  the 
capacitance  of  a  single  parallel-plate  capacitor  proportional  to  the 
area  of  the  plates.  If  we  have  two  parallel-plate  capacitors,  and 
we  combine  them  in  parallel  and  bring  them  very  close  together 
side  by  side,  we  have  produced  a  single  capacitor  with  plates  of 
double  the  area,  and  it  has  approximately  double  the  capacitance, 
subject  to  any  violation  of  the  lumped-circuit  approximation  due  to 
the  interaction  of  the  fields  where  the  edges  of  the  capacitors  are 
joined  together. 

Inductances  in  parallel  and  capacitances  in  series  are  explored 
in  homework  problems  36  and  33. 

A  variable  capacitor  example  22 

Figure  h/1  shows  the  construction  of  a  variable  capacitor  out  of 
two  parallel  semicircles  of  metal.  One  plate  is  fixed,  while  the 
other  can  be  rotated  about  their  common  axis  with  a  knob.  The 
opposite  charges  on  the  two  plates  are  attracted  to  one  another, 
and  therefore  tend  to  gather  in  the  overlapping  area.  This  over¬ 
lapping  area,  then,  is  the  only  area  that  effectively  contributes  to 
the  capacitance,  and  turning  the  knob  changes  the  capacitance. 
The  simple  design  can  only  provide  very  small  capacitance  val¬ 
ues,  so  in  practice  one  usually  uses  a  bank  of  capacitors,  wired 
in  parallel,  with  all  the  moving  parts  on  the  same  shaft. 

Discussion  Questions 

A  Suppose  that  two  parallel-plate  capacitors  are  wired  in  parallel,  and 
are  placed  very  close  together,  side  by  side,  so  that  the  lumped  circuit 
approximation  is  not  very  accurate.  Will  the  resulting  capacitance  be  too 
small,  or  too  big?  Could  you  twist  the  circuit  into  a  different  shape  and 
make  the  effect  be  the  other  way  around,  or  make  the  effect  vanish?  How 
about  the  case  of  two  inductors  in  series? 

B  Most  practical  capacitors  do  not  have  an  air  gap  or  vacuum  gap 
between  the  plates;  instead,  they  have  an  insulating  substance  called  a 
dielectric.  We  can  think  of  the  molecules  in  this  substance  as  dipoles  that 
are  free  to  rotate  (at  least  a  little),  but  that  are  not  free  to  move  around, 
since  it  is  a  solid.  The  figure  shows  a  highly  stylized  and  unrealistic  way 
of  visualizing  this.  We  imagine  that  all  the  dipoles  are  intially  turned  side¬ 
ways,  (1 ),  and  that  as  the  capacitor  is  charged,  they  all  respond  by  turning 
through  a  certain  angle,  (2).  (In  reality,  the  scene  might  be  much  more 
random,  and  the  alignment  effect  much  weaker.) 

For  simplicity,  imagine  inserting  just  one  electric  dipole  into  the  vacuum 


h  /  A  variable  capacitor. 
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i  /  Discussion  question  B. 
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L  R  C 


j  /  A  series  LRC  circuit. 


m 


P— M 


friction  =  -bv 


k/A  mechanical  analogy  for 
the  LRC  circuit. 


gap.  For  a  given  amount  of  charge  on  the  plates,  how  does  this  affect 
the  amount  of  energy  stored  in  the  electric  field?  How  does  this  affect  the 
capacitance? 

Now  redo  the  analysis  in  terms  of  the  mechanical  work  needed  in  order 
to  charge  up  the  plates. 


10.5.2  Oscillations 

Figure  j  shows  the  simplest  possible  oscillating  circuit.  For  any 
useful  application  it  would  actually  need  to  include  more  compo¬ 
nents.  For  example,  if  it  was  a  radio  tuner,  it  would  need  to  be 
connected  to  an  antenna  and  an  amplifier.  Nevertheless,  all  the 
essential  physics  is  there. 

We  can  analyze  it  without  any  sweat  or  tears  whatsoever,  sim¬ 
ply  by  constructing  an  analogy  with  a  mechanical  system.  In  a 
mechanical  oscillator,  k,  we  have  two  forms  of  stored  energy, 


U, 


spring 


=  -kx2 


K  =  —mvl 
2 


a) 

(2) 


In  the  case  of  a  mechanical  oscillator,  we  have  usually  assumed 
a  friction  force  of  the  form  that  turns  out  to  give  the  nicest  math¬ 
ematical  results,  F  =  —bv.  In  the  circuit,  the  dissipation  of  energy 
into  heat  occurs  via  the  resistor,  with  no  mechanical  force  involved, 
so  in  order  to  make  the  analogy,  we  need  to  restate  the  role  of  the 
friction  force  in  terms  of  energy.  The  power  dissipated  by  friction 
equals  the  mechanical  work  it  does  in  a  time  interval  d t,  divided  by 
dt,  P  =  IF/  dt  =  F  dx/  d t  =  Fv  =  —bv2,  so 

rate  of  heat  dissipation  =  —  bv2.  (3) 


self-check  F 

Equation  (1)  has  x  squared,  and  equations  (2)  and  (3)  have  v  squared. 
Because  they’re  squared,  the  results  don’t  depend  on  whether  these 
variables  are  positive  or  negative.  Does  this  make  physical  sense?  > 
Answer,  p.  966 

In  the  circuit,  the  stored  forms  of  energy  are 

Uc  =  Fe  U') 

Ul  =  \LI2,  (2') 

and  the  rate  of  heat  dissipation  in  the  resistor  is 

rate  of  heat  dissipation  =  —RI2.  {%!) 

Comparing  the  two  sets  of  equations,  we  first  form  analogies  between 
quantities  that  represent  the  state  of  the  system  at  some  moment 
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in  time: 


X  -H-  q 


self-check  G 

How  is  v  related  mathematically  to  x?  How  is  /  connected  to  q ?  Are  the 
two  relationships  analogous?  >  Answer,  p.  966 

Next  we  relate  the  ones  that  describe  the  system’s  permanent 
characteristics: 


k  l/C 
m  o  L 
b  ^  R 


Since  the  mechanical  system  naturally  oscillates  with  a  frequency3 
uj  ~  yjk/m  ,  we  can  immediately  solve  the  electrical  version  by  anal¬ 
ogy,  giving 

Since  the  resistance  R  is  analogous  to  b  in  the  mechanical  case, 
we  find  that  the  Q  (quality  factor,  not  charge)  of  the  resonance 
is  inversely  proportional  to  R,  and  the  width  of  the  resonance  is 
directly  proportional  to  R. 

Tuning  a  radio  receiver  example  23 

A  radio  receiver  uses  this  kind  of  circuit  to  pick  out  the  desired 
station.  Since  the  receiver  resonates  at  a  particular  frequency, 
stations  whose  frequencies  are  far  off  will  not  excite  any  response 
in  the  circuit.  The  value  of  R  has  to  be  small  enough  so  that  only 
one  station  at  a  time  is  picked  up,  but  big  enough  so  that  the 
tuner  isn’t  too  touchy.  The  resonant  frequency  can  be  tuned  by 
adjusting  either  L  or  C,  but  variable  capacitors  are  easier  to  build 
than  variable  inductors. 

A  numerical  calculation  example  24 

The  phone  company  sends  more  than  one  conversation  at  a  time 
over  the  same  wire,  which  is  accomplished  by  shifting  each  voice 
signal  into  different  range  of  frequencies  during  transmission.  The 
number  of  signals  per  wire  can  be  maximized  by  making  each 
range  of  frequencies  (known  as  a  bandwidth)  as  small  as  possi¬ 
ble.  It  turns  out  that  only  a  relatively  narrow  range  of  frequencies 
is  necessary  in  order  to  make  a  human  voice  intelligible,  so  the 

3  As  in  chapter  2,  we  use  the  word  “frequency”  to  mean  either  /  or  w  =  2-rvf 
when  the  context  makes  it  clear  which  is  being  referred  to. 
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phone  company  filters  out  all  the  extreme  highs  and  lows.  (This  is 
why  your  phone  voice  sounds  different  from  your  normal  voice.) 

>  If  the  filter  consists  of  an  LRC  circuit  with  a  broad  resonance 
centered  around  1.0  kHz,  and  the  capacitor  is  1  pF  (microfarad), 
what  inductance  value  must  be  used? 

>  Solving  for  L,  we  have 


Ccu2 

1 

“  (10-6  F)(2ttx  103  s-1)2 
=  2.5  x  10”3  F-1s2 

Checking  that  these  really  are  the  same  units  as  henries  is  a  little 
tedious,  but  it  builds  character: 

F_1s2  =  (C2/J)-V 
=  JC-2s2 
=  J/A2 
=  H 

The  result  is  25  mH  (millihenries). 

This  is  actually  quite  a  large  inductance  value,  and  would  require 
a  big,  heavy,  expensive  coil.  In  fact,  there  is  a  trick  for  making 
this  kind  of  circuit  small  and  cheap.  There  is  a  kind  of  silicon 
chip  called  an  op-amp,  which,  among  other  things,  can  be  used 
to  simulate  the  behavior  of  an  inductor.  The  main  limitation  of  the 
op-amp  is  that  it  is  restricted  to  low-power  applications. 

10.5.3  Voltage  and  current 

What  is  physically  happening  in  one  of  these  oscillating  circuits? 
Let’s  first  look  at  the  mechanical  case,  and  then  draw  the  analogy 
to  the  circuit.  For  simplicity,  let’s  ignore  the  existence  of  damping, 
so  there  is  no  friction  in  the  mechanical  oscillator,  and  no  resistance 
in  the  electrical  one. 

Suppose  we  take  the  mechanical  oscillator  and  pull  the  mass 
away  from  equilibrium,  then  release  it.  Since  friction  tends  to  resist 
the  spring’s  force,  we  might  naively  expect  that  having  zero  friction 
would  allow  the  mass  to  leap  instantaneously  to  the  equilibrium 
position.  This  can’t  happen,  however,  because  the  mass  would  have 
to  have  infinite  velocity  in  order  to  make  such  an  instantaneous  leap. 
Infinite  velocity  would  require  infinite  kinetic  energy,  but  the  only 
kind  of  energy  that  is  available  for  conversion  to  kinetic  is  the  energy 
stored  in  the  spring,  and  that  is  finite,  not  infinite.  At  each  step  on 
its  way  back  to  equilibrium,  the  mass’s  velocity  is  controlled  exactly 
by  the  amount  of  the  spring’s  energy  that  has  so  far  been  converted 
into  kinetic  energy.  After  the  mass  reaches  equilibrium,  it  overshoots 
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due  to  its  own  momentum.  It  performs  identical  oscillations  on  both 
sides  of  equilibrium,  and  it  never  loses  amplitude  because  friction  is 
not  available  to  convert  mechanical  energy  into  heat. 

Now  with  the  electrical  oscillator,  the  analog  of  position  is  charge. 
Pulling  the  mass  away  from  equilibrium  is  like  depositing  charges 
+q  and  —  q  on  the  plates  of  the  capacitor.  Since  resistance  tends 
to  resist  the  flow  of  charge,  we  might  imagine  that  with  no  fric¬ 
tion  present,  the  charge  would  instantly  flow  through  the  inductor 
(which  is,  after  all,  just  a  piece  of  wire),  and  the  capacitor  would 
discharge  instantly.  However,  such  an  instant  discharge  is  impossi¬ 
ble,  because  it  would  require  infinite  current  for  one  instant.  Infinite 
current  would  create  infinite  magnetic  fields  surrounding  the  induc¬ 
tor,  and  these  fields  would  have  infinite  energy.  Instead,  the  rate 
of  flow  of  current  is  controlled  at  each  instant  by  the  relationship 
between  the  amount  of  energy  stored  in  the  magnetic  field  and  the 
amount  of  current  that  must  exist  in  order  to  have  that  strong  a 
field.  After  the  capacitor  reaches  q  =  0,  it  overshoots.  The  circuit 
has  its  own  kind  of  electrical  “inertia,”  because  if  charge  was  to  stop 
flowing,  there  would  have  to  be  zero  current  through  the  inductor. 
But  the  current  in  the  inductor  must  be  related  to  the  amount  of 
energy  stored  in  its  magnetic  fields.  When  the  capacitor  is  at  q  =  0, 
all  the  circuit’s  energy  is  in  the  inductor,  so  it  must  therefore  have 
strong  magnetic  fields  surrounding  it  and  quite  a  bit  of  current  going 
through  it. 

The  only  thing  that  might  seem  spooky  here  is  that  we  used  to 
speak  as  if  the  current  in  the  inductor  caused  the  magnetic  field, 
but  now  it  sounds  as  if  the  field  causes  the  current.  Actually  this  is 
symptomatic  of  the  elusive  nature  of  cause  and  effect  in  physics.  It’s 
equally  valid  to  think  of  the  cause  and  effect  relationship  in  either 
way.  This  may  seem  unsatisfying,  however,  and  for  example  does  not 
really  get  at  the  question  of  what  brings  about  a  voltage  difference 
across  the  resistor  (in  the  case  where  the  resistance  is  finite);  there 
must  be  such  a  voltage  difference,  because  without  one,  Ohm’s  law 
would  predict  zero  current  through  the  resistor. 

Voltage,  then,  is  what  is  really  missing  from  our  story  so  far. 

Let’s  start  by  studying  the  voltage  across  a  capacitor.  Voltage  is 
electrical  potential  energy  per  unit  charge,  so  the  voltage  difference 
between  the  two  plates  of  the  capacitor  is  related  to  the  amount  by 
which  its  energy  would  increase  if  we  increased  the  absolute  values 
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of  the  charges  on  the  plates  from  q  to  q  +  dq: 


VC 


( Uq+dq  -  uq)/dq 

dUc 


dq 


q 


c 


I /The  inductor  releases  en¬ 
ergy  and  gives  it  to  the  black  box. 


Many  books  use  this  as  the  definition  of  capacitance.  This  equation, 
by  the  way,  probably  explains  the  historical  reason  why  C  was  de¬ 
fined  so  that  the  energy  was  inversely  proportional  to  C  for  a  given 
value  of  q:  the  people  who  invented  the  definition  were  thinking  of  a 
capacitor  as  a  device  for  storing  charge  rather  than  energy,  and  the 
amount  of  charge  stored  for  a  fixed  voltage  (the  charge  “capacity”) 
is  proportional  to  C. 

In  the  case  of  an  inductor,  we  know  that  if  there  is  a  steady,  con¬ 
stant  current  flowing  through  it,  then  the  magnetic  field  is  constant, 
and  so  is  the  amount  of  energy  stored;  no  energy  is  being  exchanged 
between  the  inductor  and  any  other  circuit  element.  But  what  if 
the  current  is  changing?  The  magnetic  field  is  proportional  to  the 
current,  so  a  change  in  one  implies  a  change  in  the  other.  For  con¬ 
creteness,  let’s  imagine  that  the  magnetic  field  and  the  current  are 
both  decreasing.  The  energy  stored  in  the  magnetic  field  is  there¬ 
fore  decreasing,  and  by  conservation  of  energy,  this  energy  can’t  just 
go  away  —  some  other  circuit  element  must  be  taking  energy  from 
the  inductor.  The  simplest  example,  shown  in  figure  1,  is  a  series 
circuit  consisting  of  the  inductor  plus  one  other  circuit  element.  It 
doesn’t  matter  what  this  other  circuit  element  is,  so  we  just  call  it  a 
black  box,  but  if  you  like,  we  can  think  of  it  as  a  resistor,  in  which 
case  the  energy  lost  by  the  inductor  is  being  turned  into  heat  by 
the  resistor.  The  junction  rule  tells  us  that  both  circuit  elements 
have  the  same  current  through  them,  so  I  could  refer  to  either  one, 
and  likewise  the  loop  rule  tells  us  VindUctor  +  Vbiack  box  =  0,  so  the 
two  voltage  drops  have  the  same  absolute  value,  which  we  can  refer 
to  as  V .  Whatever  the  black  box  is,  the  rate  at  which  it  is  taking 
energy  from  the  inductor  is  given  by  |P|  =  \IV\,  so 


\IV\ 


d  UL 

dt 


-  I  -LI 


dt  \2 

lA 

dt 
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or 


\V\  = 


if 

dt 


which  in  many  books  is  taken  to  be  the  definition  of  inductance. 
The  direction  of  the  voltage  drop  (plus  or  minus  sign)  is  such  that 
the  inductor  resists  the  change  in  current. 

There’s  one  very  intriguing  thing  about  this  result.  Suppose, 
for  concreteness,  that  the  black  box  in  figure  1  is  a  resistor,  and 
that  the  inductor’s  energy  is  decreasing,  and  being  converted  into 
heat  in  the  resistor.  The  voltage  drop  across  the  resistor  indicates 
that  it  has  an  electric  field  across  it,  which  is  driving  the  current. 
But  where  is  this  electric  field  coming  from?  There  are  no  charges 
anywhere  that  could  be  creating  it!  What  we’ve  discovered  is  one 
special  case  of  a  more  general  principle,  the  principle  of  induction:  a 
changing  magnetic  field  creates  an  electric  field,  which  is  in  addition 
to  any  electric  field  created  by  charges.  (The  reverse  is  also  true: 
any  electric  field  that  changes  over  time  creates  a  magnetic  field.) 
Induction  forms  the  basis  for  such  technologies  as  the  generator  and 
the  transformer,  and  ultimately  it  leads  to  the  existence  of  light, 
which  is  a  wave  pattern  in  the  electric  and  magnetic  fields.  These 
are  all  topics  for  chapter  11,  but  it’s  truly  remarkable  that  we  could 
come  to  this  conclusion  without  yet  having  learned  any  details  about 
magnetism. 


© 


m  /  Electric  fields  made  by  charges,  1 ,  and  by  changing  magnetic  fields,  2  and  3. 

The  cartoons  in  figure  m  compares  electric  fields  made  by  charges, 

1,  to  electric  fields  made  by  changing  magnetic  fields,  2-3.  In  m/1, 
two  physicists  are  in  a  room  whose  ceiling  is  positively  charged  and 
whose  floor  is  negatively  charged.  The  physicist  on  the  bottom 
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n  /  Ballasts  for  fluorescent  lights. 
Top:  a  big,  heavy  inductor  used 
as  a  ballast  in  an  old-fashioned 
fluorescent  bulb.  Bottom:  a 
small  solid-state  ballast,  built  into 
the  base  of  a  modern  compact 
fluorescent  bulb. 


throws  a  positively  charged  bowling  ball  into  the  curved  pipe.  The 
physicist  at  the  top  uses  a  radar  gun  to  measure  the  speed  of  the 
ball  as  it  comes  out  of  the  pipe.  They  find  that  the  ball  has  slowed 
down  by  the  time  it  gets  to  the  top.  By  measuring  the  change  in  the 
ball’s  kinetic  energy,  the  two  physicists  are  acting  just  like  a  volt¬ 
meter.  They  conclude  that  the  top  of  the  tube  is  at  a  higher  voltage 
than  the  bottom  of  the  pipe.  A  difference  in  voltage  indicates  an 
electric  field,  and  this  field  is  clearly  being  caused  by  the  charges  in 
the  floor  and  ceiling. 

In  m/2,  there  are  no  charges  anywhere  in  the  room  except  for 
the  charged  bowling  ball.  Moving  charges  make  magnetic  fields,  so 
there  is  a  magnetic  field  surrounding  the  helical  pipe  while  the  ball 
is  moving  through  it.  A  magnetic  field  has  been  created  where  there 
was  none  before,  and  that  field  has  energy.  Where  could  the  energy 
have  come  from?  It  can  only  have  come  from  the  ball  itself,  so 
the  ball  must  be  losing  kinetic  energy.  The  two  physicists  working 
together  are  again  acting  as  a  voltmeter,  and  again  they  conclude 
that  there  is  a  voltage  difference  between  the  top  and  bottom  of 
the  pipe.  This  indicates  an  electric  field,  but  this  electric  field  can’t 
have  been  created  by  any  charges,  because  there  aren’t  any  in  the 
room.  This  electric  field  was  created  by  the  change  in  the  magnetic 
field. 

The  bottom  physicist  keeps  on  throwing  balls  into  the  pipe,  until 
the  pipe  is  full  of  balls,  m/3,  and  finally  a  steady  current  is  estab¬ 
lished.  While  the  pipe  was  filling  up  with  balls,  the  energy  in  the 
magnetic  field  was  steadily  increasing,  and  that  energy  was  being 
stolen  from  the  balls’  kinetic  energy.  But  once  a  steady  current  is 
established,  the  energy  in  the  magnetic  field  is  no  longer  changing. 
The  balls  no  longer  have  to  give  up  energy  in  order  to  build  up  the 
field,  and  the  physicist  at  the  top  finds  that  the  balls  are  exiting  the 
pipe  at  full  speed  again.  There  is  no  voltage  difference  any  more. 
Although  there  is  a  current,  d I /  df  is  zero. 

Ballasts  example  25 

In  a  gas  discharge  tube,  such  as  a  neon  sign,  enough  voltage 
is  applied  to  a  tube  full  of  gas  to  ionize  some  of  the  atoms  in  the 
gas.  Once  ions  have  been  created,  the  voltage  accelerates  them, 
and  they  strike  other  atoms,  ionizing  them  as  well  and  resulting 
in  a  chain  reaction.  This  is  a  spark,  like  a  bolt  of  lightning.  But 
once  the  spark  starts  up,  the  device  begins  to  act  as  though  it  has 
no  resistance:  more  and  more  current  flows,  without  the  need  to 
apply  any  more  voltage.  The  power,  P  =  IV,  would  grow  without 
limit,  and  the  tube  would  burn  itself  out. 

The  simplest  solution  is  to  connect  an  inductor,  known  as  the 
“ballast,”  in  series  with  the  tube,  and  run  the  whole  thing  on  an 
AC  voltage.  During  each  cycle,  as  the  voltage  reaches  the  point 
where  the  chain  reaction  begins,  there  is  a  surge  of  current,  but 
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the  inductor  resists  such  a  sudden  change  of  current,  and  the 
energy  that  would  otherwise  have  burned  out  the  bulb  is  instead 
channeled  into  building  a  magnetic  field. 

A  common  household  fluorescent  lightbulb  consists  of  a  gas  dis¬ 
charge  tube  in  which  the  glass  is  coated  with  a  fluorescent  mate¬ 
rial.  The  gas  in  the  tube  emits  ultraviolet  light,  which  is  absorbed 
by  the  coating,  and  the  coating  then  glows  in  the  visible  spectrum. 

Until  recently,  it  was  common  for  a  fluroescent  light’s  ballast  to 
be  a  simple  inductor,  and  for  the  whole  device  to  be  operated  at 
the  60  Hz  frequency  of  the  electrical  power  lines.  This  caused 
the  lights  to  flicker  annoyingly  at  120  Hz,  and  could  also  cause  an 
audible  hum,  since  the  magnetic  field  surrounding  the  inductor 
could  exert  mechanical  forces  on  things.  These  days,  the  trend 
is  toward  using  a  solid-state  circuit  that  mimics  the  behavior  of 
an  inductor,  but  at  a  frequency  in  the  kilohertz  range,  eliminating 
the  flicker  and  hum.  Modern  compact  fluorescent  bulbs  electronic 
have  ballasts  built  into  their  bases,  so  they  can  be  used  as  plug-in 
replacements  for  incandescent  bulbs.  A  compact  fluorescent  bulb 
uses  about  1/4  the  electricity  of  an  incandescent  bulb,  lasts  ten 
times  longer,  and  saves  $30  worth  of  electricity  over  its  lifetime. 

Discussion  Question 

A  What  happens  when  the  physicist  at  the  bottom  in  figure  m/3  starts 

getting  tired,  and  decreases  the  current? 


10.5.4  Decay 

Up  until  now  I’ve  soft-pedaled  the  fact  that  by  changing  the  char¬ 
acteristics  of  an  oscillator,  it  is  possible  to  produce  non-oscillatory 
behavior.  For  example,  imagine  taking  the  mass-on-a-spring  system 
and  making  the  spring  weaker  and  weaker.  In  the  limit  of  small 
k,  it’s  as  though  there  was  no  spring  whatsoever,  and  the  behavior 
of  the  system  is  that  if  you  kick  the  mass,  it  simply  starts  slowing 
down.  For  friction  proportional  to  v ,  as  we’ve  been  assuming,  the  re¬ 
sult  is  that  the  velocity  approaches  zero,  but  never  actually  reaches 
zero.  This  is  unrealistic  for  the  mechanical  oscillator,  which  will  not 
have  vanishing  friction  at  low  velocities,  but  it  is  quite  realistic  in 
the  case  of  an  electrical  circuit,  for  which  the  voltage  drop  across  the 
resistor  really  does  approach  zero  as  the  current  approaches  zero. 

We  do  not  even  have  to  reduce  k  to  exactly  zero  in  order  to  get 
non-oscillatory  behavior.  There  is  actually  a  finite,  critical  value  be¬ 
low  which  the  behavior  changes,  so  that  the  mass  never  even  makes 
it  through  one  cycle.  This  is  the  case  of  overdamping,  discussed  on 
page  186. 

Electrical  circuits  can  exhibit  all  the  same  behavior.  For  sim¬ 
plicity  we  will  analyze  only  the  cases  of  LRC  circuits  with  L  =  0  or 
C  =  0. 
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The  RC  circuit 
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o  /  An  RC  circuit. 


t/RC 

p/Over  a  time  interval  RC, 
the  charge  on  the  capacitor  is 
reduced  by  a  factor  of  e. 

R  L 

— mm. 


q  /  An  RL  circuit. 

A 

LU 


G 


S 


r  /  Example  27. 


We  first  analyze  the  RC  circuit,  o.  In  reality  one  would  have 
to  “kick”  the  circuit,  for  example  by  briefly  inserting  a  battery,  in 
order  to  get  any  interesting  behavior.  We  start  with  Ohm’s  law  and 
the  equation  for  the  voltage  across  a  capacitor: 

Vr  =  IR 
Vc  =  q/C 


The  loop  rule  tells  us 


Vr  +  Vc  =  0, 


and  combining  the  three  equations  results  in  a  relationship  between 
q  and  I: 

1  =  ~RCQ 

The  negative  sign  tells  us  that  the  current  tends  to  reduce  the  charge 
on  the  capacitor,  i.e.,  to  discharge  it.  It  makes  sense  that  the  current 
is  proportional  to  q:  if  q  is  large,  then  the  attractive  forces  between 
the  +q  and  —q  charges  on  the  plates  of  the  capacitor  are  large, 
and  charges  will  flow  more  quickly  through  the  resistor  in  order  to 
reunite.  If  there  was  zero  charge  on  the  capacitor  plates,  there  would 
be  no  reason  for  current  to  flow.  Since  amperes,  the  unit  of  current, 
are  the  same  as  coulombs  per  second,  it  appears  that  the  quantity 
RC  must  have  units  of  seconds,  and  you  can  check  for  yourself  that 
this  is  correct.  RC  is  therefore  referred  to  as  the  time  constant  of 
the  circuit. 


How  exactly  do  I  and  q  vary  with  time?  Rewriting  I  as  dq/  dt, 
we  have 

dq  1 

dt  =  ~RCQ' 

We  need  a  function  q(t)  whose  derivative  equals  itself,  but  multiplied 
by  a  negative  constant.  A  function  of  the  form  aet ,  where  e  = 
2.718...  is  the  base  of  natural  logarithms,  is  the  only  one  that  has  its 
derivative  equal  to  itself,  and  aebt  has  its  derivative  equal  to  itself 
multiplied  by  b.  Thus  our  solution  is 


q  =  qQ  exp 


The  RL  circuit 

The  RL  circuit,  q,  can  be  attacked  by  similar  methods,  and  it 
can  easily  be  shown  that  it  gives 


I  =  I0  exp 


The  RL  time  constant  equals  L/R. 
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Death  by  solenoid;  spark  plugs  example  26 

When  we  suddenly  break  an  RL  circuit,  what  will  happen?  It  might 
seem  that  we’re  faced  with  a  paradox,  since  we  only  have  two 
forms  of  energy,  magnetic  energy  and  heat,  and  if  the  current 
stops  suddenly,  the  magnetic  field  must  collapse  suddenly.  But 
where  does  the  lost  magnetic  energy  go?  It  can’t  go  into  resistive 
heating  of  the  resistor,  because  the  circuit  has  now  been  broken, 
and  current  can’t  flow! 

The  way  out  of  this  conundrum  is  to  recognize  that  the  open  gap 
in  the  circuit  has  a  resistance  which  is  large,  but  not  infinite.  This 
large  resistance  causes  the  RL  time  constant  L/R  to  be  very 
small.  The  current  thus  continues  to  flow  for  a  very  brief  time, 
and  flows  straight  across  the  air  gap  where  the  circuit  has  been 
opened.  In  other  words,  there  is  a  spark! 

We  can  determine  based  on  several  different  lines  of  reasoning 
that  the  voltage  drop  from  one  end  of  the  spark  to  the  other  must 
be  very  large.  First,  the  air’s  resistance  is  large,  so  V  =  IR  re¬ 
quires  a  large  voltage.  We  can  also  reason  that  all  the  energy 
in  the  magnetic  field  is  being  dissipated  in  a  short  time,  so  the 
power  dissipated  in  the  spark,  P  =  IV,  is  large,  and  this  requires 
a  large  value  of  V.  (/  isn’t  large  —  it  is  decreasing  from  its  initial 
value.)  Yet  a  third  way  to  reach  the  same  result  is  to  consider  the 
equation  VL  =  61/ 6t:  since  the  time  constant  is  short,  the  time 
derivative  61 /6t  is  large. 

This  is  exactly  how  a  car’s  spark  plugs  work.  Another  application 
is  to  electrical  safety:  it  can  be  dangerous  to  break  an  inductive 
circuit  suddenly,  because  so  much  energy  is  released  in  a  short 
time.  There  is  also  no  guarantee  that  the  spark  will  discharge 
across  the  air  gap;  it  might  go  through  your  body  instead,  since 
your  body  might  have  a  lower  resistance. 

A  spark-gap  radio  transmitter  example  27 

Figure  r  shows  a  primitive  type  of  radio  transmitter,  called  a  spark 
gap  transmitter,  used  to  send  Morse  code  around  the  turn  of  the 
twentieth  century.  The  high  voltage  source,  V,  is  typically  about 
10,000  volts.  When  the  telegraph  switch,  S,  is  closed,  the  RC 
circuit  on  the  left  starts  charging  up.  An  increasing  voltage  differ¬ 
ence  develops  between  the  electrodes  of  the  spark  gap,  G.  When 
this  voltage  difference  gets  large  enough,  the  electric  field  in  the 
air  between  the  electrodes  causes  a  spark,  partially  discharging 
the  RC  circuit,  but  charging  the  LC  circuit  on  the  right.  The  LC 
circuit  then  oscillates  at  its  resonant  frequency  (typically  about  1 
MFIz),  but  the  energy  of  these  oscillations  is  rapidly  radiated  away 
by  the  antenna,  A,  which  sends  out  radio  waves  (chapter  11). 

Discussion  Questions 

A  A  gopher  gnaws  through  one  of  the  wires  in  the  DC  lighting  system 

in  your  front  yard,  and  the  lights  turn  off.  At  the  instant  when  the  circuit 
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s  /  Visualizing  complex  num¬ 
bers  as  points  in  a  plane. 


t  /  Addition  of  complex  num¬ 
bers  is  just  like  addition  of 
vectors,  although  the  real  and 
imaginary  axes  don’t  actually 
represent  directions  in  space. 


2+i 


2-i 


u/A  complex  number  and 
its  conjugate. 


becomes  open,  we  can  consider  the  bare  ends  of  the  wire  to  be  like  the 
plates  of  a  capacitor,  with  an  air  gap  (or  gopher  gap)  between  them.  What 
kind  of  capacitance  value  are  we  talking  about  here?  What  would  this  tell 
you  about  the  RC  time  constant? 

10.5.5  Review  of  complex  numbers 

For  a  more  detailed  treatment  of  complex  numbers,  see  ch.  3  of 
James  Nearing’s  free  book  at 

http:  / /www.  physics,  miami.edu/nearing/mathmethods/. 

We  assume  there  is  a  number,  i,  such  that  % 2  =  —1.  The  square 
roots  of  —1  are  then  %  and  —  i.  (In  electrical  engineering  work,  where 
i  stands  for  current,  j  is  sometimes  used  instead.)  This  gives  rise  to 
a  number  system,  called  the  complex  numbers,  containing  the  real 
numbers  as  a  subset.  Any  complex  number  z  can  be  written  in  the 
form  z  =  a  +  bi,  where  a  and  b  are  real,  and  a  and  b  are  then  referred 
to  as  the  real  and  imaginary  parts  of  z.  A  number  with  a  zero  real 
part  is  called  an  imaginary  number.  The  complex  numbers  can  be 
visualized  as  a  plane,  with  the  real  number  line  placed  horizontally 
like  the  x  axis  of  the  familiar  x  —  y  plane,  and  the  imaginary  numbers 
running  along  the  y  axis.  The  complex  numbers  are  complete  in  a 
way  that  the  real  numbers  aren’t:  every  nonzero  complex  number 
has  two  square  roots.  For  example,  1  is  a  real  number,  so  it  is  also 
a  member  of  the  complex  numbers,  and  its  square  roots  are  —1  and 
1.  Likewise,  —1  has  square  roots  i  and  —i,  and  the  number  i  has 
square  roots  \/\J~2  +  i/V 2  and  — 1/\/2  —  i/\f  2. 

Complex  numbers  can  be  added  and  subtracted  by  adding  or 
subtracting  their  real  and  imaginary  parts.  Geometrically,  this  is 
the  same  as  vector  addition. 

The  complex  numbers  a  +  bi  and  a  —  bi ,  lying  at  equal  distances 
above  and  below  the  real  axis,  are  called  complex  conjugates.  The 
results  of  the  quadratic  formula  are  either  both  real,  or  complex 
conjugates  of  each  other.  The  complex  conjugate  of  a  number  z  is 
notated  as  z  or  z* . 

The  complex  numbers  obey  all  the  same  rules  of  arithmetic  as 
the  reals,  except  that  they  can’t  be  ordered  along  a  single  line.  That 
is,  it’s  not  possible  to  say  whether  one  complex  number  is  greater 
than  another.  We  can  compare  them  in  terms  of  their  magnitudes 
(their  distances  from  the  origin),  but  two  distinct  complex  numbers 
may  have  the  same  magnitude,  so,  for  example,  we  can’t  say  whether 
1  is  greater  than  i  or  i  is  greater  than  1. 

A  square  root  of  i  example  28 

>  Prove  that  1  /\/2  +  i/V 2  is  a  square  root  of  /. 

>  Our  proof  can  use  any  ordinary  rules  of  arithmetic,  except  for 
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ordering. 

1  /  2  1  1  1  /  /'I  /  / 

(^  +  Vf}  =  Vl'Vl+Vl'Vl+Vl'Vl  +  Vl'Vl 

=  1(1  +/  +  /-  1) 

=  / 

Example  28  showed  one  method  of  multiplying  complex  num¬ 
bers.  However,  there  is  another  nice  interpretation  of  complex  mul¬ 
tiplication.  We  define  the  argument  of  a  complex  number  as  its  angle 
in  the  complex  plane,  measured  counterclockwise  from  the  positive 
real  axis.  Multiplying  two  complex  numbers  then  corresponds  to 
multiplying  their  magnitudes,  and  adding  their  arguments. 

self-check  H 

Using  this  interpretation  of  multiplication,  how  could  you  find  the  square 
roots  of  a  complex  number?  >  Answer,  p.  966 

An  identity  example  29 

The  magnitude  \z\  of  a  complex  number  z  obeys  the  identity  |z|2  = 
zz.  To  prove  this,  we  first  note  that  z  has  the  same  magnitude 
as  z,  since  flipping  it  to  the  other  side  of  the  real  axis  doesn’t 
change  its  distance  from  the  origin.  Multiplying  z  by  z  gives  a 
result  whose  magnitude  is  found  by  multiplying  their  magnitudes, 
so  the  magnitude  of  zz  must  therefore  equal  |z|2.  Now  we  just 
have  to  prove  that  zz  is  a  positive  real  number.  But  if,  for  example, 
z  lies  counterclockwise  from  the  real  axis,  then  z  lies  clockwise 
from  it.  If  z  has  a  positive  argument,  then  z  has  a  negative  one,  or 
vice-versa.  The  sum  of  their  arguments  is  therefore  zero,  so  the 
result  has  an  argument  of  zero,  and  is  on  the  positive  real  axis. 

4 


v/A  complex  number  can 
be  described  in  terms  of  its 
magnitude  and  argument. 


w/The  argument  of  uv  is 
the  sum  of  the  arguments  of  u 
and  v. 


This  whole  system  was  built  up  in  order  to  make  every  number 
have  square  roots.  What  about  cube  roots,  fourth  roots,  and  so  on? 
Does  it  get  even  more  weird  when  you  want  to  do  those  as  well?  No. 
The  complex  number  system  we’ve  already  discussed  is  sufficient  to 
handle  all  of  them.  The  nicest  way  of  thinking  about  it  is  in  terms 
of  roots  of  polynomials.  In  the  real  number  system,  the  polynomial 
x2—  1  has  two  roots,  i.e.,  two  values  of  x  (plus  and  minus  one)  that  we 
can  plug  in  to  the  polynomial  and  get  zero.  Because  it  has  these  two 
real  roots,  we  can  rewrite  the  polynomial  as  (x—  l)(x  + 1).  However, 
the  polynomial  x2  +  1  has  no  real  roots.  It’s  ugly  that  in  the  real 
number  system,  some  second-order  polynomials  have  two  roots,  and 
can  be  factored,  while  others  can’t.  In  the  complex  number  system, 
they  all  can.  For  instance,  x2  +  1  has  roots  i  and  —  i,  and  can  be 

4I  cheated  a  little.  If  z’s  argument  is  30  degrees,  then  we  could  say  z’s  was 
-30,  but  we  could  also  call  it  330.  That’s  OK,  because  330+30  gives  360,  and  an 
argument  of  360  is  the  same  as  an  argument  of  zero. 
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factored  as  (x  —  i)(x  +  i).  In  general,  the  fundamental  theorem  of 
algebra  states  that  in  the  complex  number  system,  any  nth-order 
polynomial  can  be  factored  completely  into  n  linear  factors,  and  we 
can  also  say  that  it  has  n  complex  roots,  with  the  understanding  that 
some  of  the  roots  may  be  the  same.  For  instance,  the  fourth-order 
polynomial  x4+x 2  can  be  factored  as  (x  —  i)(x  +  i)(x  —  0)(x  —  0),  and 
we  say  that  it  has  four  roots,  i,  —i,  0,  and  0,  two  of  which  happen 
to  be  the  same.  This  is  a  sensible  way  to  think  about  it,  because 
in  real  life,  numbers  are  always  approximations  anyway,  and  if  we 
make  tiny,  random  changes  to  the  coefficients  of  this  polynomial,  it 
will  have  four  distinct  roots,  of  which  two  just  happen  to  be  very 
close  to  zero. 


Discussion  Questions 


A  Find  arg  /,  arg( — /),  and  arg  37,  where  arg  z  denotes  the  argument  of 
the  complex  number  z. 

B  Visualize  the  following  multiplications  in  the  complex  plane  using 
the  interpretation  of  multiplication  in  terms  of  multiplying  magnitudes  and 
adding  arguments:  (/')(/')  =  -1,  (/)(-/)  =  1,  (-/)(-/)  =  -1. 


C  If  we  visualize  z  as  a  point  in  the  complex  plane,  how  should  we 
visualize  -z?  What  does  this  mean  in  terms  of  arguments?  Give  similar 
interpretations  for  z2  and  \fz. 

D  Find  four  different  complex  numbers  z  such  that  z4  =  1 . 

E  Compute  the  following.  For  the  final  two,  use  the  magnitude  and 
argument,  not  the  real  and  imaginary  parts. 


|1+/j  ,  arg(1  +  /') 


From  these,  find  the  real  and  imaginary  parts  of  1  /(I  +  /'). 


10.5.6  Euler’s  formula 


Having  expanded  our  horizons  to  include  the  complex  numbers, 
it’s  natural  to  want  to  extend  functions  we  knew  and  loved  from 
the  world  of  real  numbers  so  that  they  can  also  operate  on  complex 
numbers.  The  only  really  natural  way  to  do  this  in  general  is  to 
use  Taylor  series.  A  particularly  beautiful  thing  happens  with  the 
functions  ex,  sinx,  and  cosx: 


=  1 + hx* + 1*3 


1 


1 


cos  X  =  1  —  —x"  +  —x  —  .  .  . 

1  3  1  5 

sm  x  =  x - -x  -| — -x  —  . . . 

3!  5! 


If  x  =  i(f>  is  an  imaginary  number,  we  have 


cos  (j>  +  i  sin  <f>, 


a  result  known  as  Euler’s  formula.  The  geometrical  interpretation 
in  the  complex  plane  is  shown  in  figure  x. 
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Although  the  result  may  seem  like  something  out  of  a  freak  show 
at  first,  applying  the  definition  of  the  exponential  function  makes  it 
clear  how  natural  it  is: 


e*  =  lim  (l  +  -Y  . 

n— »oo  V  nJ 

When  x  =  icj)  is  imaginary,  the  quantity  (1  +  i(j)/n )  represents  a 
number  lying  just  above  1  in  the  complex  plane.  For  large  n,  (1  + 
i(j)/n)  becomes  very  close  to  the  unit  circle,  and  its  argument  is  the 
small  angle  <p/n.  Raising  this  number  to  the  nth  power  multiplies 
its  argument  by  n,  giving  a  number  with  an  argument  of  q i>. 

Euler’s  formula  is  used  frequently  in  physics  and  engineering. 

Trig  functions  in  terms  of  complex  exponentials  example  30 
>  Write  the  sine  and  cosine  functions  in  terms  of  exponentials. 

o  Euler’s  formula  for  x  =  -/'4>  gives  cos  4? — / sin  4>,  since  cos(-0)  = 
cos  0,  and  sin(-0)  =  -  sin  0. 


cosx  = 


sinx 


eix  -  e~lx 

2 7 


A  hard  integral  made  easy 
>  Evaluate 


j  excosxdx 


example  3 1 


>  This  seemingly  impossible  integral  becomes  easy  if  we  rewrite 
the  cosine  in  terms  of  exponentials: 


y  /  Leonhard  Euler  (1 707-1 783) 


J  excosxdx 


dx 


1 

2 

1 

2 


e(1+')x  +  e(1  ')x)  eix 


e(1+/)x  0(1  -i)x\ 

i  +7  +  i  -  /  ) 


+  C 


Since  this  result  is  the  integral  of  a  real-valued  function,  we’d  like 
it  to  be  real,  and  in  fact  it  is,  since  the  first  and  second  terms  are 
complex  conjugates  of  one  another.  If  we  wanted  to,  we  could 
use  Euler’s  theorem  to  convert  it  back  to  a  manifestly  real  result.5 

5 In  general,  the  use  of  complex  number  techniques  to  do  an  integral  could 
result  in  a  complex  number,  but  that  complex  number  would  be  a  constant, 
which  could  be  subsumed  within  the  usual  constant  of  integration. 
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10.5.7  Impedance 

So  far  we  have  been  thinking  in  terms  of  the  free  oscillations  of  a 
circuit.  This  is  like  a  mechanical  oscillator  that  has  been  kicked  but 
then  left  to  oscillate  on  its  own  without  any  external  force  to  keep 
the  vibrations  from  dying  out.  Suppose  an  LRC  circuit  is  driven 
with  a  sinusoidally  varying  voltage,  such  as  will  occur  when  a  radio 
tuner  is  hooked  up  to  a  receiving  antenna.  We  know  that  a  current 
will  flow  in  the  circuit,  and  we  know  that  there  will  be  resonant 
behavior,  but  it  is  not  necessarily  simple  to  relate  current  to  voltage 
in  the  most  general  case.  Let’s  start  instead  with  the  special  cases 
of  LRC  circuits  consisting  of  only  a  resistance,  only  a  capacitance, 
or  only  an  inductance.  We  are  interested  only  in  the  steady-state 
response. 

The  purely  resistive  case  is  easy.  Ohm’s  law  gives 


In  the  purely  capacitive  case,  the  relation  V  =  q/C  lets  us  cal¬ 
culate 


I 


d  q 
d  t 


C 


dR 

dt 


This  is  partly  analogous  to  Ohm’s  law.  For  example,  if  we  double 
the  amplitude  of  a  sinusoidally  varying  AC  voltage,  the  derivative 
dR/  dt  will  also  double,  and  the  amplitude  of  the  sinusoidally  varying 
current  will  also  double.  However,  it  is  not  true  that  I  =  V/R,  be¬ 
cause  taking  the  derivative  of  a  sinusoidal  function  shifts  its  phase  by 
90  degrees.  If  the  voltage  varies  as,  for  example,  R(t)  =  R0sin(cuf), 
then  the  current  will  be  I(t)  =  ujCV0  cos(cut).  The  amplitude  of  the 
current  is  uCVQ,  which  is  proportional  to  VQ,  but  it’s  not  true  that 
I(t)  =  V(t)/R  for  some  constant  R. 

A  second  problem  that  crops  up  is  that  our  entire  analysis  of 
DC  resistive  circuits  was  built  on  the  foundation  of  the  loop  rule 
and  the  junction  rule,  both  of  which  are  statements  about  sums.  To 
apply  the  junction  rule  to  an  AC  circuit,  for  exarnpe,  we  would  say 
that  the  sum  of  the  sine  waves  describing  the  currents  coming  into 
the  junction  is  equal  (at  every  moment  in  time)  to  the  sum  of  the 
sine  waves  going  out.  Now  sinusoidal  functions  have  a  remarkable 
property,  which  is  that  if  you  add  two  different  sinusoidal  functions 
having  the  same  frequency,  the  result  is  also  a  sinusoid  with  that 
frequency.  For  example,  cos  cut  +  sin  cut  =  \/2sin(cuf  +  7r/4),  which 
can  be  proved  using  trig  identities.  The  trig  identities  can  get  very 
cumbersome,  however,  and  there  is  a  much  easier  technique  involv¬ 
ing  complex  numbers. 
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Figure  aa  shows  a  useful  way  to  visualize  what’s  going  on.  When 
a  circuit  is  oscillating  at  a  frequency  u,  we  use  points  in  the  plane  to 
represent  sinusoidal  functions  with  various  phases  and  amplitudes. 


self-check  I 

Which  of  the  following  functions  can  be  represented  in  this  way?  cos(6 1— 
4),  cos2 1,  tan  f  >  Answer,  p.  966 

The  simplest  examples  of  how  to  visualize  this  in  polar  coordi¬ 
nates  are  ones  like  cos  cut  +  cos  cut  =  2  cos  cut,  where  everything  has 
the  same  phase,  so  all  the  points  lie  along  a  single  line  in  the  polar 
plot,  and  addition  is  just  like  adding  numbers  on  the  number  line. 
The  less  trivial  example  cos  cut  +  sin  cut  =  \/2sin(u;£  +  7t/4),  can  be 
visualized  as  in  figure  ab. 


Figure  ab  suggests  that  all  of  this  can  be  tied  together  nicely 
if  we  identify  our  plane  with  the  plane  of  complex  numbers.  For 
example,  the  complex  numbers  1  and  i  represent  the  functions  sin  cut 
and  coscut.  In  figure  z,  for  example,  the  voltage  across  the  capacitor 
is  a  sine  wave  multiplied  by  a  number  that  gives  its  amplitude,  so 
we  associate  that  function  with  a  number  V  lying  on  the  real  axis. 
Its  magnitude,  |Vj,  gives  the  amplitude  in  units  of  volts,  while  its 
argument  aigV,  gives  its  phase  angle,  which  is  zero.  The  current 
is  a  multiple  of  a  sine  wave,  so  we  identify  it  with  a  number  I 
lying  on  the  imaginary  axis.  We  have  arg I  =  90°,  and  |/|  is  the 
amplitude  of  the  current,  in  units  of  amperes.  But  comparing  with 
our  result  above,  we  have  |/|  =  ojC\V\.  Bringing  together  the  phase 
and  magnitude  information,  we  have  I  =  iojCV.  This  looks  very 
much  like  Ohm’s  law,  so  we  write 


where  the  quantity 


I  = 


V 

Zc’ 


Zc  =  - 


cuC’ 


[impedance  of  a  capacitor] 


having  units  of  ohms,  is  called  the  impedance  of  the  capacitor  at 
this  frequency. 


It  makes  sense  that  the  impedance  becomes  infinite  at  zero  fre¬ 
quency.  Zero  frequency  means  that  it  would  take  an  infinite  time 
before  the  voltage  would  change  by  any  amount.  In  other  words, 
this  is  like  a  situation  where  the  capacitor  has  been  connected  across 
the  terminals  of  a  battery  and  been  allowed  to  settle  down  to  a  state 
where  there  is  constant  charge  on  both  terminals.  Since  the  elec¬ 
tric  fields  between  the  plates  are  constant,  there  is  no  energy  being 
added  to  or  taken  out  of  the  field.  A  capacitor  that  can’t  exchange 
energy  with  any  other  circuit  component  is  nothing  more  than  a 
broken  (open)  circuit. 

Note  that  we  have  two  types  of  complex  numbers:  those  that 
represent  sinusoidal  functions  of  time,  and  those  that  represent 


z  /  In  a  capacitor,  the  current 
is  90°  ahead  of  the  voltage  in 
phase. 


cos  cot< 

.sin  ( co  t+  tc/4) 
-sin  cot  0  sin  cot  2  sin  cot 

-2cos  cot< 

aa  /  Representing  functions 
with  points  in  polar  coordinates. 


cos  cot 


.sincot+cos  cot 


sin  cot 


ab  /  Adding  two  sinusoidal 
functions. 
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ac  /  The  current  through  an 
inductor  lags  behind  the  voltage 
by  a  phase  angle  of  90°. 


impedances.  The  ones  that  represent  sinusoidal  functions  have  tildes 
on  top,  which  look  like  little  sine  waves. 

self-check  J 

Why  can’t  a  capacitor  have  its  impedance  printed  on  it  along  with  its 
capacitance?  >  Answer,  p.  966 

Similar  math  (but  this  time  with  an  integral  instead  of  a  deriva¬ 
tive)  gives 


Zl  =  icoL  [impedance  of  an  inductor] 

for  an  inductor.  It  makes  sense  that  the  inductor  has  lower  impedance 
at  lower  frequencies,  since  at  zero  frequency  there  is  no  change  in 
the  magnetic  field  over  time.  No  energy  is  added  to  or  released 
from  the  magnetic  field,  so  there  are  no  induction  effects,  and  the 
inductor  acts  just  like  a  piece  of  wire  with  negligible  resistance.  The 
term  “choke”  for  an  inductor  refers  to  its  ability  to  “choke  out”  high 
frequencies. 

The  phase  relationships  shown  in  figures  z  and  ac  can  be  re¬ 
membered  using  my  own  mnemonic,  “eVIL,”  which  shows  that  the 
voltage  (V)  leads  the  current  (I)  in  an  inductive  circuit,  while  the 
opposite  is  true  in  a  capacitive  one.  A  more  traditional  mnemonic 
is  “ELI  the  ICE  man,”  which  uses  the  notation  E  for  emf,  a  concept 
closely  related  to  voltage  (see  p.  692). 

Summarizing,  the  impedances  of  resistors,  capacitors,  and  in¬ 
ductors  are 


Zr  =  R 


Zr  =  iuL. 


Low-pass  and  high-pass  filters  example  32 

An  LRC  circuit  only  responds  to  a  certain  range  (band)  of  fre¬ 
quencies  centered  around  its  resonant  frequency.  As  a  filter,  this 
is  known  as  a  bandpass  filter.  If  you  turn  down  both  the  bass  and 
the  treble  on  your  stereo,  you  have  created  a  bandpass  filter. 

To  create  a  high-pass  or  low-pass  filter,  we  only  need  to  insert 
a  capacitor  or  inductor,  respectively,  in  series.  For  instance,  a 
very  basic  surge  protector  for  a  computer  could  be  constructed 
by  inserting  an  inductor  in  series  with  the  computer.  The  desired 
60  Hz  power  from  the  wall  is  relatively  low  in  frequency,  while  the 
surges  that  can  damage  your  computer  show  much  more  rapid 
time  variation.  Even  if  the  surges  are  not  sinusoidal  signals,  we 
can  think  of  a  rapid  “spike”  qualitatively  as  if  it  was  very  high  in 
frequency  —  like  a  high-frequency  sine  wave,  it  changes  very 
rapidly. 
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Inductors  tend  to  be  big,  heavy,  expensive  circuit  elements,  so  a 
simple  surge  protector  would  be  more  likely  to  consist  of  a  capac¬ 
itor  in  parallel  with  the  computer.  (In  fact  one  would  normally  just 
connect  one  side  of  the  power  circuit  to  ground  via  a  capacitor.) 
The  capacitor  has  a  very  high  impedance  at  the  low  frequency  of 
the  desired  60  Hz  signal,  so  it  siphons  off  very  little  of  the  current. 
But  for  a  high-frequency  signal,  the  capacitor’s  impedance  is  very 
small,  and  it  acts  like  a  zero-impedance,  easy  path  into  which  the 
current  is  diverted. 

The  main  things  to  be  careful  about  with  impedance  are  that 
(1)  the  concept  only  applies  to  a  circuit  that  is  being  driven  sinu¬ 
soidally,  (2)  the  impedance  of  an  inductor  or  capacitor  is  frequency- 
dependent. 

Discussion  Question 

A  Figure  z  on  page  61 3  shows  the  voltage  and  current  for  a  capacitor. 
Sketch  the  q-t  graph,  and  use  it  to  give  a  physical  explanation  of  the 
phase  relationship  between  the  voltage  and  current.  For  example,  why  is 
the  current  zero  when  the  voltage  is  at  a  maximum  or  minimum? 

B  Figure  ac  on  page  61 4  shows  the  voltage  and  current  for  an  inductor. 
The  power  is  considered  to  be  positive  when  energy  is  being  put  into  the 
inductor’s  magnetic  field.  Sketch  the  graph  of  the  power,  and  then  the 
graph  of  U,  the  energy  stored  in  the  magnetic  field,  and  use  it  to  give 
a  physical  explanation  of  the  P-t  graph.  In  particular,  discuss  why  the 
frequency  is  doubled  on  the  P-t  graph. 

C  Relate  the  features  of  the  graph  in  figure  ac  on  page  614  to  the  story 
told  in  cartoons  in  figure  m/2-3  on  page  603. 

10.5.8  Power 

How  much  power  is  delivered  when  an  oscillating  voltage  is  ap¬ 
plied  to  an  impedance?  The  equation  P  =  IV  is  generally  true, 
since  voltage  is  defined  as  energy  per  unit  charge,  and  current  is 
defined  as  charge  per  unit  time:  multiplying  them  gives  energy  per 
unit  time.  In  a  DC  circuit,  all  three  quantities  were  constant,  but 
in  an  oscillating  (AC)  circuit,  all  three  display  time  variation. 

A  resistor 

First  let’s  examine  the  case  of  a  resistor.  For  instance,  you’re 
probably  reading  this  book  from  a  piece  of  paper  illuminated  by 
a  glowing  lightbulb,  which  is  driven  by  an  oscillating  voltage  with 
amplitude  VD.  In  the  special  case  of  a  resistor,  we  know  that  I  and 
V  are  in  phase.  For  example,  if  V  varies  as  VQ  cos  cot,  then  I  will  be 
a  cosine  as  well,  I0  cos  cut.  The  power  is  then  I0V0  cos2  cut,  which  is 
always  positive,6  and  varies  between  0  and  I0V0.  Even  if  the  time 
variation  was  cos  cut  or  sin(cuf+7r/4),  we  would  still  have  a  maximum 
power  of  I0V0l  because  both  the  voltage  and  the  current  would  reach 

6 A  resistor  always  turns  electrical  energy  into  heat.  It  never  turns  heat  into 
electrical  energy! 
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ad  /  Power  in  a  resistor:  the 
rate  at  which  electrical  energy  is 
being  converted  into  heat. 


their  maxima  at  the  same  time.  In  a  lightbulb,  the  moment  of 
maximum  power  is  when  the  circuit  is  most  rapidly  heating  the 
filament.  At  the  instant  when  P  =  0,  a  quarter  of  a  cycle  later,  no 
current  is  flowing,  and  no  electrical  energy  is  being  turned  into  heat. 
Throughout  the  whole  cycle,  the  filament  is  getting  rid  of  energy  by 
radiating  light."  Since  the  circuit  oscillates  at  a  frequency8  of  60  Hz, 
the  temperature  doesn’t  really  have  time  to  cycle  up  or  down  very 
much  over  the  1/60  s  period  of  the  oscillation,  and  we  don’t  notice 
any  significant  variation  in  the  brightness  of  the  light,  even  with  a 
short-exposure  photograph. 

Thus,  what  we  really  want  to  know  is  the  average  power,  “aver¬ 
age”  meaning  the  average  over  one  full  cycle.  Since  we’re  covering 
a  whole  cycle  with  our  average,  it  doesn’t  matter  what  phase  we 
assume.  Let’s  use  a  cosine.  The  total  amount  of  energy  transferred 
over  one  cycle  is 


E=  dE 


fT  dE  , 

L  ^dt- 


where  T  =  2ir/oj  is  the  period. 


E  =  /  Pdt 

Jo 

=  [  Pdt 

Jo 

fT 

=  /  IqVo  cos2  ut  dt 

Jo 

fT 

=  I0V0  I  cos2wfdt 
Jo 


fT  1 

=  loVo  /  -  (1  +  cos  2 ut)  dt 

Jo  2 


7To  many  people,  the  word  “radiation”  implies  nuclear  contamination.  Ac¬ 
tually,  the  word  simply  means  something  that  “radiates”  outward.  Natural 
sunlight  is  “radiation.”  So  is  the  light  from  a  lightbulb,  or  the  infrared  light 
being  emitted  by  your  skin  right  now. 

sNote  that  this  time  “frequency”  means  /,  not  uj\  Physicists  and  engineers 
generally  use  u>  because  it  simplifies  the  equations,  but  electricians  and  techni¬ 
cians  always  use  /.  The  60  Hz  frequency  is  for  the  U.S. 
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The  reason  for  using  the  trig  identity  cos2x  =  (1  +  cos  2x)/2  in 
the  last  step  is  that  it  lets  us  get  the  answer  without  doing  a  hard 
integral.  Over  the  course  of  one  full  cycle,  the  quantity  cos  2 ut  goes 
positive,  negative,  positive,  and  negative  again,  so  the  integral  of  it 
is  zero.  We  then  have 

[T  1 

E  =  I0V0  /  -  d t 
Jo  z 
_  IqVqT 

2 

The  average  power  is 

energy  transferred  in  one  full  cycle 
time  for  one  full  cycle 
_  IoVoT/2 

T 

IoV0 


i.e.,  the  average  is  half  the  maximum.  The  power  varies  from  0 
to  I0V0l  and  it  spends  equal  amounts  of  time  above  and  below  the 
maximum,  so  it  isn’t  surprising  that  the  average  power  is  half-way 
in  between  zero  and  the  maximum.  Summarizing,  we  have 


Pav  = 


[average  power  in  a  resistor] 


for  a  resistor. 

Rms  quantities 

Suppose  one  day  the  electric  company  decided  to  start  supplying 
your  electricity  as  DC  rather  than  AC.  How  would  the  DC  voltage 
have  to  be  related  to  the  amplitude  VQ  of  the  AC  voltage  previously 
used  if  they  wanted  your  lightbulbs  to  have  the  same  brightness  as 
before?  The  resistance  of  the  bulb,  R,  is  a  fixed  value,  so  we  need 
to  relate  the  power  to  the  voltage  and  the  resistance,  eliminating 
the  current.  In  the  DC  case,  this  gives  P  =  IV  =  (V/R) V  =  V2 /R. 
(For  DC,  P  and  Pav  are  the  same.)  In  the  AC  case,  Pav  =  I0V0/2  = 
V2 /2R.  Since  there  is  no  factor  of  1/2  in  the  DC  case,  the  same 
power  could  be  provided  with  a  DC  voltage  that  was  smaller  by  a 
factor  of  l/y/h  Although  you  will  hear  people  say  that  household 
voltage  in  the  U.S.  is  110  V,  its  amplitude  is  actually  (110  V)  x 
\[2  ~  160  V.  The  reason  for  referring  to  V0/y/2  as  “the”  voltage  is 
that  people  who  are  naive  about  AC  circuits  can  plug  V0/\/2  into 
a  familiar  DC  equation  like  P  =  V2 / R  and  get  the  right  average 
answer.  The  quantity  V0/\/2  is  called  the  “RMS”  voltage,  which 
stands  for  “root  mean  square.”  The  idea  is  that  if  you  square  the 
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the  rate  at  which  energy  is  being 
stored  in  (+)  or  removed  from  (-) 
the  electric  field. 


function  V(t),  take  its  average  (mean)  over  one  cycle,  and  then  take 
the  square  root  of  that  average,  you  get  V0/y/2.  Many  digital  meters 
provide  RMS  readouts  for  measuring  AC  voltages  and  currents. 


A  capacitor 

For  a  capacitor,  the  calculation  starts  out  the  same,  but  ends  up 
with  a  twist.  If  the  voltage  varies  as  a  cosine,  VQ  cos  cut,  then  the 
relation  I  =  C  dV j  dt  tells  us  that  the  current  will  be  some  constant 
multiplied  by  minus  the  sine,  —V0  sine ot.  The  integral  we  did  in  the 
case  of  a  resistor  now  becomes 


E  = 


—I0Vo  sin  l ot  cos  oot  dt, 


and  based  on  figure  ae,  you  can  easily  convince  yourself  that  over 
the  course  of  one  full  cycle,  the  power  spends  two  quarter-cycles 
being  negative  and  two  being  positive.  In  other  words,  the  average 
power  is  zero! 

Why  is  this?  It  makes  sense  if  you  think  in  terms  of  energy. 
A  resistor  converts  electrical  energy  to  heat,  never  the  other  way 
around.  A  capacitor,  however,  merely  stores  electrical  energy  in  an 
electric  field  and  then  gives  it  back.  For  a  capacitor, 


Pav  =  0  [average  power  in  a  capacitor] 


Notice  that  although  the  average  power  is  zero,  the  power  at  any 
given  instant  is  not  typically  zero,  as  shown  in  figure  ae.  The  capac¬ 
itor  does  transfer  energy:  it’s  just  that  after  borrowing  some  energy, 
it  always  pays  it  back  in  the  next  quarter-cycle. 

An  inductor 

The  analysis  for  an  inductor  is  similar  to  that  for  a  capacitor:  the 
power  averaged  over  one  cycle  is  zero.  Again,  we’re  merely  storing 
energy  temporarily  in  a  field  (this  time  a  magnetic  field)  and  getting 
it  back  later. 


ZT 

r— WVW— ] 
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at /We  wish  to  maximize  the 
power  delivered  to  the  load,  Z0, 
by  adjusting  its  impedance. 


10.5.9  Impedance  matching 

Figure  af  shows  a  commonly  encountered  situation:  we  wish  to 
maximize  the  average  power,  Pav,  delivered  to  the  load  for  a  fixed 
value  of  VQ,  the  amplitude  of  the  oscillating  driving  voltage.  We 
assume  that  the  impedance  of  the  transmission  line,  Zt  is  a  fixed 
value,  over  which  we  have  no  control,  but  we  are  able  to  design  the 
load,  Z0,  with  any  impedance  we  like.  For  now,  we’ll  also  assume 
that  both  impedances  are  resistive.  For  example,  Zt  could  be  the 
resistance  of  a  long  extension  cord,  and  ZQ  could  be  a  lamp  at  the 
end  of  it.  The  result  generalizes  immediately,  however,  to  any  kind 
of  impedance.  For  example,  the  load  could  be  a  stereo  speaker’s 
magnet  coil,  which  is  displays  both  inductance  and  resistance.  (For 
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a  purely  inductive  or  capacitive  load,  Pav  equals  zero,  so  the  problem 
isn’t  very  interesting!) 

Since  we’re  assuming  both  the  load  and  the  transmission  line  are 
resistive,  their  impedances  add  in  series,  and  the  amplitude  of  the 
current  is  given  by 

j 

°  Z0  +  ZT' 


so 


Pav  =  IoVo/2 

=  l'oZo/2 

=  VQZ°  /2 
(Z0  +  ZT)2  7  ' 

The  maximum  of  this  expression  occurs  where  the  derivative  is  zero, 

Id 
<J  -  2dZ~0 


2  dZQ 

0  =  (Z0  +  Zt)  ~  —  2Z0  (ZQ  +  Zt ) 

0  =  (Z0  +  Zt)  —  2Za 
Z0  =  Zt 

In  other  words,  to  maximize  the  power  delivered  to  the  load,  we 
should  make  the  load’s  impedance  the  same  as  the  transmission 
line’s.  This  result  may  seem  surprising  at  first,  but  it  makes  sense 
if  you  think  about  it.  If  the  load’s  impedance  is  too  high,  it’s  like 
opening  a  switch  and  breaking  the  circuit;  no  power  is  delivered. 
On  the  other  hand,  it  doesn’t  pay  to  make  the  load’s  impedance  too 
small.  Making  it  smaller  does  give  more  current,  but  no  matter  how 
small  we  make  it,  the  current  will  still  be  limited  by  the  transmission 
line’s  impedance.  As  the  load’s  impedance  approaches  zero,  the 
current  approaches  this  fixed  value,  and  the  the  power  delivered, 
IqZ0,  decreases  in  proportion  to  ZQ. 

Maximizing  the  power  transmission  by  matching  Zt  to  Z0  is 
called  impedance  matching.  For  example,  an  8-ohm  home  stereo 
speaker  will  be  correctly  matched  to  a  home  stereo  amplifier  with 
an  internal  impedance  of  8  ohms,  and  4-ohm  car  speakers  will  be 
correctly  matched  to  a  car  stereo  with  a  4-ohm  internal  impedance. 
You  might  think  impedance  matching  would  be  unimportant  be¬ 
cause  even  if,  for  example,  we  used  a  car  stereo  to  drive  8-ohm 
speakers,  we  could  compensate  for  the  mismatch  simply  by  turn¬ 
ing  the  volume  knob  higher.  This  is  indeed  one  way  to  compensate 
for  any  impedance  mismatch,  but  there  is  always  a  price  to  pay. 


~  Vq2Zq  ' 
_{Zo  +  ZTf_ 
Zo 

(Zo  +  Zt)\ 
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When  the  impedances  are  matched,  half  the  power  is  dissipated  in 
the  transmission  line  and  half  in  the  load.  By  connecting  a  4-ohm 
amplifier  to  an  8-ohm  speaker,  however,  you  would  be  setting  up  a 
situation  in  two  watts  were  being  dissipated  as  heat  inside  the  amp 
for  every  amp  being  delivered  to  the  speaker.  In  other  words,  you 
would  be  wasting  energy,  and  perhaps  burning  out  your  amp  when 
you  turned  up  the  volume  to  compensate  for  the  mismatch. 

10.5.10  Impedances  in  series  and  parallel 

How  do  impedances  combine  in  series  and  parallel?  The  beauty 
of  treating  them  as  complex  numbers  is  that  they  simply  combine 
according  to  the  same  rules  you’ve  already  learned  as  resistances. 

Series  impedance  example  33 

t>  A  capacitor  and  an  inductor  in  series  with  each  other  are  driven 
by  a  sinusoidally  oscillating  voltage.  At  what  frequency  is  the  cur¬ 
rent  maximized? 

>  Impedances  in  series,  like  resistances  in  series,  add.  The  ca¬ 
pacitor  and  inductor  act  as  if  they  were  a  single  circuit  element 
with  an  impedance 


Z  =  Z,  +  ZC 


=  iwL 


i 

uoC 


The  current  is  then 


iwL  —  i/wC' 

We  don’t  care  about  the  phase  of  the  current,  only  its  amplitude, 
which  is  represented  by  the  absolute  value  of  the  complex  num¬ 
ber  7,  and  this  can  be  maximized  by  making  \iwL—i/wC\  as  small 
as  possible.  But  there  is  some  frequency  at  which  this  quantity  is 
zero  — 


0  =  iwL 
1 

c vC 


i 

uoC 
L vL 


LU  = 


1 

\fLC 


At  this  frequency,  the  current  is  infinite!  What  is  going  on  phys¬ 
ically?  This  is  an  LRC  circuit  with  R  =  0.  It  has  a  resonance  at 
this  frequency,  and  because  there  is  no  damping,  the  response 
at  resonance  is  infinite.  Of  course,  any  real  LRC  circuit  will  have 
some  damping,  however  small  (cf.  figure  j  on  page  181). 
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Resonance  with  damping  example  34 

>  What  is  the  amplitude  of  the  current  in  a  series  LRC  circuit? 

o  Generalizing  from  example  33,  we  add  a  third,  real  impedance: 


R  +  iuoL  —  i/wC 


\V\ 

This  result  would  have  taken  pages  of  algebra  without  the  com¬ 
plex  number  technique! 

A  second-order  stereo  crossover  filter  example  35 

A  stereo  crossover  filter  ensures  that  the  high  frequencies  go  to 
the  tweeter  and  the  lows  to  the  woofer.  This  can  be  accomplished 
simply  by  putting  a  single  capacitor  in  series  with  the  tweeter  and 
a  single  inductor  in  series  with  the  woofer.  However,  such  a  filter 
does  not  cut  off  very  sharply.  Suppose  we  model  the  speakers 
as  resistors.  (They  really  have  inductance  as  well,  since  they 
have  coils  in  them  that  serve  as  electromagnets  to  move  the  di¬ 
aphragm  that  makes  the  sound.)  Then  the  power  they  draw  is 
l2R.  Putting  an  inductor  in  series  with  the  woofer,  ag/1,  gives 
a  total  impedance  that  at  high  frequencies  is  dominated  by  the 
inductor’s,  so  the  current  is  proportional  to  or1,  and  the  power 
drawn  by  the  woofer  is  proportional  to  o>~2. 

A  second-order  filter,  like  ag/2,  is  one  that  cuts  off  more  sharply: 
at  high  frequencies,  the  power  goes  like  cu-4.  To  analyze  this 
circuit,  we  first  calculate  the  total  impedance: 

Z  =  ZL  +  (ZC1+ZRV 

All  the  current  passes  through  the  inductor,  so  if  the  driving  volt¬ 
age  being  supplied  on  the  left  is  Vd,  we  have 

Vd  =  lLz, 

and  we  also  have 

VL  =  1LZL. 

The  loop  rule,  applied  to  the  outer  perimeter  of  the  circuit,  gives 

Vd=VL+VR. 

Straightforward  algebra  now  results  in 

o  Yu 

R  1  +ZL/ZC  +  ZL/ZR 

At  high  frequencies,  the  ZL/ZC  term,  which  varies  as  tu2,  dom¬ 
inates,  so  VR  and  1R  are  proportional  to  tu-2,  and  the  power  is 
proportional  to  cu”4. 


(1) 


L 


ag  /  Example  35. 
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10.6  Fields  by  Gauss’  Law 

10.6.1  Gauss’  law 

The  flea  of  subsection  10.3.2  had  a  long  and  illustrious  scientific 
career,  and  we’re  now  going  to  pick  up  her  story  where  we  left  off. 
This  flea,  whose  name  is  Gauss9,  has  derived  the  equation  E±  = 
2irka  for  the  electric  field  very  close  to  a  charged  surface  with  charge 
density  a.  Next  we  will  describe  two  improvements  she  is  going  to 
make  to  that  equation. 

First,  she  realizes  that  the  equation  is  not  as  useful  as  it  could  be, 
because  it  only  gives  the  part  of  the  field  due  to  the  surface.  If  other 
charges  are  nearby,  then  their  fields  will  add  to  this  field  as  vectors, 
and  the  equation  will  not  be  true  unless  we  carefully  subtract  out 
the  field  from  the  other  charges.  This  is  especially  problematic  for 
her  because  the  planet  on  which  she  lives,  known  for  obscure  reasons 
as  planet  Flatcat,  is  itself  electrically  charged,  and  so  are  all  the  fleas 
-  the  only  thing  that  keeps  them  from  floating  off  into  outer  space 
is  that  they  are  negatively  charged,  while  Flatcat  carries  a  positive 
charge,  so  they  are  electrically  attracted  to  it.  When  Gauss  found 
the  original  version  of  her  equation,  she  wanted  to  demonstrate  it  to 
her  skeptical  colleagues  in  the  laboratory,  using  electric  field  meters 
and  charged  pieces  of  metal  foil.  Even  if  she  set  up  the  measurements 
by  remote  control,  so  that  her  the  charge  on  her  own  body  would 
be  too  far  away  to  have  any  effect,  they  would  be  disrupted  by  the 
ambient  field  of  planet  Flatcat.  Finally,  however,  she  realized  that 
she  could  improve  her  equation  by  rewriting  it  as  follows: 


E outward ,  on  side  1  T  ^ outward ,  on  side  2  Anker . 

The  tricky  thing  here  is  that  “outward”  means  a  different  thing, 
depending  on  which  side  of  the  foil  we’re  on.  On  the  left  side, 
“outward”  means  to  the  left,  while  on  the  right  side,  “outward”  is 
right.  A  positively  charged  piece  of  metal  foil  has  a  field  that  points 
leftward  on  the  left  side,  and  rightward  on  its  right  side,  so  the  two 
contributions  of  2irkcr  are  both  positive,  and  we  get  Anker.  On  the 
other  hand,  suppose  there  is  a  field  created  by  other  charges,  not 
by  the  charged  foil,  that  happens  to  point  to  the  right.  On  the 
right  side,  this  externally  created  field  is  in  the  same  direction  as 
the  foil’s  field,  but  on  the  left  side,  the  it  reduces  the  strength  of  the 
leftward  field  created  by  the  foil.  The  increase  in  one  term  of  the 
equation  balances  the  decrease  in  the  other  term.  This  new  version 
of  the  equation  is  thus  exactly  correct  regardless  of  what  externally 
generated  fields  are  present! 

Her  next  innovation  starts  by  multiplying  the  equation  on  both 
sides  by  the  area,  A ,  of  one  side  of  the  foil: 


(-F outward,  on  side  1  T  E outward ,  on  side  2 )  A  —  Anker  A 


9no  relation  to  the  human  mathematician  of  the  same  name 
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or 


E outward,  on  side  1  4  -E outward ,  on  side  2  A  —  4 7T /. '  . 


where  q  is  the  charge  of  the  foil.  The  reason  for  this  modification  is 
that  she  can  now  make  the  whole  thing  more  attractive  by  defining 
a  new  vector,  the  area  vector  A.  As  shown  in  figure  a,  she  defines 
an  area  vector  for  side  1  which  has  magnitude  A  and  points  outward 
from  side  1,  and  an  area  vector  for  side  2  which  has  the  same  mag¬ 
nitude  and  points  outward  from  that  side,  which  is  in  the  opposite 
direction.  The  dot  product  of  two  vectors,  u  •  v,  can  be  interpreted 
as  Uparaiiei  to  t)|v|,  and  she  can  therefore  rewrite  her  equation  as 

Ei  •  Ai  +  E2  •  A2  =  Airkq. 

The  quantity  on  the  left  side  of  this  equation  is  called  the  flux 
through  the  surface,  written  <F. 

Gauss  now  writes  a  grant  proposal  to  her  favorite  funding  agency, 
the  BSGS  (Blood-Suckers’  Geological  Survey),  and  it  is  quickly  ap¬ 
proved.  Her  audacious  plan  is  to  send  out  exploring  teams  to  chart 
the  electric  fields  of  the  whole  planet  of  Flatcat,  and  thereby  de¬ 
termine  the  total  electric  charge  of  the  planet.  The  fleas’  world 
is  commonly  assumed  to  be  a  flat  disk,  and  its  size  is  known  to 
be  finite,  since  the  sun  passes  behind  it  at  sunset  and  comes  back 
around  on  the  other  side  at  dawn.  The  most  daring  part  of  the  plan 
is  that  it  requires  surveying  not  just  the  known  side  of  the  planet 
but  the  uncharted  Far  Side  as  well.  No  flea  has  ever  actually  gone 
around  the  edge  and  returned  to  tell  the  tale,  but  Gauss  assures 
them  that  they  won’t  fall  off  —  their  negatively  charged  bodies  will 
be  attracted  to  the  disk  no  matter  which  side  they  are  on. 

Of  course  it  is  possible  that  the  electric  charge  of  planet  Flatcat 
is  not  perfectly  uniform,  but  that  isn’t  a  problem.  As  discussed  in 
subsection  10.3.2,  as  long  as  one  is  very  close  to  the  surface,  the  held 
only  depends  on  the  local  charge  density.  In  fact,  a  side-benefit  of 
Gauss’s  program  of  exploration  is  that  any  such  local  irregularities 
will  be  mapped  out.  But  what  the  newspapers  find  exciting  is  the 
idea  that  once  all  the  teams  get  back  from  their  voyages  and  tabulate 
their  data,  the  total  charge  of  the  planet  will  have  been  determined 
for  the  first  time.  Each  surveying  team  is  assigned  to  visit  a  certain 
list  of  republics,  duchies,  city-states,  and  so  on.  They  are  to  record 
each  territory’s  electric  held  vector,  as  well  as  its  area.  Because  the 
electric  held  may  be  nonuniform,  the  final  equation  for  determining 
the  planet’s  electric  charge  will  have  many  terms,  not  just  one  for 
each  side  of  the  planet: 

T  =  ^  E?  •  A j  =  4-7T kqtotai 
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a  /  The  area  vector  is  de¬ 
fined  to  be  perpendicular  to  the 
surface,  in  the  outward  direction. 
Its  magnitude  tells  how  much  the 
area  is. 


b  /  Gauss  contemplates  a 
map  of  the  known  world. 
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c  /  Each  part  of  the  surface 
has  its  own  area  vector.  Note 
the  differences  in  lengths  of  the 
vectors,  corresponding  to  the 
unequal  areas. 


d  /  An  area  vector  can  be 
defined  for  a  sufficiently  small 
part  of  a  curved  surface. 


Gauss  herself  leads  one  of  the  expeditions,  which  heads  due  east, 
toward  the  distant  Tail  Kingdom,  known  only  from  fables  and  the 
occasional  account  from  a  caravan  of  traders.  A  strange  thing  hap¬ 
pens,  however.  Gauss  embarks  from  her  college  town  in  the  wetlands 
of  the  Tongue  Republic,  travels  straight  east,  passes  right  through 
the  Tail  Kingdom,  and  one  day  finds  herself  right  back  at  home,  all 
without  ever  seeing  the  edge  of  the  world!  What  can  have  happened? 
All  at  once  she  realizes  that  the  world  isn’t  flat. 

Now  what?  The  surveying  teams  all  return,  the  data  are  tabu¬ 
lated,  and  the  result  for  the  total  charge  of  Flatcat  is  (1/47 rk)  ^  E j  ■ 
A j  =  37  nC  (units  of  nanocoulombs).  But  the  equation  was  derived 
under  the  assumption  that  Flatcat  was  a  disk.  If  Flatcat  is  really 
round,  then  the  result  may  be  completely  wrong.  Gauss  and  two 
of  her  grad  students  go  to  their  favorite  bar,  and  decide  to  keep 
on  ordering  Bloody  Marys  until  they  either  solve  their  problems  or 
forget  them.  One  student  suggests  that  perhaps  Flatcat  really  is  a 
disk,  but  the  edges  are  rounded.  Maybe  the  surveying  teams  really 
did  flip  over  the  edge  at  some  point,  but  just  didn’t  realize  it.  Under 
this  assumption,  the  original  equation  will  be  approximately  valid, 
and  37  nC  really  is  the  total  charge  of  Flatcat. 

A  second  student,  named  Newton,  suggests  that  they  take  seri¬ 
ously  the  possibility  that  Flatcat  is  a  sphere.  In  this  scenario,  their 
planet’s  surface  is  really  curved,  but  the  surveying  teams  just  didn’t 
notice  the  curvature,  since  they  were  close  to  the  surface,  and  the 
surface  was  so  big  compared  to  them.  They  divided  up  the  surface 
into  a  patchwork,  and  each  patch  was  fairly  small  compared  to  the 
whole  planet,  so  each  patch  was  nearly  flat.  Since  the  patch  is  nearly 
flat,  it  makes  sense  to  define  an  area  vector  that  is  perpendicular  to 
it.  In  general,  this  is  how  we  define  the  direction  of  an  area  vector, 
as  shown  in  figure  d.  This  only  works  if  the  areas  are  small.  For  in¬ 
stance,  there  would  be  no  way  to  define  an  area  vector  for  an  entire 
sphere,  since  “outward”  is  in  more  than  one  direction. 

If  Flatcat  is  a  sphere,  then  the  inside  of  the  sphere  must  be 
vast,  and  there  is  no  way  of  knowing  exactly  how  the  charge  is 
arranged  below  the  surface.  However,  the  survey  teams  all  found 
that  the  electric  field  was  approximately  perpendicular  to  the  surface 
everywhere,  and  that  its  strength  didn’t  change  very  much  from  one 
location  to  another.  The  simplest  explanation  is  that  the  charge 
is  all  concentrated  in  one  small  lump  at  the  center  of  the  sphere. 
They  have  no  way  of  knowing  if  this  is  really  the  case,  but  it’s  a 
hypothesis  that  allows  them  to  see  how  much  their  37  nC  result 
would  change  if  they  assumed  a  different  geometry.  Making  this 
assumption,  Newton  performs  the  following  simple  computation  on 
a  napkin.  The  field  at  the  surface  is  related  to  the  charge  at  the 
center  by 
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El  = 


kQtot.al 


where  r  is  the  radius  of  Flatcat.  The  flux  is  then 


*  =  £eea* 


and  since  the  E j  and  A j  vectors  are  parallel,  the  dot  product  equals 
|Ej||Aj|,  so 

But  the  held  strength  is  always  the  same,  so  we  can  take  it  outside 
the  sum,  giving 


kQtotal 


A 


total 


kqtotal  4^2 


=  iTTkqtotal- 


Not  only  have  all  the  factors  of  r  canceled  out,  but  the  result  is 
the  same  as  for  a  disk! 

Everyone  is  pleasantly  surprised  by  this  apparent  mathematical 
coincidence,  but  is  it  anything  more  than  that?  For  instance,  what 
if  the  charge  wasn’t  concentrated  at  the  center,  but  instead  was 
evenly  distributed  throughout  Flatcat’s  interior  volume?  Newton, 
however,  is  familiar  with  a  result  called  the  shell  theorem  (page  102), 
which  states  that  the  field  of  a  uniformly  charged  sphere  is  the  same 
as  if  all  the  charge  had  been  concentrated  at  its  center.10  We  now 
have  three  different  assumptions  about  the  shape  of  Flatcat  and  the 
arrangement  of  the  charges  inside  it,  and  all  three  lead  to  exactly  the 
same  mathematical  result,  =  4:irkqtotai-  This  is  starting  to  look 
like  more  than  a  coincidence.  In  fact,  there  is  a  general  mathematical 
theorem,  called  Gauss’  theorem,  which  states  the  following: 

For  any  region  of  space,  the  flux  through  the  surface  equals 
Ankqin,  where  qin  is  the  total  charge  in  that  region. 

Don’t  memorize  the  factor  of  47T  in  front  —  you  can  rederive  it 
any  time  you  need  to,  by  considering  a  spherical  surface  centered  on 
a  point  charge. 

10Newton’s  human  namesake  actually  proved  this  for  gravity,  not  electricity, 
but  they’re  both  1/r2  forces,  so  the  proof  works  equally  well  in  both  cases. 
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e/1.  The  flux  due  to  two 
charges  equals  the  sum  of  the 
fluxes  from  each  one.  2.  When 
two  regions  are  joined  together, 
the  flux  through  the  new  region 
equals  the  sum  of  the  fluxes 
through  the  two  parts. 


Note  that  although  region  and  its  surface  had  a  definite  physical 
existence  in  our  story  —  they  are  the  planet  Flatcat  and  the  surface 
of  planet  Flatcat  —  Gauss’  law  is  true  for  any  region  and  surface  we 
choose,  and  in  general,  the  Gaussian  surface  has  no  direct  physical 
significance.  It’s  simply  a  computational  tool. 

Rather  than  proving  Gauss’  theorem  and  then  presenting  some 
examples  and  applications,  it  turns  out  to  be  easier  to  show  some  ex¬ 
amples  that  demonstrate  its  salient  properties.  Having  understood 
these  properties,  the  proof  becomes  quite  simple. 

self-check  K 

Suppose  we  have  a  negative  point  charge,  whose  field  points  inward, 
and  we  pick  a  Gaussian  surface  which  is  a  sphere  centered  on  that 
charge.  How  does  Gauss’  theorem  apply  here?  >  Answer,  p.  966 

10.6.2  Additivity  of  flux 

Figure  e  shows  two  two  different  ways  in  which  flux  is  additive. 
Figure  e/1,  additivity  by  charge,  shows  that  we  can  break  down  a 
charge  distribution  into  two  or  more  parts,  and  the  flux  equals  the 
sum  of  the  fluxes  due  to  the  individual  charges.  This  follows  directly 
from  the  fact  that  the  flux  is  defined  in  terms  of  a  dot  product,  E- A, 
and  the  dot  product  has  the  additive  property  (a+b)  c  =  a-c  +  b-c. 

To  understand  additivity  of  flux  by  region,  e/2,  we  have  to  con¬ 
sider  the  parts  of  the  two  surfaces  that  were  eliminated  when  they 
were  joined  together,  like  knocking  out  a  wall  to  make  two  small 
apartments  into  one  big  one.  Although  the  two  regions  shared  this 
wall  before  it  was  removed,  the  area  vectors  were  opposite:  the  di¬ 
rection  that  is  outward  from  one  region  is  inward  with  respect  to 
the  other.  Thus  if  the  field  on  the  wall  contributes  positive  flux  to 
one  region,  it  contributes  an  equal  amount  of  negative  flux  to  the 
other  region,  and  we  can  therefore  eliminate  the  wall  to  join  the  two 
regions,  without  changing  the  total  flux. 

10.6.3  Zero  flux  from  outside  charges 

A  third  important  property  of  Gauss’  theorem  is  that  it  only 
refers  to  the  charge  inside  the  region  we  choose  to  discuss.  In  other 
words,  it  asserts  that  any  charge  outside  the  region  contributes  zero 
to  the  flux.  This  makes  at  least  some  sense,  because  a  charge  outside 
the  region  will  have  field  vectors  pointing  into  the  surface  on  one 
side,  and  out  of  the  surface  on  the  other.  Certainly  there  should 
be  at  least  partial  cancellation  between  the  negative  (inward)  flux 
on  one  side  and  the  positive  (outward)  flux  on  the  other.  But  why 
should  this  cancellation  be  exact? 

To  see  the  reason  for  this  perfect  cancellation,  we  can  imagine 
space  as  being  built  out  of  tiny  cubes,  and  we  can  think  of  any  charge 
distribution  as  being  composed  of  point  charges.  The  additivity-by- 
charge  property  tells  us  that  any  charge  distribution  can  be  handled 


626 


Chapter  10 


Fields 


by  considering  its  point  charges  individually,  and  the  additivity-by- 
region  property  tells  us  that  if  we  have  a  single  point  charge  outside 
a  big  region,  we  can  break  the  region  down  into  tiny  cubes.  If  we 
can  prove  that  the  flux  through  such  a  tiny  cube  really  does  cancel 
exactly,  then  the  same  must  be  true  for  any  region,  which  we  could 
build  out  of  such  cubes,  and  any  charge  distribution,  which  we  can 
build  out  of  point  charges. 

For  simplicity,  we  will  carry  out  this  calculation  only  in  the  spe¬ 
cial  case  shown  in  figure  f,  where  the  charge  lies  along  one  axis  of 
the  cube.  Let  the  sides  of  the  cube  have  length  26,  so  that  the  area 
of  each  side  is  (2b)2  =  4b2.  The  cube  extends  a  distance  6  above, 
below,  in  front  of,  and  behind  the  horizontal  x  axis.  There  is  a  dis¬ 
tance  d  —  6  from  the  charge  to  the  left  side,  and  d  +  6  to  the  right 
side. 

There  will  be  one  negative  flux,  through  the  left  side,  and  five 
positive  ones.  Of  these  positive  ones,  the  one  through  the  right  side 
is  very  nearly  the  same  in  magnitude  as  the  negative  flux  through 
the  left  side,  but  just  a  little  less  because  the  field  is  weaker  on 
the  right,  due  to  the  greater  distance  from  the  charge.  The  fluxes 
through  the  other  four  sides  are  very  small,  since  the  field  is  nearly 
perpendicular  to  their  area  vectors,  and  the  dot  product  E j  •  A j  is 
zero  if  the  two  vectors  are  perpendicular.  In  the  limit  where  6  is 
very  small,  we  can  approximate  the  flux  by  evaluating  the  field  at 
the  center  of  each  of  the  cube’s  six  sides,  giving 

d*  =  ‘h left  +  4*3? side  +  bright 

—  E/e/t  1 1  A-ieft  |  COS  180  T  4|Es^e|  |  As^e|  COS  0sic[e 
T  |  bright  \  |  A right  |  COS  0  , 


d 

f/The  flux  through  a  tiny  cube 
due  to  a  point  charge. 


and  a  little  trig  gives  cos  Qside  ~  6/d,  so 


4*  |  ^left  1 1  A-left  |  +  4|Esirfe|  |  Asic;e|  ^  +  |  bright  \  |  Aj- ight  \ 
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Using  the  approximation  (1  +  e)~2  ~  1  —  2e  for  small  e,  this  becomes 


<F  = 

=  0. 


/ 4kqb2 


26  46  „  26 

_1“T  +  ¥  +  1_¥ 


Thus  in  the  limit  of  a  very  small  cube,  6  <C  d,  we  have  proved 
that  the  flux  due  to  this  exterior  charge  is  zero.  The  proof  can  be 
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extended  to  the  case  where  the  charge  is  not  along  any  axis  of  the 
cube,11  and  based  on  additivity  we  then  have  a  proof  that  the  flux 
due  to  an  outside  charge  is  always  zero. 

No  charge  on  the  interior  of  a  conductor  example  36 

I  asserted  on  p.  529  that  for  a  perfect  conductor  in  equilibrium,  ex¬ 
cess  charge  is  found  only  at  the  surface,  never  in  the  interior.  This 
can  be  proved  using  Gauss’s  theorem.  Suppose  that  a  charge  q 
existed  at  some  point  in  the  interior,  and  it  was  in  stable  equilib¬ 
rium.  For  concreteness,  let’s  say  q  is  positive.  If  its  equilibrium 
is  to  be  stable,  then  we  need  an  electric  field  everywhere  around 
it  that  points  inward  like  a  pincushion,  so  that  if  the  charge  were 
to  be  perturbed  slightly,  the  field  would  bring  it  back  to  its  equi¬ 
librium  position.  Since  Newton’s  third  law  forbids  objects  from 
making  forces  on  themselves,  this  field  would  have  to  be  the  field 
contributed  by  all  the  other  charges,  not  by  q  itself.  But  this  is  im¬ 
possible,  because  this  kind  of  inward-pointing  pincushion  pattern 
would  have  a  nonzero  (negative)  flux  through  the  pincushion,  but 
Gauss’s  theorem  says  we  can’t  have  flux  from  outside  charges. 


11The  math  gets  messy  for  the  off-axis  case.  This  part  of  the  proof  can  be 
completed  more  easily  and  transparently  using  the  techniques  of  section  10.7, 
and  that  is  exactly  we’ll  do  in  example  38  on  page  636. 
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g  /  Discussion  question  A-D. 


A  One  question  that  might  naturally  occur  to  you  about  Gauss’s  law 
is  what  happens  for  charge  that  is  exactly  on  the  surface  —  should  it  be 
counted  toward  the  enclosed  charge,  or  not?  If  charges  can  be  perfect, 
infinitesimal  points,  then  this  could  be  a  physically  meaningful  question. 
Suppose  we  approach  this  question  by  way  of  a  limit:  start  with  charge  q 
spread  out  over  a  sphere  of  finite  size,  and  then  make  the  size  of  the 
sphere  approach  zero.  The  figure  shows  a  uniformly  charged  sphere 
that’s  exactly  half-way  in  and  half-way  out  of  the  cubical  Gaussian  sur¬ 
face.  What  is  the  flux  through  the  cube,  compared  to  what  it  would  be  if 
the  charge  was  entirely  enclosed?  (There  are  at  least  three  ways  to  find 
this  flux:  by  direct  integration,  by  Gauss’s  law,  or  by  the  additivity  of  flux 
by  region.) 

B  The  dipole  is  completely  enclosed  in  the  cube.  What  does  Gauss’s 
law  say  about  the  flux  through  the  cube?  If  you  imagine  the  dipole’s  field 
pattern,  can  you  verify  that  this  makes  sense? 

C  The  wire  passes  in  through  one  side  of  the  cube  and  out  through 
the  other.  If  the  current  through  the  wire  is  increasing,  then  the  wire  will 
act  like  an  inductor,  and  there  will  be  a  voltage  difference  between  its 
ends.  (The  inductance  will  be  relatively  small,  since  the  wire  isn’t  coiled 
up,  and  the  AV  will  therefore  also  be  fairly  small,  but  still  not  zero.)  The 
AV  implies  the  existence  of  electric  fields,  and  yet  Gauss’s  law  says  the 
flux  must  be  zero,  since  there  is  no  charge  inside  the  cube.  Why  isn’t 
Gauss’s  law  violated? 

D  The  charge  has  been  loitering  near  the  edge  of  the  cube,  but  is 
then  suddenly  hit  with  a  mallet,  causing  it  to  fly  off  toward  the  left  side 
of  the  cube.  We  haven’t  yet  discussed  in  detail  how  disturbances  in  the 
electric  and  magnetic  fields  ripple  outward  through  space,  but  it  turns  out 
that  they  do  so  at  the  speed  of  light.  (In  fact,  that’s  what  light  is:  ripples 
in  the  electric  and  magnetic  fields.)  Because  the  charge  is  closer  to  the 
left  side  of  the  cube,  the  change  in  the  electric  field  occurs  there  before 
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h  /  Completing  the  proof  of 
Gauss’  theorem. 


the  information  reaches  the  right  side.  This  would  seem  certain  to  lead 
to  a  violation  of  Gauss’s  law.  How  can  the  ideas  explored  in  discussion 
question  C  show  the  resolution  to  this  paradox? 

10.6.4  Proof  of  Gauss’ theorem 

With  the  computational  machinery  we’ve  developed,  it  is  now 
simple  to  prove  Gauss’  theorem.  Based  on  additivity  by  charge,  it 
suffices  to  prove  the  law  for  a  point  charge.  We  have  already  proved 
Gauss’  law  for  a  point  charge  in  the  case  where  the  point  charge  is 
outside  the  region.  If  we  can  prove  it  for  the  inside  case,  then  we’re 
all  done. 

If  the  charge  is  inside,  we  reason  as  follows.  First,  we  forget 
about  the  actual  Gaussian  surface  of  interest,  and  instead  construct 
a  spherical  one,  centered  on  the  charge.  For  the  case  of  a  sphere, 
we’ve  already  seen  the  proof  written  on  a  napkin  by  the  flea  named 
Newton  (page  624).  Now  wherever  the  actual  surface  sticks  out 
beyond  the  sphere,  we  glue  appropriately  shaped  pieces  onto  the 
sphere.  In  the  example  shown  in  figure  h,  we  have  to  add  two 
Mickey  Mouse  ears.  Since  these  added  pieces  do  not  contain  the 
point  charge,  the  flux  through  them  is  zero,  and  additivity  of  flux 
by  region  therefore  tells  us  that  the  total  flux  is  not  changed  when 
we  make  this  alteration.  Likewise,  we  need  to  chisel  out  any  regions 
where  the  sphere  sticks  out  beyond  the  actual  surface.  Again,  there 
is  no  change  in  flux,  since  the  region  being  altered  doesn’t  contain 
the  point  charge.  This  proves  that  the  flux  through  the  Gaussian 
surface  of  interest  is  the  same  as  the  flux  through  the  sphere,  and 
since  we’ve  already  proved  that  that  flux  equals  4 irkqin,  our  proof 
of  Gauss’  theorem  is  complete. 

Discussion  Questions 

A  A  critical  part  of  the  proof  of  Gauss’  theorem  was  the  proof  that 
a  tiny  cube  has  zero  flux  through  it  due  to  an  external  charge.  Discuss 
qualitatively  why  this  proof  would  fail  if  Coulomb’s  law  was  a  1/ror  1/r3 
law. 


10.6.5  Gauss’  law  as  a  fundamental  law  of  physics 

Note  that  the  proof  of  Gauss’  theorem  depended  on  the  compu¬ 
tation  on  the  napkin  discussed  on  page  10.6.1.  The  crucial  point  in 
this  computation  was  that  the  electric  field  of  a  point  charge  falls 
off  like  1/r2,  and  since  the  area  of  a  sphere  is  proportional  to  r2, 
the  result  is  independent  of  r.  The  1/r2  variation  of  the  field  also 
came  into  play  on  page  627  in  the  proof  that  the  flux  due  to  an  out¬ 
side  charge  is  zero.  In  other  words,  if  we  discover  some  other  force 
of  nature  which  is  proportional  to  1/r3  or  r,  then  Gauss’  theorem 
will  not  apply  to  that  force.  Gauss’  theorem  is  not  true  for  nuclear 
forces,  which  fall  off  exponentially  with  distance.  However,  this  is 
the  only  assumption  we  had  to  make  about  the  nature  of  the  field. 
Since  gravity,  for  instance,  also  has  fields  that  fall  off  as  1/r2,  Gauss’ 
theorem  is  equally  valid  for  gravity  —  we  just  have  to  replace  mass 
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with  charge,  change  the  Coulomb  constant  k  to  the  gravitational 
constant  G,  and  insert  a  minus  sign  because  the  gravitational  fields 
around  a  (positive)  mass  point  inward. 

Gauss’  theorem  can  only  be  proved  if  we  assume  a  1/r2  field, 
and  the  converse  is  also  true:  any  field  that  satisfies  Gauss’  theo¬ 
rem  must  be  a  1/r2  field.  Thus  although  we  previously  thought  of 
Coulomb’s  law  as  the  fundamental  law  of  nature  describing  electric 
forces,  it  is  equally  valid  to  think  of  Gauss’  theorem  as  the  basic  law 
of  nature  for  electricity.  From  this  point  of  view,  Gauss’  theorem  is 
not  a  mathematical  fact  but  an  experimentally  testable  statement 
about  nature,  so  we’ll  refer  to  it  as  Gauss’  law ,  just  as  we  speak  of 
Coulomb’s  law  or  Newton’s  law  of  gravity. 

If  Gauss’  law  is  equivalent  to  Coulomb’s  law,  why  not  just  use 
Coulomb’s  law?  First,  there  are  some  cases  where  calculating  a 
field  is  easy  with  Gauss’  law,  and  hard  with  Coulomb’s  law.  More 
importantly,  Gauss’  law  and  Coulomb’s  law  are  only  mathematically 
equivalent  under  the  assumption  that  all  our  charges  are  standing 
still,  and  all  our  fields  are  constant  over  time,  i.e.,  in  the  study 
of  electrostatics,  as  opposed  to  electrodynamics.  As  we  broaden 
our  scope  to  study  generators,  inductors,  transformers,  and  radio 
antennas,  we  will  encounter  cases  where  Gauss’  law  is  valid,  but 
Coulomb’s  law  is  not. 


10.6.6  Applications 

Often  we  encounter  situations  where  we  have  a  static  charge 
distribution,  and  we  wish  to  determine  the  field.  Although  super¬ 
position  is  a  generic  strategy  for  solving  this  type  of  problem,  if  the 
charge  distribution  is  symmetric  in  some  way,  then  Gauss’  law  is 
often  a  far  easier  way  to  carry  out  the  computation. 

Field  of  a  long  line  of  charge 

Consider  the  field  of  an  infinitely  long  line  of  charge,  holding  a 
uniform  charge  per  unit  length  A.  Computing  this  field  by  brute- 
force  superposition  was  fairly  laborious  (examples  11  on  page  579 
and  14  on  page  586).  With  Gauss’  law  it  becomes  a  very  simple 
calculation. 

The  problem  has  two  types  of  symmetry.  The  line  of  charge, 
and  therefore  the  resulting  field  pattern,  look  the  same  if  we  rotate 
them  about  the  line.  The  second  symmetry  occurs  because  the  line 
is  infinite:  if  we  slide  the  line  along  its  own  length,  nothing  changes. 
This  sliding  symmetry,  known  as  a  translation  symmetry,  tells  us 
that  the  field  must  point  directly  away  from  the  line  at  any  given 
point. 

Based  on  these  symmetries,  we  choose  the  Gaussian  surface 
shown  in  figure  i.  If  we  want  to  know  the  field  at  a  distance  R 
from  the  line,  then  we  choose  this  surface  to  have  a  radius  R ,  as 


i  /  Applying  Gauss’  law  to  an 
infinite  line  of  charge. 
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shown  in  the  figure.  The  length,  L,  of  the  surface  is  irrelevant. 

The  field  is  parallel  to  the  surface  on  the  end  caps,  and  therefore 
perpendicular  to  the  end  caps’  area  vectors,  so  there  is  no  contribu¬ 
tion  to  the  flux.  On  the  long,  thin  strips  that  make  up  the  rest  of  the 
surface,  the  field  is  perpendicular  to  the  surface,  and  therefore  paral¬ 
lel  to  the  area  vector  of  each  strip,  so  that  the  dot  product  occurring 
in  the  definition  of  the  flux  is  E j  •  A j  =  |  E  j  1 1 A  j  1 1  cos  0°  =  |Ej||Aj|. 
Gauss’  law  gives 

47T  kqin  =  Y  Ej  •  Aj 

AnkXL  =  Emm 

The  magnitude  of  the  field  is  the  same  on  every  strip,  so  we  can 
take  it  outside  the  sum. 

AnkXL  =  |E|  J]|A_j| 

In  the  limit  where  the  strips  are  infinitely  narrow,  the  surface  be¬ 
comes  a  cylinder,  with  (area)=(circumference)(length)=27ri?L. 

AirkXL  =  El  x  2-kRL 


j/ Applying  Gauss’  law  to  an 
infinite  charged  surface. 


Field  near  a  surface  charge 

As  claimed  earlier,  the  result  E  =  2irka  for  the  field  near  a 
charged  surface  is  a  special  case  of  Gauss’  law.  We  choose  a  Gaussian 
surface  of  the  shape  shown  in  figure  j ,  known  as  a  Gaussian  pillbox. 
The  exact  shape  of  the  flat  end  caps  is  unimportant. 

The  symmetry  of  the  charge  distribution  tells  us  that  the  field 
points  directly  away  from  the  surface,  and  is  equally  strong  on  both 
sides  of  the  surface.  This  means  that  the  end  caps  contribute  equally 
to  the  flux,  and  the  curved  sides  have  zero  flux  through  them.  If  the 
area  of  each  end  cap  is  A,  then 

47 Tkqin  =  Ei  •  Ai  +  E2  •  A2, 

where  the  subscripts  1  and  2  refer  to  the  two  end  caps.  We  have 
A2  =  -Ai,  so 


Airkqin  =  Ei  •  Ai  -  E2  •  Ai 
47t kqin  =  (Ei  -  E2)  •  Ai, 


and  by  symmetry  the  magnitudes  of  the  two  fields  are  equal,  so 


2|E|A  =  AirkcrA 
El  =  2TTk(T 
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The  symmetry  between  the  two  sides  could  be  broken  by  the 
existence  of  other  charges  nearby,  whose  fields  would  add  onto  the 
field  of  the  surface  itself.  Even  then,  Gauss’s  law  still  guarantees 

4vr kqin  =  (Ei  -  E2)  •  Ai, 


or 


|Ej_;i  —  E_li2|  =  47rfe<r, 

where  the  subscript  _L  indicates  the  component  of  the  field  parallel 
to  the  surface  (i.e.,  parallel  to  the  area  vectors).  In  other  words, 
the  electric  field  changes  rliscontinuously  when  we  pass  through  a 
charged  surface;  the  discontinuity  occurs  in  the  component  of  the 
field  perpendicular  to  the  surface,  and  the  amount  of  discontinuous 
change  is  4irka.  This  is  a  completely  general  statement  that  is  true 
near  any  charged  surface,  regardless  of  the  existence  of  other  charges 
nearby. 
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10.7  Gauss’  Law  In  Differential  Form 


a/ A  tiny  cubical  Gaussian 
surface. 


Gauss’  law  is  a  bit  spooky.  It  relates  the  field  on  the  Gaussian 
surface  to  the  charges  inside  the  surface.  What  if  the  charges  have 
been  moving  around,  and  the  field  at  the  surface  right  now  is  the  one 
that  was  created  by  the  charges  in  their  previous  locations?  Gauss’ 
law  —  unlike  Coulomb’s  law  —  still  works  in  cases  like  these,  but 
it’s  far  from  obvious  how  the  flux  and  the  charges  can  still  stay  in 
agreement  if  the  charges  have  been  moving  around. 

For  this  reason,  it  would  be  more  physically  attractive  to  restate 
Gauss’  law  in  a  different  form,  so  that  it  related  the  behavior  of 
the  field  at  one  point  to  the  charges  that  were  actually  present  at 
that  point.  This  is  essentially  what  we  were  doing  in  the  fable  of 
the  flea  named  Gauss:  the  fleas’  plan  for  surveying  their  planet  was 
essentially  one  of  dividing  up  the  surface  of  their  planet  (which  they 
believed  was  flat)  into  a  patchwork,  and  then  constructing  small 
a  Gaussian  pillbox  around  each  small  patch.  The  equation  E±  = 
2irkcr  then  related  a  particular  property  of  the  local  electric  field  to 
the  local  charge  density. 

In  general,  charge  distributions  need  not  be  confined  to  a  flat 
surface  —  life  is  three-dimensional  —  but  the  general  approach  of 
defining  very  small  Gaussian  surfaces  is  still  a  good  one.  Our  strat¬ 
egy  is  to  divide  up  space  into  tiny  cubes,  like  the  one  on  page  626. 
Each  such  cube  constitutes  a  Gaussian  surface,  which  may  contain 
some  charge.  Again  we  approximate  the  field  using  its  six  values  at 
the  center  of  each  of  the  six  sides.  Let  the  cube  extend  from  x  to 
x  +  dx,  from  y  to  y  +  d y,  and  from  y  to  y  +  d y. 

The  sides  at  x  and  x  +  dx  have  area  vectors  —  d y  dzx  and  d y  dzx, 
respectively.  The  flux  through  the  side  at  x  is  —Ex(x)dydz,  and 
the  flux  through  the  opposite  side,  at  x  +  dx  is  Ex(x  +  dx)  dydz. 
The  sum  of  these  is  (Ex(x  +  dx)  —  Ex(x))dydz,  and  if  the  field 
was  uniform,  the  flux  through  these  two  opposite  sides  would  be 
zero.  It  will  only  be  zero  if  the  field’s  x  component  changes  as  a 
function  of  x.  The  difference  Ex(x  +  dx)  —  Ex(x)  can  be  rewritten 
as  d Ex  =  (dEx)  /  (dx)  dx,  so  the  contribution  to  the  flux  from  these 
two  sides  of  the  cube  ends  up  being 


d  Ex 
dx 


dx  d  y  d  z. 


Doing  the  same  for  the  other  sides,  we  end  up  with  a  total  flux 


d<L 


fd  Ex 
\  dx 
dEx 
dx 


d  Ey 
d  y 
d  Ey 
d  y 


+ 

+ 


dx  dy  dz 
dv, 
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where  dw  is  the  volume  of  the  cube.  In  evaluating  each  of  these 
three  derivatives,  we  are  going  to  treat  the  other  two  variables  as 
constants,  to  emphasize  this  we  use  the  partial  derivative  notation 
8  introduced  in  chapter  3, 


d<F  = 


dEr  fJEv  8EZ  .  , 

+  +  -M  1  dv. 


dx  dy 


dz 


Using  Gauss’  law, 


47r  kqin  = 


dEr  d  E„  dE2 


+ 


dx  dy 


+ 


dz 


dv, 


and  we  introduce  the  notation  p  (Greek  letter  rho)  for  the  charge 
per  unit  volume,  giving 


Airkp 


dEx  dEy  dEz 
dx  dy  dz 


The  quantity  on  the  right  is  called  the  divergence  of  the  electric 
field,  written  div  E.  Using  this  notation,  we  have 


div  E  =  47 xkp. 

This  equation  has  all  the  same  physical  implications  as  Gauss’  law. 
After  all,  we  proved  Gauss’  law  by  breaking  down  space  into  little 
cubes  like  this.  We  therefore  refer  to  it  as  the  differential  form  of 
Gauss’  law,  as  opposed  to  4?  =  47T kqin,  which  is  called  the  integral 
form. 

Figure  b  shows  an  intuitive  way  of  visualizing  the  meaning  of 
the  divergence.  The  meter  consists  of  some  electrically  charged  balls 
connected  by  springs.  If  the  divergence  is  positive,  then  the  whole 
cluster  will  expand,  and  it  will  contract  its  volume  if  it  is  placed  at 
a  point  where  the  held  has  divE  <  0.  What  if  the  held  is  constant? 
We  know  based  on  the  definition  of  the  divergence  that  we  should 
have  divE  =  0  in  this  case,  and  the  meter  does  give  the  right  result: 
all  the  balls  will  feel  a  force  in  the  same  direction,  but  they  will 
neither  expand  nor  contract. 

Divergence  of  a  sine  wave  example  37 

>  Figure  c  shows  an  electric  field  that  varies  as  a  sine  wave.  This 
is  in  fact  what  you’d  see  in  a  light  wave:  light  is  a  wave  pattern 
made  of  electric  and  magnetic  fields.  (The  magnetic  field  would 
look  similar,  but  would  be  in  a  plane  perpendicular  to  the  page.) 
What  is  the  divergence  of  such  a  field,  and  what  is  the  physical 
significance  of  the  result? 

o  Intuitively,  we  can  see  that  no  matter  where  we  put  the  div-meter 
in  this  field,  it  will  neither  expand  nor  contract.  For  instance,  if  we 
put  it  at  the  center  of  the  figure,  it  will  start  spinning,  but  that’s  it. 


b/A  meter  for  measuring 
divE. 
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Mathematically,  let  the  x  axis  be  to  the  right  and  let  y  be  up.  The 
field  is  of  the  form 

E  =  (sinKx)  y, 

where  the  constant  K  is  not  to  be  confused  with  Coulomb’s  con¬ 
stant.  Since  the  field  has  only  a  y  component,  the  only  term  in 
the  divergence  we  need  to  evaluate  is 

E 

dy  ’ 

but  this  vanishes,  because  Ey  depends  only  on  x,  not  y:  we  treat 
y  as  a  constant  when  evaluating  the  partial  derivative  dEy/dy, 
and  the  derivative  of  an  expression  containing  only  constants 
must  be  zero. 

Physically  this  is  a  very  important  result:  it  tells  us  that  a  light 
wave  can  exist  without  any  charges  along  the  way  to  “keep  it  go¬ 
ing.”  In  other  words,  light  can  travel  through  a  vacuum,  a  region 
with  no  particles  in  it.  If  this  wasn’t  true,  we’d  be  dead,  because 
the  sun’s  light  wouldn’t  be  able  to  get  to  us  through  millions  of 
kilometers  of  empty  space! 

Electric  field  of  a  point  charge  example  38 

The  case  of  a  point  charge  is  tricky,  because  the  field  behaves 
badly  right  on  top  of  the  charge,  blowing  up  and  becoming  dis¬ 
continuous.  At  this  point,  we  cannot  use  the  component  form  of 
the  divergence,  since  none  of  the  derivatives  are  well  defined. 
However,  a  little  visualization  using  the  original  definition  of  the 
divergence  will  quickly  convince  us  that  div  E  is  infinite  here,  and 
that  makes  sense,  because  the  density  of  charge  has  to  be  in¬ 
finite  at  a  point  where  there  is  a  zero-size  point  of  charge  (finite 
charge  in  zero  volume). 

At  all  other  points,  we  have 


where  r  =  r/r  =  (xx  +  yy  +  zz)/r  is  the  unit  vector  pointing  radially 
away  from  the  charge.  The  field  can  therefore  be  written  as 


_  kq(xx  +  yy  +  zz) 

~  (x2  +  y2  +  z2)3/2  ' 

The  three  terms  in  the  divergence  are  all  similar,  e.g., 

dE*  =  koJL  [  x 

dx  dx  [(x2+y2  +  z2)3/2 

1  3  2x2 

(x2  +  y2  +  z2)3/2  2  (x2  +  y2  +  z2)5/2 

=  kq  (7“3  -  3 x2r~5^  . 
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Straightforward  algebra  shows  that  adding  in  the  other  two  terms 
results  in  zero,  which  makes  sense,  because  there  is  no  charge 
except  at  the  origin. 

Gauss’  law  in  differential  form  lends  itself  most  easily  to  finding 
the  charge  density  when  we  are  give  the  field.  What  if  we  want 
to  find  the  field  given  the  charge  density?  As  demonstrated  in  the 
following  example,  one  technique  that  often  works  is  to  guess  the 
general  form  of  the  field  based  on  experience  or  physical  intuition, 
and  then  try  to  use  Gauss’  law  to  find  what  specific  version  of  that 
general  form  will  be  a  solution. 

The  field  inside  a  uniform  sphere  of  charge  example  39 

>  Find  the  field  inside  a  uniform  sphere  of  charge  whose  charge 
density  is  p.  (This  is  very  much  like  finding  the  gravitational  field 
at  some  depth  below  the  surface  of  the  earth.) 

>  By  symmetry  we  know  that  the  field  must  be  purely  radial  (in 
and  out).  We  guess  that  the  solution  might  be  of  the  form 

E  =  brp  r, 

where  r  is  the  distance  from  the  center,  and  b  and  p  are  con¬ 
stants.  A  negative  value  of  p  would  indicate  a  field  that  was 
strongest  at  the  center,  while  a  positive  p  would  give  zero  field 
at  the  center  and  stronger  fields  farther  out.  Physically,  we  know 
by  symmetry  that  the  field  is  zero  at  the  center,  so  we  expect  p  to 
be  positive. 

As  in  the  example  38,  we  rewrite  r  as  r/r,  and  to  simplify  the 
writing  we  define  n  =  p  -  1 ,  so 

E  =  brnr. 

Gauss’  law  in  differential  form  is 


div  E  =  47t/cp, 


so  we  want  a  field  whose  divergence  is  constant.  For  a  field  of 
the  form  we  guessed,  the  divergence  has  terms  in  it  like 


dEx 

dx 


d_ 

dx 


(. brnx ) 


=  b 


dr  n\ 
d~xX  +  r  ) 


The  partial  derivative  dr/dx  is  easily  calculated  to  be  x/r,  so 
^  =  i,(nr"-V  +  r”) 

Adding  in  similar  expressions  for  the  other  two  terms  in  the  diver¬ 
gence,  and  making  use  of  x2  +  y2  +  z2  =  r2,  we  have 

divE  =  b{n  +  3)rn. 
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This  can  indeed  be  constant,  but  only  if  n  is  0  or  -3,  i.e.,  p  is  1 
or  -2.  The  second  solution  gives  a  divergence  which  is  constant 
and  zero :  this  is  the  solution  for  the  outside  of  the  sphere!  The 
first  solution,  which  has  the  field  directly  proportional  to  r,  must 
be  the  one  that  applies  to  the  inside  of  the  sphere,  which  is  what 
we  care  about  right  now.  Equating  the  coefficient  in  front  to  the 
one  in  Gauss’  law,  the  field  is 


The  field  is  zero  at  the  center,  and  gets  stronger  and  stronger  as 
we  approach  the  surface. 

Discussion  Questions 

A  As  suggested  by  the  figure,  discuss  the  results  you  would  get  by 
inserting  the  div-meter  at  various  locations  in  the  sine-wave  field. 

This  chapter  is  summarized  on  page  1000.  Notation  and  terminology 
are  tabulated  on  pages  984-985. 


d  /  Discussion  question  A. 
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The  symbols  C,  - ,  etc.  are  explained  on  page  649. 

1  The  gap  between  the  electrodes  in  an  automobile  engine’s 

spark  plug  is  0.060  cm.  To  produce  an  electric  spark  in  a  gasoline- 
air  mixture,  an  electric  field  of  3.0  x  106  V/m  must  be  achieved. 
On  starting  a  car,  what  minimum  voltage  must  be  supplied  by  the 
ignition  circuit?  Assume  the  field  is  uniform.  V 

(b)  The  small  size  of  the  gap  between  the  electrodes  is  inconvenient 
because  it  can  get  blocked  easily,  and  special  tools  are  needed  to 
measure  it.  Why  don’t  they  design  spark  plugs  with  a  wider  gap? 

■ 

2  (a)  As  suggested  in  example  10  on  page  578,  use  approxi¬ 
mations  to  show  that  the  expression  given  for  the  electric  field  ap¬ 
proaches  kQ/d2  for  large  d. 

(b)  Do  the  same  for  the  result  of  example  13  on  page  582. 

3  Astronomers  believe  that  the  mass  distribution  (mass  per 

unit  volume)  of  some  galaxies  may  be  approximated,  in  spherical 
coordinates,  by  p  =  cie~br ,  for  0  <  r  <  oo,  where  p  is  the  density. 
Find  the  total  mass.  ^ 

4  (a)  At  time  t  =  0,  a  positively  charged  particle  is  placed, 

at  rest,  in  a  vacuum,  in  which  there  is  a  uniform  electric  field  of 
magnitude  E.  Write  an  equation  giving  the  particle’s  speed,  v,  in 
terms  of  t,  E,  and  its  mass  and  charge  m  and  q.  v 

(b)  If  this  is  done  with  two  different  objects  and  they  are  observed 
to  have  the  same  motion,  what  can  you  conclude  about  their  masses 
and  charges?  (For  instance,  when  radioactivity  was  discovered,  it 
was  found  that  one  form  of  it  had  the  same  motion  as  an  electron 
in  this  type  of  experiment.) 

5  Show  that  the  alternative  definition  of  the  magnitude  of  the 
electric  field,  |E|  =  r/Dt  sin  9,  has  units  that  make  sense. 

6  Redo  the  calculation  of  example  5  on  page  572  using  a  different 

origin  for  the  coordinate  system,  and  show  that  you  get  the  same 
result.  - 

7  The  definition  of  the  dipole  moment,  D  =  )C  g,rj,  involves  the 
vector  Yi  stretching  from  the  origin  of  our  coordinate  system  out  to 
the  charge  g^.  There  are  clearly  cases  where  this  causes  the  dipole 
moment  to  be  dependent  on  the  choice  of  coordinate  system.  For 
instance,  if  there  is  only  one  charge,  then  we  could  make  the  dipole 
moment  equal  zero  if  we  chose  the  origin  to  be  right  on  top  of  the 
charge,  or  nonzero  if  we  put  the  origin  somewhere  else. 

(a)  Make  up  a  numerical  example  with  two  charges  of  equal  mag¬ 
nitude  and  opposite  sign.  Compute  the  dipole  moment  using  two 
different  coordinate  systems  that  are  oriented  the  same  way,  but 
differ  in  the  choice  of  origin.  Comment  on  the  result. 
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(b)  Generalize  the  result  of  part  a  to  any  pair  of  charges  with  equal 
magnitude  and  opposite  sign.  This  is  supposed  to  be  a  proof  for  any 
arrangement  of  the  two  charges,  so  don’t  assume  any  numbers. 

(c)  Generalize  further,  to  n  charges.  I 

8  Compare  the  two  dipole  moments.  - 

9  Find  an  arrangement  of  charges  that  has  zero  total  charge  and 
zero  dipole  moment,  but  that  will  make  nonvanishing  electric  fields. 

■ 

10  As  suggested  in  example  12  on  page  581,  show  that  you  can 
get  the  same  result  for  the  on-axis  field  by  differentiating  the  voltage 


Problem  8. 


qi  • 

Problem  1 1 . 


Problem  13. 


11  Three  charges  are  arranged  on  a  square  as  shown.  All  three 

charges  are  positive.  What  value  of  <72/(71  will  produce  zero  electric 
field  at  the  center  of  the  square?  V  ^ 

12  This  is  a  one-dimensional  problem,  with  everything  confined 
to  the  x  axis.  Dipole  A  consists  of  a  —1.000  C  charge  at  x  =  0.000 
m  and  a  1.000  C  charge  at  x  =  1.000  m.  Dipole  B  has  a  —2.000  C 
charge  at  x  =  0.000  m  and  a  2.000  C  charge  at  x  =  0.500  m. 

(a)  Compare  the  two  dipole  moments. 

(b)  Calculate  the  field  created  by  dipole  A  at  x  =  10.000  m,  and 

compare  with  the  held  dipole  B  would  make.  Comment  on  the 
result.  V 

13  In  our  by-now-familiar  neuron,  the  voltage  difference  be¬ 

tween  the  inner  and  outer  surfaces  of  the  cell  membrane  is  about 
Vout  ~  Vin  =  —70  mV  in  the  resting  state,  and  the  thickness  of  the 
membrane  is  about  6.0  nrn  (i.e.,  only  about  a  hundred  atoms  thick). 
What  is  the  electric  held  inside  the  membrane?  V  ^ 

14  A  proton  is  in  a  region  in  which  the  electric  held  is  given  by 

E  =  a  +  bx3.  If  the  proton  starts  at  rest  at  x\  =  0,  hnd  its  speed, 
v,  when  it  reaches  position  X2 ■  Give  your  answer  in  terms  of  a,  b, 
X2 ,  and  e  and  m,  the  charge  and  mass  of  the  proton.  V  - 

15  (a)  Given  that  the  on-axis  held  of  a  dipole  at  large  distances  is 
proportional  to  D/r 3,  show  that  its  voltage  varies  as  D/r 2 .  (Ignore 
positive  and  negative  signs  and  numerical  constants  of  proportion¬ 
ality.) 

(b)  Write  down  an  exact  expression  for  the  voltage  of  a  two-charge 
dipole  at  an  on-axis  point,  without  assuming  that  the  distance  is 
large  compared  to  the  size  of  the  dipole.  Your  expression  will  have 
to  contain  the  actual  charges  and  size  of  the  dipole,  not  just  its  dipole 
moment.  Now  use  approximations  to  show  that,  at  large  distances, 
this  is  consistent  with  your  answer  to  part  a.  i>  Hint,  p.  960 

16  A  hydrogen  atom  is  electrically  neutral,  so  at  large  distances, 
we  expect  that  it  will  create  essentially  zero  electric  held.  This  is 
not  true,  however,  near  the  atom  or  inside  it.  Very  close  to  the 
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proton,  for  example,  the  field  is  very  strong.  To  see  this,  think  of 
the  electron  as  a  spherically  symmetric  cloud  that  surrounds  the 
proton,  getting  thinner  and  thinner  as  we  get  farther  away  from  the 
proton.  (Quantum  mechanics  tells  us  that  this  is  a  more  correct 
picture  than  trying  to  imagine  the  electron  orbiting  the  proton.) 
Near  the  center  of  the  atom,  the  electron  cloud’s  field  cancels  out 
by  symmetry,  but  the  proton’s  held  is  strong,  so  the  total  held  is 
very  strong.  The  voltage  in  and  around  the  hydrogen  atom  can  be 
approximated  using  an  expression  of  the  form  V  =  r~1e~r .  (The 
units  come  out  wrong,  because  I’ve  left  out  some  constants.)  Find 
the  electric  held  corresponding  to  this  voltage,  and  comment  on  its 
behavior  at  very  large  and  very  small  r.  >  Solution,  p.  980  - 

17  A  carbon  dioxide  molecule  is  structured  like  O-C-O,  with 

all  three  atoms  along  a  line.  The  oxygen  atoms  grab  a  little  bit  of 
extra  negative  charge,  leaving  the  carbon  positive.  The  molecule’s 
symmetry,  however,  means  that  it  has  no  overall  dipole  moment, 
unlike  a  V-shaped  water  molecule,  for  instance.  Whereas  the  voltage 
of  a  dipole  of  magnitude  D  is  proportional  to  D/r2  (see  problem  15), 
it  turns  out  that  the  voltage  of  a  carbon  dioxide  molecule  at  a  distant 
point  along  the  molecule’s  axis  equals  6/r3,  where  r  is  the  distance 
from  the  molecule  and  6  is  a  constant  (cf.  problem  9).  What  would 
be  the  electric  held  of  a  carbon  dioxide  molecule  at  a  point  on  the 
molecule’s  axis,  at  a  distance  r  from  the  molecule?  v 

18  A  hydrogen  atom  in  a  particular  state  has  the  charge  density 
(charge  per  unit  volume)  of  the  electron  cloud  given  by  p  =  ae~brz 2, 
where  r  is  the  distance  from  the  proton,  and  z  is  the  coordinate  mea¬ 
sured  along  the  z  axis.  Given  that  the  total  charge  of  the  electron 
cloud  must  be  —  e,  find  a  in  terms  of  the  other  variables. 

19  A  dipole  has  a  midplane,  i.e.,  the  plane  that  cuts  through  the 

dipole’s  center,  and  is  perpendicular  to  the  dipole’s  axis.  Consider 
a  two-charge  dipole  made  of  point  charges  ±q  located  at  z  =  ±1/2. 
Use  approximations  to  find  the  field  at  a  distant  point  in  the  mid¬ 
plane,  and  show  that  its  magnitude  comes  out  to  be  k ;D/R3  (half 
what  it  would  be  at  a  point  on  the  axis  lying  an  equal  distance  from 
the  dipole).  ■ 

20  The  figure  shows  a  vacuum  chamber  surrounded  by  four  metal 
electrodes  shaped  like  hyperbolas.  (Yes,  physicists  do  sometimes  ask 
their  university  machine  shops  for  things  machined  in  mathematical 
shapes  like  this.  They  have  to  be  made  on  computer-controlled 
mills.)  We  assume  that  the  electrodes  extend  far  into  and  out  of 
the  page  along  the  unseen  z  axis,  so  that  by  symmetry,  the  electric 
fields  are  the  same  for  all  z.  The  problem  is  therefore  effectively 
two-dimensional.  Two  of  the  electrodes  are  at  voltage  +V0,  and 
the  other  two  at  — VQ ,  as  shown.  The  equations  of  the  hyperbolic 
surfaces  are  \xy\  =  b2,  where  b  is  a  constant.  (We  can  interpret  b  as 
giving  the  locations  x  =  ±b,  y  =  ±b  of  the  four  points  on  the  surfaces 
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that  are  closest  to  the  central  axis.)  There  is  no  obvious,  pedestrian 
way  to  determine  the  field  or  voltage  in  the  central  vacuum  region, 
but  there’s  a  trick  that  works:  with  a  little  mathematical  insight, 
we  see  that  the  voltage  V  =  V0b~2xy  is  consistent  with  all  the 
given  information.  (Mathematicians  could  prove  that  this  solution 
was  unique,  but  a  physicist  knows  it  on  physical  grounds:  if  there 
were  two  different  solutions,  there  would  be  no  physical  way  for 
the  system  to  decide  which  one  to  do!)  (a)  Use  the  techniques  of 
subsection  10.2.2  to  find  the  field  in  the  vacuum  region,  and  (b) 
sketch  the  field  as  a  “sea  of  arrows.”  V 

21  (a)  A  certain  region  of  three-dimensional  space  has  a  voltage 

that  varies  as  V  =  br2,  where  r  is  the  distance  from  the  origin.  Use 
the  techniques  of  subsection  10.2.2  to  find  the  field.  V 

(b)  Write  down  another  voltage  that  gives  exactly  the  same  held. 

■ 


Problem  23. 


22  (a)  Example  11  on  page  579  gives  the  held  of  a  charged  rod  in 

its  midplane.  Starting  from  this  result,  take  the  limit  as  the  length 
of  the  rod  approaches  inhnity.  Note  that  A  is  not  changing,  so  as  L 
gets  bigger,  the  total  charge  Q  increases.  >  Answer,  p.  970 

(b)  In  the  text,  I  have  shown  (by  several  different  methods)  that  the 
held  of  an  infinite,  uniformly  charged  plane  is  2irkcr.  Now  you’re 
going  to  rederive  the  same  result  by  a  different  method.  Suppose 
that  it  is  the  x  —  y  plane  that  is  charged,  and  we  want  to  hnd  the 
held  at  the  point  (0,0,2).  (Since  the  plane  is  infinite,  there  is  no 
loss  of  generality  in  assuming  x  =  0  and  y  =  0.)  Imagine  that  we 
slice  the  plane  into  an  infinite  number  of  straight  strips  parallel  to 
the  y  axis.  Each  strip  has  inhnitesimal  width  da:,  and  extends  from 
x  to  x  +  da:.  The  contribution  any  one  of  these  strips  to  the  held 
at  our  point  has  a  magnitude  which  can  be  found  from  part  a.  By 
vector  addition,  prove  the  desired  result  for  the  held  of  the  plane  of 
charge.  - 

23  Consider  the  electric  held  created  by  a  uniformly  charged 
cylindrical  surface  that  extends  to  inhnity  in  one  direction. 

(a)  Show  that  the  held  at  the  center  of  the  cylinder’s  mouth  is  2nka, 
which  happens  to  be  the  same  as  the  held  of  an  infinite  flat  sheet  of 
charge! 

(b)  This  expression  is  independent  of  the  radius  of  the  cylinder. 
Explain  why  this  should  be  so.  For  example,  what  would  happen  if 
you  doubled  the  cylinder’s  radius? 

24  In  an  electrical  storm,  the  cloud  and  the  ground  act  like  a 
parallel-plate  capacitor,  which  typically  charges  up  due  to  frictional 
electricity  in  collisions  of  ice  particles  in  the  cold  upper  atmosphere. 
Lightning  occurs  when  the  magnitude  of  the  electric  held  builds  up 
to  a  critical  value,  Ec,  at  which  air  is  ionized. 

(a)  Treat  the  cloud  as  a  hat  square  with  sides  of  length  L.  If  it  is  at 
a  height  h  above  the  ground,  hnd  the  amount  of  energy  released  in 
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the  lightning  strike.  V 

(b)  Based  on  your  answer  from  part  a,  which  is  more  dangerous,  a 
lightning  strike  from  a  high-altitude  cloud  or  a  low-altitude  one? 

(c)  Make  an  order-of-magnitude  estimate  of  the  energy  released  by 
a  typical  lightning  bolt,  assuming  reasonable  values  for  its  size  and 
altitude.  Ec  is  about  106  V/m. 

25  (a)  Show  that  the  energy  in  the  electric  field  of  a  point  charge 

is  infinite!  Does  the  integral  diverge  at  small  distances,  at  large  dis¬ 
tances,  or  both?  >  Hint,  p.  960 

(b)  Now  calculate  the  energy  in  the  electric  field  of  a  uniformly 
charged  sphere  with  radius  b.  Based  on  the  shell  theorem,  it  can 
be  shown  that  the  field  for  r  >  b  is  the  same  as  for  a  point  charge, 
while  the  field  for  r  <  b  is  kqr/b 3.  (Example  39  shows  this  using  a 
different  technique.) 


Remark:  The  calculation  in  part  a  seems  to  show  that  infinite  energy  would 
be  required  in  order  to  create  a  charged,  pointlike  particle.  However,  there  are 
processes  that,  for  example,  create  electron-positron  pairs,  and  these  processes 
don’t  require  infinite  energy.  According  to  Einstein’s  famous  equation  E  =  me2, 
the  energy  required  to  create  such  a  pair  should  only  be  2 me2,  which  is  finite. 
One  way  out  of  this  difficulty  is  to  assume  that  no  particle  is  really  pointlike,  and 
this  is  in  fact  the  main  motivation  behind  a  speculative  physical  theory  called 
string  theory,  which  posits  that  charged  particles  are  actually  tiny  loops,  not 
points.  V  B 

26  The  neuron  in  the  figure  has  been  drawn  fairly  short,  but 
some  neurons  in  your  spinal  cord  have  tails  (axons)  up  to  a  meter 
long.  The  inner  and  outer  surfaces  of  the  membrane  act  as  the 
“plates”  of  a  capacitor.  (The  fact  that  it  has  been  rolled  up  into  a 
cylinder  has  very  little  effect.)  In  order  to  function,  the  neuron  must 
create  a  voltage  difference  V  between  the  inner  and  outer  surfaces 
of  the  membrane.  Let  the  membrane’s  thickness,  radius,  and  length 
be  t,  r,  and  L.  (a)  Calculate  the  energy  that  must  be  stored  in 
the  electric  field  for  the  neuron  to  do  its  job.  (In  real  life,  the 
membrane  is  made  out  of  a  substance  called  a  dielectric,  whose 
electrical  properties  increase  the  amount  of  energy  that  must  be 
stored.  For  the  sake  of  this  analysis,  ignore  this  fact.) 

o  Hint,  p.  960  V 

(b)  An  organism’s  evolutionary  fitness  should  be  better  if  it  needs 
less  energy  to  operate  its  nervous  system.  Based  on  your  answer  to 
part  a,  what  would  you  expect  evolution  to  do  to  the  dimensions  t 
and  r?  What  other  constraints  would  keep  these  evolutionary  trends 
from  going  too  far?  ■ 

27  The  figure  shows  cross-sectional  views  of  two  cubical  ca¬ 
pacitors,  and  a  cross-sectional  view  of  the  same  two  capacitors  put 
together  so  that  their  interiors  coincide.  A  capacitor  with  the  plates 
close  together  has  a  nearly  uniform  electric  field  between  the  plates, 
and  almost  zero  field  outside;  these  capacitors  don’t  have  their  plates 
very  close  together  compared  to  the  dimensions  of  the  plates,  but 


Problem  26. 
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for  the  purposes  of  this  problem,  assume  that  they  still  have  ap¬ 
proximately  the  kind  of  idealized  field  pattern  shown  in  the  figure. 
Each  capacitor  has  an  interior  volume  of  1.00  m3,  and  is  charged  up 
to  the  point  where  its  internal  field  is  1.00  V/m. 

(a)  Calculate  the  energy  stored  in  the  electric  field  of  each  capacitor 

when  they  are  separate.  V 

(b)  Calculate  the  magnitude  of  the  interior  field  when  the  two  ca¬ 

pacitors  are  put  together  in  the  manner  shown.  Ignore  effects  arising 
from  the  redistribution  of  each  capacitor’s  charge  under  the  influ¬ 
ence  of  the  other  capacitor.  V 

(c)  Calculate  the  energy  of  the  put-together  configuration.  Does  as¬ 
sembling  them  like  this  release  energy,  consume  energy,  or  neither? 

V  I 

28  Find  the  capacitance  of  the  surface  of  the  earth,  assuming 

there  is  an  outer  spherical  “plate”  at  infinity.  (In  reality,  this  outer 
plate  would  just  represent  some  distant  part  of  the  universe  to  which 
we  carried  away  some  of  the  earth’s  charge  in  order  to  charge  up  the 
earth.)  V 

29  (a)  Show  that  the  field  found  in  example  11  on  page  579 
reduces  to  E  =  2k\/R  in  the  limit  of  L  — >  oo. 

(b)  An  infinite  strip  of  width  b  has  a  surface  charge  density  a.  Find 

the  field  at  a  point  at  a  distance  2  from  the  strip,  lying  in  the  plane 
perpendicularly  bisecting  the  strip.  V 

(c)  Show  that  this  expression  has  the  correct  behavior  in  the  limit 

where  z  approaches  zero,  and  also  in  the  limit  of  z  3>  b.  For  the 
latter,  you’ll  need  the  result  of  problem  22a,  which  is  given  on  page 
970.  ■ 

30  A  solid  cylinder  of  radius  b  and  length  t  is  uniformly  charged 

with  a  total  charge  Q.  Find  the  electric  field  at  a  point  at  the  center 
of  one  of  the  flat  ends.  ■ 

31  Find  the  voltage  at  the  edge  of  a  uniformly  charged  disk. 
(Define  V  =  0  to  be  infinitely  far  from  the  disk.) 

V  >  Hint,  p.  960  B 

32  Find  the  energy  stored  in  a  capacitor  in  terms  of  its  capaci¬ 
tance  and  the  voltage  difference  across  it.  V 

33  (a)  Find  the  capacitance  of  two  identical  capacitors  in  series, 
(b)  Based  on  this,  how  would  you  expect  the  capacitance  of  a 
parallel-plate  capacitor  to  depend  on  the  distance  between  the  plates? 


34  (a)  Use  complex  number  techniques  to  rewrite  the  function 

f(t)  =  4  sin  cut  +  3  cos  cut  in  the  form  Asin(wf  +  5).  V 

(b)  Verify  the  result  using  the  trigonometric  identity  sin(a  +  /3)  = 
sin  a  cos  /3  +  sin  /3  cos  a.  - 

35  (a)  Show  that  the  equation  Vl  =  Ldl/dt  has  the  right 
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units. 

(b)  Verify  that  RC  has  units  of  time. 

(c)  Verify  that  L/ R  has  units  of  time. 

36  Find  the  inductance  of  two  identical  inductors  in  parallel. 

■ 

37  Calculate  the  quantity  i1  (i.e.,  find  its  real  and  imaginary 

parts).  C  - 

38  The  wires  themselves  in  a  circuit  can  have  resistance,  induc¬ 
tance,  and  capacitance.  Would  “stray”  inductance  and  capacitance 
be  most  important  for  low-frequency  or  for  high-frequency  circuits? 
For  simplicity,  assume  that  the  wires  act  like  they’re  in  series  with 
an  inductor  or  capacitor. 

39  Starting  from  the  relation  V  =  Ldl/  dt  for  the  voltage  dif¬ 
ference  across  an  inductor,  show  that  an  inductor  has  an  impedance 
equal  to  Lu. 

40  A  rectangular  box  is  uniformly  charged  with  a  charge  density 

p.  The  box  is  extremely  long  and  skinny,  and  its  cross-section  is  a 
square  with  sides  of  length  b.  The  length  is  so  great  in  comparison 
to  b  that  we  can  consider  it  as  being  infinite.  Find  the  electric  field 
at  a  point  lying  on  the  box’s  surface,  at  the  midpoint  between  the 
two  edges.  Your  answer  will  involve  an  integral  that  is  most  easily 
done  using  computer  software.  H 

41  A  hollow  cylindrical  pipe  has  length  ^  and  radius  b.  Its 
ends  are  open,  but  on  the  curved  surface  it  has  a  charge  density 
a.  A  charge  q  with  mass  m  is  released  at  the  center  of  the  pipe, 
in  unstable  equilibrium.  Because  the  equilibrium  is  unstable,  the 
particle  acclerates  off  in  one  direction  or  the  other,  along  the  axis 
of  the  pipe,  and  comes  shooting  out  like  a  bullet  from  the  barrel  of 
a  gun.  Find  the  velocity  of  the  particle  when  it’s  infinitely  far  from 
the  “gun.”  Your  answer  will  involve  an  integral  that  is  difficult  to 
do  by  hand;  you  may  want  to  look  it  up  in  a  table  of  integrals,  do 
it  online  at  integrals.com,  or  download  and  install  the  free  Maxima 
symbolic  math  software  from  maxima.sourceforge.net. 

42  If  an  FM  radio  tuner  consisting  of  an  LRC  circuit  contains 

a  1.0  /rH  inductor,  what  range  of  capacitances  should  the  variable 
capacitor  be  able  to  provide?  V 

43  (a)  Find  the  parallel  impedance  of  a  37  kll  resistor  and  a  1.0 

nF  capacitor  at  /  =  1.0  x  104  Hz.  V 

(b)  A  voltage  with  an  amplitude  of  1.0  mV  drives  this  impedance 
at  this  frequency.  What  is  the  amplitude  of  the  current  drawn  from 
the  voltage  source,  what  is  the  current’s  phase  angle  with  respect  to 
the  voltage,  and  does  it  lead  the  voltage,  or  lag  behind  it?  V  - 

44  A  series  LRC  circuit  consists  of  a  1.000  H  resistor,  a  1.000  F 
capacitor,  and  a  1.000  H  inductor.  (These  are  not  particularly  easy 
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values  to  find  on  the  shelf  at  Radio  Shack!) 

(a)  Plot  its  impedance  as  a  point  in  the  complex  plane  for  each  of 
the  following  frequencies:  w=0.250,  0.500,  1.000,  2.000,  and  4.000 
Hz. 

(b)  What  is  the  resonant  angular  frequency,  ujresi  and  how  does  this 

relate  to  your  plot?  V 

(c)  What  is  the  resonant  frequency  fres  corresponding  to  your  an¬ 
swer  in  part  b?  V  ^ 

45  At  a  frequency  oj,  a  certain  series  LR  circuit  has  an  impedance 

of  1  D  +  (2  H)z.  Suppose  that  instead  we  want  to  achieve  the  same 
impedance  using  two  circuit  elements  in  parallel.  What  must  the 
elements  be?  - 

46  (a)  Use  Gauss’  law  to  find  the  fields  inside  and  outside  an 

infinite  cylindrical  surface  with  radius  b  and  uniform  surface  charge 
density  a .  V 

(b)  Show  that  there  is  a  discontinuity  in  the  electric  field  equal  to 
4irka  between  one  side  of  the  surface  and  the  other,  as  there  should 
be  (see  page  633). 

(c)  Reexpress  your  result  in  terms  of  the  charge  per  unit  length,  and 
compare  with  the  field  of  a  line  of  charge. 

(d)  A  coaxial  cable  has  two  conductors:  a  central  conductor  of  radius 

a,  and  an  outer  conductor  of  radius  b.  These  two  conductors  are 
separated  by  an  insulator.  Although  such  a  cable  is  normally  used 
for  time- varying  signals,  assume  throughout  this  problem  that  there 
is  simply  a  DC  voltage  between  the  two  conductors.  The  outer 
conductor  is  thin,  as  in  part  c.  The  inner  conductor  is  solid,  but, 
as  is  always  the  case  with  a  conductor  in  electrostatics,  the  charge 
is  concentrated  on  the  surface.  Thus,  you  can  find  all  the  fields  in 
part  b  by  superposing  the  fields  due  to  each  conductor,  as  found  in 
part  c.  (Note  that  on  a  given  length  of  the  cable,  the  total  charge  of 
the  inner  and  outer  conductors  is  zero,  so  Ai  =  —X2,  but  a\  /  cr2, 
since  the  areas  are  unequal.)  Find  the  capacitance  per  unit  length 
of  such  a  cable.  V  ■ 

47  In  a  certain  region  of  space,  the  electric  field  is  constant 
(i.e. ,  the  vector  always  has  the  same  magnitude  and  direction).  For 
simplicity,  assume  that  the  field  points  in  the  positive  x  direction. 

(a)  Use  Gauss’s  law  to  prove  that  there  is  no  charge  in  this  region 
of  space.  This  is  most  easily  done  by  considering  a  Gaussian  surface 
consisting  of  a  rectangular  box,  whose  edges  are  parallel  to  the  x, 
y ,  and  2:  axes. 

(b)  If  there  are  no  charges  in  this  region  of  space,  what  could  be 

making  this  electric  field?  ■ 

48  (a)  In  a  series  LC  circuit  driven  by  a  DC  voltage  ( ui  =  0), 
compare  the  energy  stored  in  the  inductor  to  the  energy  stored  in 
the  capacitor. 

(b)  Carry  out  the  same  comparison  for  an  LC  circuit  that  is  oscil- 
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lating  freely  (without  any  driving  voltage). 

(c)  Now  consider  the  general  case  of  a  series  LC  circuit  driven  by 
an  oscillating  voltage  at  an  arbitrary  frequency.  Let  Ul  and  be  the 
average  energy  stored  in  the  inductor,  and  similarly  for  Uc-  Define 
a  quantity  u  =  Uc/{Ul  +  Uc),  which  can  be  interpreted  as  the  ca¬ 
pacitor’s  average  share  of  the  energy,  while  1  —  u  is  the  inductor’s 
average  share.  Find  u  in  terms  of  L,  C ,  and  w,  and  sketch  a  graph 
of  u  and  1  —  u  versus  u.  What  happens  at  resonance?  Make  sure 
your  result  is  consistent  with  your  answer  to  part  a.  v 

■ 

49  Use  Gauss’  law  to  find  the  field  inside  an  infinite  cylinder 

with  radius  b  and  uniform  charge  density  p.  (The  external  field  has 
the  same  form  as  the  one  in  problem  46.)  V  ^ 

50  (a)  In  a  certain  region  of  space,  the  electric  field  is  given 
by  E  =  6xx,  where  b  is  a  constant.  Find  the  amount  of  charge 
contained  within  a  cubical  volume  extending  from  x  =  0  to  x  =  a, 
from  y  =  0  to  y  =  a,  and  from  z  =  0  to  z  =  a. 

(b)  Repeat  for  E  =  bx z. 

(c)  Repeat  for  E  =  136zz  —  7czy. 

(d)  Repeat  for  E  =  bxzz.  - 

51  Light  is  a  wave  made  of  electric  and  magnetic  fields,  and  the 
fields  are  perpendicular  to  the  direction  of  the  wave’s  motion,  i.e. , 
they’re  transverse.  An  example  would  be  the  electric  field  given  by 
E  =  6x  sin  cz,  where  b  and  c  are  constants.  (There  would  also  be  an 
associated  magnetic  field.)  We  observe  that  light  can  travel  through 
a  vacuum,  so  we  expect  that  this  wave  pattern  is  consistent  with  the 
nonexistence  of  any  charge  in  the  space  it’s  currently  occupying.  Use 
Gauss’s  law  to  prove  that  this  is  true. 

52  This  is  an  alternative  approach  to  problem  49,  using  a  dif¬ 
ferent  technique.  Suppose  that  a  long  cylinder  contains  a  uniform 
charge  density  p  throughout  its  interior  volume. 

(a)  Use  the  methods  of  section  10.7  to  find  the  electric  field  inside 

the  cylinder.  V 

(b)  Extend  your  solution  to  the  outside  region,  using  the  same  tech¬ 
nique.  Once  you  find  the  general  form  of  the  solution,  adjust  it  so 
that  the  inside  and  outside  fields  match  up  at  the  surface.  U  - 

53  The  purpose  of  this  homework  problem  is  to  prove  that 
the  divergence  is  invariant  with  respect  to  translations.  That  is,  it 
doesn’t  matter  where  you  choose  to  put  the  origin  of  your  coordinate 
system.  Suppose  we  have  a  field  of  the  form  E  =  axx  +  byy  +  czz. 
This  is  the  most  general  field  we  need  to  consider  in  any  small  region 
as  far  as  the  divergence  is  concerned.  (The  dependence  on  x,  y,  and 
z  is  linear,  but  any  smooth  function  looks  linear  close  up.  We  also 
don’t  need  to  put  in  terms  like  xy,  because  they  don’t  contribute  to 
the  divergence.)  Define  a  new  set  of  coordinates  ( u,v,w )  related  to 
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{x,y,z)  by 


x  =  u  +  p 
y  =  v  +  q 
z  =  w  +  r, 

where  p.  q,  and  r  are  constants.  Show  that  the  field’s  divergence  is 
the  same  in  these  new  coordinates.  Note  that  x  and  u  are  identical, 
and  similarly  for  the  other  coordinates.  - 

54  Using  a  techniques  similar  to  that  of  problem  53,  show  that 
the  divergence  is  rotationally  invariant,  in  the  special  case  of  ro¬ 
tations  about  the  z  axis.  In  such  a  rotation,  we  rotate  to  a  new 
(tt,  v,  z )  coordinate  system,  whose  axes  are  rotated  by  an  angle  8 
with  respect  to  those  of  the  (x,  y,  z)  system.  The  coordinates  are 
related  by 


x  =  u  cos  6  +  v  sin  8 
y  =  —u  sin  8  +  v  cos  8 

Find  how  the  u  and  v  components  the  field  E  depend  on  u  and 
v,  and  show  that  its  divergence  is  the  same  in  this  new  coordinate 
system.  lu 

55  An  electric  held  is  given  in  cylindrical  coordinates  (R,  4>,  z) 
by  Er  =  ce~u^R~l  cos2  (j),  where  the  notation  Er  indicates  the 
component  of  the  Held  pointing  directly  away  from  the  axis,  and 
the  components  in  the  other  directions  are  zero.  (This  isn’t  a  com¬ 
pletely  impossible  expression  for  the  Held  near  a  radio  transmitting 
antenna.)  (a)  Find  the  total  charge  enclosed  within  the  infinitely 
long  cylinder  extending  from  the  axis  out  to  R  =  b.  (b)  Interpret 
the  R-dependence  of  your  answer  to  part  a. 

56  Use  Euler’s  theorem  to  derive  the  addition  theorems  that 

express  sin(a  +  b )  and  cos(a  +  b)  in  terms  of  the  sines  and  cosines  of 
a  and  b.  >  Solution,  p.  980  ■ 

57  Find  every  complex  number  z  such  that  z3  =  1. 

>  Solution,  p.  980  - 

58  Factor  the  expression  x3  —  y3  into  factors  of  the  lowest 
possible  order,  using  complex  coefficients.  (Hint:  use  the  result  of 
problem  57.)  Then  do  the  same  using  real  coefficients. 

59  A  dipole  consists  of  two  point  charges  lying  on  the  x  axis,  a 

charge  —  q  at  the  origin,  and  a  +q  at  x  =  t.  The  dipole  is  immersed 
in  an  externally  imposed,  nonuniform  electric  field  with  Ex  =  bx, 
where  b  is  a  constant.  Add  the  forces  acting  on  the  dipole.  Verify 
that  the  total  force  depends  only  on  the  dipole  moment,  not  on  q  or 
£  individually,  and  that  the  result  is  the  same  as  the  one  found  by 
a  fancier  method  in  example  6  on  p.  573.  >  Solution,  p.  960  - 
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Key  to  symbols: 

easy  -  typical  £  challenging  HI  difficult  ■  very  difficult 
V  An  answer  check  is  available  at  wwwdightandmatter.com. 
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Exercises 

Exercise  10A:  Field  Vectors 

Apparatus: 

3  solenoids 
DC  power  supply 
compass 
ruler 

cut-off  plastic  cup 

At  this  point  you’ve  studied  the  gravitational  field,  g,  and  the  electric  field,  E,  but  not  the 
magnetic  field,  B.  However,  they  all  have  some  of  the  same  mathematical  behavior:  they  act 
like  vectors.  Furthermore,  magnetic  fields  are  the  easiest  to  manipulate  in  the  lab.  Manipulating 
gravitational  fields  directly  would  require  futuristic  technology  capable  of  moving  planet-sized 
masses  around!  Playing  with  electric  fields  is  not  as  ridiculously  difficult,  but  static  electric 
charges  tend  to  leak  off  through  your  body  to  ground,  and  static  electricity  effects  are  hard  to 
measure  numerically.  Magnetic  fields,  on  the  other  hand,  are  easy  to  make  and  control.  Any 
moving  charge,  i.e.,  any  current,  makes  a  magnetic  field. 

A  practical  device  for  making  a  strong  magnetic  field  is  simply  a  coil  of  wire,  formally  known 
as  a  solenoid.  The  field  pattern  surrounding  the  solenoid  gets  stronger  or  weaker  in  proportion 
to  the  amount  of  current  passing  through  the  wire. 

1.  With  a  single  solenoid  connected  to  the  power  supply  and  laid  with  its  axis  horizontal,  use  a 
magnetic  compass  to  explore  the  field  pattern  inside  and  outside  it.  The  compass  shows  you  the 
field  vector’s  direction,  but  not  its  magnitude,  at  any  point  you  choose.  Note  that  the  field  the 
compass  experiences  is  a  combination  (vector  sum)  of  the  solenoid’s  field  and  the  earth’s  field. 

2.  What  happens  when  you  bring  the  compass  extremely  far  away  from  the  solenoid? 


What  does  this  tell  you  about  the  way  the  solenoid’s  field  varies  with  distance? 


Thus  although  the  compass  doesn’t  tell  you  the  field  vector’s  magnitude  numerically,  you  can 
get  at  least  some  general  feel  for  how  it  depends  on  distance. 
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3.  The  figure  below  is  a  cross-section  of  the  solenoid  in  the  plane  containing  its  axis.  Make  a 
sea-of-arrows  sketch  of  the  magnetic  field  in  this  plane.  The  length  of  each  arrow  should  at  least 
approximately  reflect  the  strength  of  the  magnetic  field  at  that  point. 


Does  the  field  seem  to  have  sources  or  sinks? 

4.  What  do  you  think  would  happen  to  your  sketch  if  you  reversed  the  wires? 


Try  it. 
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5.  Now  hook  up  the  two  solenoids  in  parallel.  You  are  going  to  measure  what  happens  when 
their  two  fields  combine  at  a  certain  point  in  space.  As  you’ve  seen  already,  the  solenoids’ 
nearby  fields  are  much  stronger  than  the  earth’s  field;  so  although  we  now  theoretically  have 
three  fields  involved  (the  earth’s  plus  the  two  solenoids’),  it  will  be  safe  to  ignore  the  earth’s 
field.  The  basic  idea  here  is  to  place  the  solenoids  with  their  axes  at  some  angle  to  each  other, 
and  put  the  compass  at  the  intersection  of  their  axes,  so  that  it  is  the  same  distance  from  each 
solenoid.  Since  the  geometry  doesn’t  favor  either  solenoid,  the  only  factor  that  would  make  one 
solenoid  influence  the  compass  more  than  the  other  is  current.  You  can  use  the  cut-off  plastic 
cup  as  a  little  platform  to  bring  the  compass  up  to  the  same  level  as  the  solenoids’  axes. 

a) What  do  you  think  will  happen  with  the  solenoids’  axes  at  90  degrees  to  each  other,  and  equal 
currents?  Try  it.  Now  represent  the  vector  addition  of  the  two  magnetic  fields  with  a  diagram. 
Check  your  diagram  with  your  instructor  to  make  sure  you’re  on  the  right  track. 


b)  Now  try  to  make  a  similar  diagram  of  what  would  happen  if  you  switched  the  wires  on  one 
of  the  solenoids. 


After  predicting  what  the  compass  will  do,  try  it  and  see  if  you  were  right. 

c)Now  suppose  you  were  to  go  back  to  the  arrangement  you  had  in  part  a,  but  you  changed  one 
of  the  currents  to  half  its  former  value.  Make  a  vector  addition  diagram,  and  use  trig  to  predict 
the  angle. 


Try  it.  To  cut  the  current  to  one  of  the  solenoids  in  half,  an  easy  and  accurate  method  is 
simply  to  put  the  third  solenoid  in  series  with  it,  and  put  that  third  solenoid  so  far  away  that 
its  magnetic  field  doesn’t  have  any  significant  effect  on  the  compass. 
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Chapter  11 

Electromagnetism 


Think  not  that  I  am  come  to  destroy  the  law,  or  the  prophets:  I  am 
not  come  to  destroy,  but  to  fulfill.  Matthew  5:17 


1 1 .1  More  About  the  Magnetic  Field 

11.1.1  Magnetic  forces 

In  this  chapter,  I  assume  you  know  a  few  basic  ideas  about  Ein¬ 
stein’s  theory  of  relativity,  as  described  in  sections  7.1  and  7.2.  Un¬ 
less  your  typical  workday  involves  rocket  ships  or  particle  accelera¬ 
tors,  all  this  relativity  stuff  might  sound  like  a  description  of  some 
bizarre  futuristic  world  that  is  completely  hypothetical.  There  is, 
however,  a  relativistic  effect  that  occurs  in  everyday  life,  and  it  is 
obvious  and  dramatic:  magnetism.  Magnetism,  as  we  discussed 
previously,  is  an  interaction  between  a  moving  charge  and  another 
moving  charge,  as  opposed  to  electric  forces,  which  act  between  any 
pair  of  charges,  regardless  of  their  motion.  Relativistic  effects  are 
weak  for  speeds  that  are  small  compared  to  the  speed  of  light,  and 
the  average  speed  at  which  electrons  drift  through  a  wire  is  quite 
low  (centimeters  per  second,  typically),  so  how  can  relativity  be  be¬ 
hind  an  impressive  effect  like  a  car  being  lifted  by  an  electromagnet 
hanging  from  a  crane?  The  key  is  that  matter  is  almost  perfectly 
electrically  neutral,  and  electric  forces  therefore  cancel  out  almost 
perfectly.  Magnetic  forces  really  aren’t  very  strong,  but  electric 
forces  are  even  weaker. 

What  about  the  word  “relativity”  in  the  name  of  the  theory?  It 
would  seem  problematic  if  moving  charges  interact  differently  than 
stationary  charges,  since  motion  is  a  matter  of  opinion,  depending 
on  your  frame  of  reference.  Magnetism,  however,  comes  not  to  de¬ 
stroy  relativity  but  to  fulfill  it.  Magnetic  interactions  must  exist 
according  to  the  theory  of  relativity.  To  understand  how  this  can 
be,  consider  how  time  and  space  behave  in  relativity.  Observers 
in  different  frames  of  reference  disagree  about  the  lengths  of  mea¬ 
suring  sticks  and  the  speeds  of  clocks,  but  the  laws  of  physics  are 
valid  and  self-consistent  in  either  frame  of  reference.  Similarly,  ob¬ 
servers  in  different  frames  of  reference  disagree  about  what  electric 
and  magnetic  fields  and  forces  there  are,  but  they  agree  about  con¬ 
crete  physical  events.  For  instance,  figure  a/1  shows  two  particles, 
with  opposite  charges,  which  are  not  moving  at  a  particular  mo- 
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a  /  The  pair  of  charged  parti¬ 
cles,  as  seen  in  two  different 
frames  of  reference. 
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b  /  A  large  current  is  created 
by  shorting  across  the  leads  of 
the  battery.  The  moving  charges 
in  the  wire  attract  the  moving 
charges  in  the  electron  beam, 
causing  the  electrons  to  curve. 
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c  /  A  charged  particle  and  a 
current,  seen  in  two  different 
frames  of  reference.  The  second 
frame  is  moving  at  velocity  v 
with  respect  to  the  first  frame, 
so  all  the  velocities  have  v  sub¬ 
tracted  from  them.  (As  discussed 
in  the  main  text,  this  is  only 
approximately  correct.) 


ment  in  time.  An  observer  in  this  frame  of  reference  says  there  are 
electric  fields  around  the  particles,  and  predicts  that  as  time  goes 
on,  the  particles  will  begin  to  accelerate  towards  one  another,  even¬ 
tually  colliding.  A  different  observer,  a/2,  says  the  particles  are 
moving.  This  observer  also  predicts  that  the  particles  will  collide, 
but  explains  their  motion  in  terms  of  both  an  electric  field,  E,  and  a 
magnetic  field,  B.  As  we’ll  see  shortly,  the  magnetic  field  is  required 
in  order  to  maintain  consistency  between  the  predictions  made  in 
the  two  frames  of  reference. 

To  see  how  this  really  works  out,  we  need  to  find  a  nice  sim¬ 
ple  example  that  is  easy  to  calculate.  An  example  like  figure  a  is 
not  easy  to  handle,  because  in  the  second  frame  of  reference,  the 
moving  charges  create  fields  that  change  over  time  at  any  given  lo¬ 
cation.  Examples  like  figure  b  are  easier,  because  there  is  a  steady 
flow  of  charges,  and  all  the  fields  stay  the  same  over  time.1  What  is 
remarkable  about  this  demonstration  is  that  there  can  be  no  elec¬ 
tric  fields  acting  on  the  electron  beam  at  all,  since  the  total  charge 
density  throughout  the  wire  is  zero.  Unlike  figure  a/2,  figure  b  is 
purely  magnetic. 

To  see  why  this  must  occur  based  on  relativity,  we  make  the 
mathematically  idealized  model  shown  in  figure  c.  The  charge  by 
itself  is  like  one  of  the  electrons  in  the  vacuum  tube  beam  of  fig¬ 
ure  b,  and  a  pair  of  moving,  infinitely  long  line  charges  has  been 
substituted  for  the  wire.  The  electrons  in  a  real  wire  are  in  rapid 
thermal  motion,  and  the  current  is  created  only  by  a  slow  drift  su¬ 
perimposed  on  this  chaos.  A  second  deviation  from  reality  is  that 
in  the  real  experiment,  the  protons  are  at  rest  with  respect  to  the 
tabletop,  and  it  is  the  electrons  that  are  in  motion,  but  in  c/1  we 
have  the  positive  charges  moving  in  one  direction  and  the  negative 
ones  moving  the  other  way.  If  we  wanted  to,  we  could  construct  a 
third  frame  of  reference  in  which  the  positive  charges  were  at  rest, 
which  would  be  more  like  the  frame  of  reference  fixed  to  the  table- 
top  in  the  real  demonstration.  However,  as  we’ll  see  shortly,  frames 
c/1  and  c/2  are  designed  so  that  they  are  particularly  easy  to  ana¬ 
lyze.  It’s  important  to  note  that  even  though  the  two  line  charges 
are  moving  in  opposite  directions,  their  currents  don’t  cancel.  A 
negative  charge  moving  to  the  left  makes  a  current  that  goes  to  the 
right,  so  in  frame  c/1,  the  total  current  is  twice  that  contributed  by 
either  line  charge. 

Frame  1  is  easy  to  analyze  because  the  charge  densities  of  the 
two  line  charges  cancel  out,  and  the  electric  field  experienced  by  the 


1For  a  more  practical  demonstration  of  this  effect,  you  can  put  an  ordinary 
magnet  near  a  computer  monitor.  The  picture  will  be  distorted.  Make  sure  that 
the  monitor  has  a  demagnetizing  (“degaussing”)  button,  however!  Otherwise 
you  may  permanently  damage  it.  Don’t  use  a  television  tube,  because  TV  tubes 
don’t  have  demagnetizing  buttons. 
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lone  charge  is  therefore  zero: 


Ei  =  0 

In  frame  1,  any  force  experienced  by  the  lone  charge  must  therefore 
be  attributed  solely  to  magnetism. 

Frame  2  shows  what  we’d  see  if  we  were  observing  all  this  from 
a  frame  of  reference  moving  along  with  the  lone  charge.  Why  don’t 
the  charge  densities  also  cancel  in  this  frame?  Here’s  where  the 
relativity  comes  in.  Relativity  tells  us  that  moving  objects  appear 
contracted  to  an  observer  who  is  not  moving  along  with  them.  Both 
line  charges  are  in  motion  in  both  frames  of  reference,  but  in  frame  1, 
the  line  charges  were  moving  at  equal  speeds,  so  their  contractions 
were  equal,  and  their  charge  densities  canceled  out.  In  frame  2, 
however,  their  speeds  are  unequal.  The  positive  charges  are  moving 
more  slowly  than  in  frame  1,  so  in  frame  2  they  are  less  contracted. 
The  negative  charges  are  moving  more  quickly,  so  their  contraction 
is  greater  now.  Since  the  charge  densities  don’t  cancel,  there  is  an 
electric  field  in  frame  2,  which  points  into  the  wire,  attracting  the 
lone  charge.  Furthermore,  the  attraction  felt  by  the  lone  charge 
must  be  purely  electrical,  since  the  lone  charge  is  at  rest  in  this 
frame  of  reference,  and  magnetic  effects  occur  only  between  moving 
charges  and  other  moving  charges.2 

To  summarize,  frame  1  displays  a  purely  magnetic  attraction, 
while  in  frame  2  it  is  purely  electrical. 

Now  we  can  calculate  the  force  in  frame  2,  and  equating  it  to 
the  force  in  frame  1,  we  can  find  out  how  much  magnetic  force 
occurs.  To  keep  the  math  simple,  and  to  keep  from  assuming  too 
much  about  your  knowledge  of  relativity,  we’re  going  to  carry  out 
this  whole  calculation  in  the  approximation  where  all  the  speeds 
are  fairly  small  compared  to  the  speed  of  light.3  For  instance,  if 
we  find  an  expression  such  as  ( v/c )2  +  (u/c)  ,  we  will  assume  that 
the  fourth-order  term  is  negligible  by  comparison.  This  is  known  as 
a  calculation  “to  leading  order  in  u/c.”  In  fact,  I’ve  already  used 
the  leading-order  approximation  twice  without  saying  so!  The  first 

2  One  could  object  that  this  is  circular  reasoning,  since  the  whole  purpose  of 
this  argument  is  to  prove  from  first  principles  that  magnetic  effects  follow  from 
the  theory  of  relativity.  Could  there  be  some  extra  interaction  which  occurs 
between  a  moving  charge  and  any  other  charge,  regardless  of  whether  the  other 
charge  is  moving  or  not?  We  can  argue,  however,  that  such  a  theory  would  lack 
self-consistency,  since  we  have  to  define  the  electric  field  somehow,  and  the  only 
way  to  define  it  is  in  terms  of  F/q,  where  F  is  the  force  on  a  test  charge  q  which  is 
at  rest.  In  other  words,  we’d  have  to  say  that  there  was  some  extra  contribution 
to  the  electric  field  if  the  charge  making  it  was  in  motion.  This  would,  however, 
violate  Gauss’  law,  and  Gauss’  law  is  amply  supported  by  experiment,  even  when 
the  sources  of  the  electric  field  are  moving.  It  would  also  violate  the  time-reversal 
symmetry  of  the  laws  of  physics. 

3The  reader  who  wants  to  see  the  full  relativistic  treatment  is  referred  to 
E.M.  Purcell,  Electricity  and  Magnetism,  McGraw  Hill,  1985,  p.  174. 
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time  I  used  it  implicitly  was  in  figure  c,  where  I  assumed  that  the 
velocities  of  the  two  line  charges  were  u  —  v  and  —  u  —  v.  Relativistic 
velocities  don’t  just  combine  by  simple  addition  and  subtraction  like 
this,  but  this  is  an  effect  we  can  ignore  in  the  present  approximation. 
The  second  sleight  of  hand  occurred  when  I  stated  that  we  could 
equate  the  forces  in  the  two  frames  of  reference.  Force,  like  time 
and  distance,  is  distorted  relativistically  when  we  change  from  one 
frame  of  reference  to  another.  Again,  however,  this  is  an  effect  that 
we  can  ignore  to  the  desired  level  of  approximation. 


Let  ±A  be  the  charge  per  unit  length  of  each  line  charge  with¬ 
out  relativistic  contraction,  i.e.,  in  the  frame  moving  with  that  line 
charge.  Using  the  approximation  7  =  (1  —  v2/c2)_1//2  ~  1  +  v2 /2 c2 
for  v  <C  c,  the  total  charge  per  unit  length  in  frame  2  is 


Xtotal, 


A 


1  + 


(u  —  vY 
2  c2 


-A 


1  + 


(—u  —  vY 

2d2 


— 2A  uv 


Let  R  be  the  distance  from  the  line  charge  to  the  lone  charge.  Ap¬ 
plying  Gauss’  law  to  a  cylinder  of  radius  R  centered  on  the  line 
charge,  we  find  that  the  magnitude  of  the  electric  field  experienced 
by  the  lone  charge  in  frame  2  is 


AkXuv 

e  =  ^r' 

and  the  force  acting  on  the  lone  charge  q  is 


AkXquv 

In  frame  1,  the  current  is  I  =  2X\u  (see  homework  problem  5), 
which  we  can  approximate  as  I  =  2Xu,  since  the  current,  unlike 
Xtotai,  2,  doesn’t  vanish  completely  without  the  relativistic  effect. 
The  magnetic  force  on  the  lone  charge  q  due  to  the  current  I  is 

2  klqv 

f^^r~' 


Discussion  Question 

A  Resolve  the  following  paradox  concerning  the  argument  given  in  this 
section.  We  would  expect  that  at  any  given  time,  electrons  in  a  solid  would 
be  associated  with  protons  in  a  definite  way.  For  simplicity,  let’s  imagine 
that  the  solid  is  made  out  of  hydrogen  (which  actually  does  become  a 
metal  under  conditions  of  very  high  pressure).  A  hydrogen  atom  consists 
of  a  single  proton  and  a  single  electron.  Even  if  the  electrons  are  moving 
and  forming  an  electric  current,  we  would  imagine  that  this  would  be  like 
a  game  of  musical  chairs,  with  the  protons  as  chairs  and  the  electrons 
as  people.  Each  electron  has  a  proton  that  is  its  “friend,”  at  least  for  the 
moment.  This  is  the  situation  shown  in  figure  c/1.  How,  then,  can  an 
observer  in  a  different  frame  see  the  electrons  and  protons  as  not  being 
paired  up,  as  in  c/2? 
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1 1 .1 .2  The  magnetic  field 

Definition  in  terms  of  the  force  on  a  moving  particle 

With  electricity,  it  turned  out  to  be  useful  to  define  an  electric 
field  rather  than  always  working  in  terms  of  electric  forces.  Likewise, 
we  want  to  define  a  magnetic  field,  B.  Let’s  look  at  the  result  of 
the  preceding  subsection  for  insight.  The  equation 

2  klqv 

F  =  v?w 

shows  that  when  we  put  a  moving  charge  near  other  moving  charges, 
there  is  an  extra  magnetic  force  on  it,  in  addition  to  any  electric 
forces  that  may  exist.  Equations  for  electric  forces  always  have  a 
factor  of  k  in  front  —  the  Coulomb  constant  k  is  called  the  coupling 
constant  for  electric  forces.  Since  magnetic  effects  are  relativistic 
in  origin,  they  end  up  having  a  factor  of  k/c 2  instead  of  just  k. 
In  a  world  where  the  speed  of  light  was  infinite,  relativistic  effects, 
including  magnetism,  would  be  absent,  and  the  coupling  constant 
for  magnetism  would  be  zero.  A  cute  feature  of  the  metric  system  is 
that  we  have  k/c :2  =  10~7  N-s2/C2  exactly,  as  a  matter  of  definition. 

Naively,  we  could  try  to  work  by  analogy  with  the  electric  field, 
and  define  the  magnetic  field  as  the  magnetic  force  per  unit  charge. 
However,  if  we  think  of  the  lone  charge  in  our  example  as  the  test 
charge,  we’ll  find  that  this  approach  fails,  because  the  force  depends 
not  just  on  the  test  particle’s  charge,  but  on  its  velocity,  v,  as  well. 
Although  we  only  carried  out  calculations  for  the  case  where  the 
particle  was  moving  parallel  to  the  wire,  in  general  this  velocity  is 
a  vector,  v,  in  three  dimensions.  We  can  also  anticipate  that  the 
magnetic  field  will  be  a  vector.  The  electric  and  gravitational  fields 
are  vectors,  and  we  expect  intuitively  based  on  our  experience  with 
magnetic  compasses  that  a  magnetic  field  has  a  particular  direction 
in  space.  Furthermore,  reversing  the  current  I  in  our  example  would 
have  reversed  the  force,  which  would  only  make  sense  if  the  magnetic 
field  had  a  direction  in  space  that  could  be  reversed.  Summarizing, 
we  think  there  must  be  a  magnetic  field  vector  B,  and  the  force 
on  a  test  particle  moving  through  a  magnetic  field  is  proportional 
both  to  the  B  vector  and  to  the  particle’s  own  v  vector.  In  other 
words,  the  magnetic  force  vector  F  is  found  by  some  sort  of  vector 
multiplication  of  the  vectors  v  and  B.  As  proved  on  page  952, 
however,  there  is  only  one  physically  useful  way  of  defining  such  a 
multiplication,  which  is  the  cross  product. 

We  therefore  define  the  magnetic  field  vector,  B,  as  the  vector  that 
determines  the  force  on  a  charged  particle  according  to  the  following 
rule: 

F  =  gv  x  B  [definition  of  the  magnetic  field] 


From  this  definition,  we  see  that  the  magnetic  field’s  units  are 


F 


d/The  right-hand  relation¬ 
ship  between  the  velocity  of  a 
positively  charged  particle,  the 
magnetic  field  through  which  it  is 
moving,  and  the  magnetic  force 
on  it. 


e/The  unit  of  magnetic  field, 
the  tesla,  is  named  after  Serbian- 
American  inventor  Nikola  Tesla. 
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f  /  A  standard  dipole  made 
from  a  square  loop  of  wire  short¬ 
ing  across  a  battery.  It  acts  very 
much  like  a  bar  magnet,  but  its 
strength  is  more  easily  quantified. 


g/A  dipole  tends  to  align  it¬ 
self  to  the  surrounding  magnetic 
field. 


direction  of  m 
and  A  vectors 


h  /  The  m  and  A  vectors. 


N  •  s/C  •  m,  which  are  usually  abbreviated  as  teslas,  1  T  =  1  N  • 
s/C-rn.  The  definition  implies  a  right-hand-rule  relationship  among 
the  vectors,  figure  d,  if  the  charge  q  is  positive,  and  the  opposite 
handedness  if  it  is  negative. 

This  is  not  just  a  definition  but  a  bold  prediction!  Is  it  really 
true  that  for  any  point  in  space,  we  can  always  find  a  vector  B  that 
successfully  predicts  the  force  on  any  passing  particle,  regardless 
of  its  charge  and  velocity  vector?  Yes  —  it’s  not  obvious  that  it 
can  be  done,  but  experiments  verify  that  it  can.  How?  Well  for 
example,  the  cross  product  of  parallel  vectors  is  zero,  so  we  can  try 
particles  moving  in  various  directions,  and  hunt  for  the  direction 
that  produces  zero  force;  the  B  vector  lies  along  that  line,  in  either 
the  same  direction  the  particle  was  moving,  or  the  opposite  one.  We 
can  then  go  back  to  our  data  from  one  of  the  other  cases,  where  the 
force  was  nonzero,  and  use  it  to  choose  between  these  two  directions 
and  find  the  magnitude  of  the  B  vector.  We  could  then  verify  that 
this  vector  gave  correct  force  predictions  in  a  variety  of  other  cases. 

Even  with  this  empirical  reassurance,  the  meaning  of  this  equa¬ 
tion  is  not  intuitively  transparent,  nor  is  it  practical  in  most  cases 
to  measure  a  magnetic  field  this  way.  For  these  reasons,  let’s  look 
at  an  alternative  method  of  defining  the  magnetic  field  which,  al¬ 
though  not  as  fundamental  or  mathematically  simple,  may  be  more 
appealing. 

Definition  in  terms  of  the  torque  on  a  dipole 

A  compass  needle  in  a  magnetic  field  experiences  a  torque  which 
tends  to  align  it  with  the  field.  This  is  just  like  the  behavior  of  an 
electric  dipole  in  an  electric  field,  so  we  consider  the  compass  needle 
to  be  a  magnetic  dipole.  In  subsection  10.1.3  on  page  572,  we  gave 
an  alternative  definition  of  the  electric  field  in  terms  of  the  torque 
on  an  electric  dipole. 

To  define  the  strength  of  a  magnetic  field,  however,  we  need 
some  way  of  defining  the  strength  of  a  test  dipole,  i.e.,  we  need  a 
definition  of  the  magnetic  dipole  moment.  We  could  use  an  iron 
permanent  magnet  constructed  according  to  certain  specifications, 
but  such  an  object  is  really  an  extremely  complex  system  consisting 
of  many  iron  atoms,  only  some  of  which  are  aligned  with  each  other. 
A  more  fundamental  standard  dipole  is  a  square  current  loop.  This 
could  be  little  resistive  circuit  consisting  of  a  square  of  wire  shorting 
across  a  battery,  f. 

Applying  F  =  v  x  B,  we  find  that  such  a  loop,  when  placed  in 
a  magnetic  field,  g,  experiences  a  torque  that  tends  to  align  plane 
so  that  its  interior  “face”  points  in  a  certain  direction.  Since  the 
loop  is  symmetric,  it  doesn’t  care  if  we  rotate  it  like  a  wheel  with¬ 
out  changing  the  plane  in  which  it  lies.  It  is  this  preferred  facing 
direction  that  we  will  end  up  using  as  our  alternative  definition  of 
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the  magnetic  field. 

If  the  loop  is  out  of  alignment  with  the  field,  the  torque  on  it 
is  proportional  to  the  amount  of  current,  and  also  to  the  interior 
area  of  the  loop.  The  proportionality  to  current  makes  sense,  since 
magnetic  forces  are  interactions  between  moving  charges,  and  cur¬ 
rent  is  a  measure  of  the  motion  of  charge.  The  proportionality  to 
the  loop’s  area  is  also  not  hard  to  understand,  because  increasing 
the  length  of  the  sides  of  the  square  increases  both  the  amount  of 
charge  contained  in  this  circular  “river”  and  the  amount  of  lever¬ 
age  supplied  for  making  torque.  Two  separate  physical  reasons  for 
a  proportionality  to  length  result  in  an  overall  proportionality  to 
length  squared,  which  is  the  same  as  the  area  of  the  loop.  For  these 
reasons,  we  define  the  magnetic  dipole  moment  of  a  square  current 
loop  as 


of  page 


i/The  torque  on  a  current 
loop  in  a  magnetic  field.  The 
current  comes  out  of  the  page, 
goes  across,  goes  back  into  the 
page,  and  then  back  across  the 
other  way  in  the  hidden  side  of 
the  loop. 


m  =  /A, 

where  the  direction  of  the  vectors  is  defined  as  shown  in  figure  h. 
We  can  now  give  an  alternative  definition  of  the  magnetic  held: 


The  magnetic  held  vector,  B,  at  any  location  in  space  is  defined  by 
observing  the  torque  exerted  on  a  magnetic  test  dipole  mj  consisting 
of  a  square  current  loop.  The  held’s  magnitude  is 


—  |  \  ■  ni 

|  |  sm  0 

where  6  is  the  angle  between  the  dipole  vector  and  the  held.  This 
is  equivalent  to  the  vector  cross  product  r  =  mf  x  B. 


Let’s  show  that  this  is  consistent  with  the  previous  definition, 
using  the  geometry  shown  in  figure  i.  The  velocity  vector  that  point 
in  and  out  of  the  page  are  shown  using  the  convention  defined  in 
hgure  j .  Let  the  mobile  charge  carriers  in  the  wire  have  linear  density 
A,  and  let  the  sides  of  the  loop  have  length  h.  so  that  we  have 
I  =  \v,  and  m  =  h2 Xv.  The  only  nonvanishing  torque  comes  from 
the  forces  on  the  left  and  right  sides.  The  currents  in  these  sides 
are  perpendicular  to  the  held,  so  the  magnitude  of  the  cross  product 
F  =  gv  X  B  is  simply  |F|  =  qvB.  The  torque  supplied  by  each  of 
these  forces  is  r  x  F,  where  the  lever  arm  r  has  length  A/2,  and 
makes  an  angle  0  with  respect  to  the  force  vector.  The  magnitude 
of  the  total  torque  acting  on  the  loop  is  therefore 


©  out  of  the  page 
©  into  the  page 

j  /  A  vector  coming  out  of  the 
page  is  shown  with  the  tip  of  an 
arrowhead.  A  vector  going  into 
the  page  is  represented  using  the 
tailfeathers  of  the  arrow. 
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k  /  Dipole  vectors  can  be  added. 


t\  =  2^|F|  sin 6 
=  h  qvB  sin#, 
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and  substituting  q  =  Xh  and  v  =  m/h2 A,  we  have 


I /An  irregular  loop  can  be 
broken  up  into  little  squares. 


m  /  The  magnetic  field  pat¬ 
tern  around  a  bar  magnet  is 
created  by  the  superposition  of 
the  dipole  fields  of  the  individual 
iron  atoms.  Roughly  speaking, 
it  looks  like  the  field  of  one  big 
dipole,  especially  farther  away 
from  the  magnet.  Closer  in, 
however,  you  can  see  a  hint  of 
the  magnet’s  rectangular  shape. 
The  picture  was  made  by  placing 
iron  filings  on  a  piece  of  paper, 
and  then  bringing  a  magnet  up 
underneath. 


r  =  h  Xh  -tttt-B  sin  6 
h/X 

=  mB  sin  9, 


which  is  consistent  with  the  second  definition  of  the  field. 

It  undoubtedly  seems  artificial  to  you  that  we  have  discussed 
dipoles  only  in  the  form  of  a  square  loop  of  current.  A  permanent 
magnet,  for  example,  is  made  out  of  atomic  dipoles,  and  atoms 
aren’t  square!  However,  it  turns  out  that  the  shape  doesn’t  matter. 
To  see  why  this  is  so,  consider  the  additive  property  of  areas  and 
dipole  moments,  shown  in  figure  k.  Each  of  the  square  dipoles  has 
a  dipole  moment  that  points  out  of  the  page.  When  they  are  placed 
side  by  side,  the  currents  in  the  adjoining  sides  cancel  out,  so  they 
are  equivalent  to  a  single  rectangular  loop  with  twice  the  area.  We 
can  break  down  any  irregular  shape  into  little  squares,  as  shown  in 
figure  1,  so  the  dipole  moment  of  any  planar  current  loop  can  be 
calculated  based  on  its  area,  regardless  of  its  shape. 

The  magnetic  dipole  moment  of  an  atom  example  1 

Let’s  make  an  order-of-magnitude  estimate  of  the  magnetic  dipole 
moment  of  an  atom.  A  hydrogen  atom  is  about  10~10  m  in  diam¬ 
eter,  and  the  electron  moves  at  speeds  of  about  1CT2c.  We  don’t 
know  the  shape  of  the  orbit,  and  indeed  it  turns  out  that  accord¬ 
ing  to  the  principles  of  quantum  mechanics,  the  electron  doesn’t 
even  have  a  well-defined  orbit,  but  if  we’re  brave,  we  can  still  es¬ 
timate  the  dipole  moment  using  the  cross-sectional  area  of  the 
atom,  which  will  be  on  the  order  of  (10“10  m)2  =  10_2°  m2.  The 
electron  is  a  single  particle,  not  a  steady  current,  but  again  we 
throw  caution  to  the  winds,  and  estimate  the  current  it  creates  as 
e/At,  where  At,  the  time  for  one  orbit,  can  be  estimated  by  divid¬ 
ing  the  size  of  the  atom  by  the  electron’s  velocity.  (This  is  only 
a  rough  estimate,  and  we  don’t  know  the  shape  of  the  orbit,  so  it 
would  be  silly,  for  instance,  to  bother  with  multiplying  the  diameter 
by  7t  based  on  our  intuitive  visualization  of  the  electron  as  moving 
around  the  circumference  of  a  circle.)  The  result  for  the  dipole 
moment  is  m  ~  10~23  A  m2. 

Should  we  be  impressed  with  how  small  this  dipole  moment  is, 
or  with  how  big  it  is,  considering  that  it’s  being  made  by  a  single 
atom?  Very  large  or  very  small  numbers  are  never  very  interest¬ 
ing  by  themselves.  To  get  a  feeling  for  what  they  mean,  we  need 
to  compare  them  to  something  else.  An  interesting  comparison 
here  is  to  think  in  terms  of  the  total  number  of  atoms  in  a  typical 
object,  which  might  be  on  the  order  of  1026  (Avogadro’s  number). 
Suppose  we  had  this  many  atoms,  with  their  moments  all  aligned. 
The  total  dipole  moment  would  be  on  the  order  of  103  Am2,  which 
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is  a  pretty  big  number.  To  get  a  dipole  moment  this  strong  using 
human-scale  devices,  we’d  have  to  send  a  thousand  amps  of  cur¬ 
rent  through  a  one-square  meter  loop  of  wire!  The  insight  to  be 
gained  here  is  that,  even  in  a  permanent  magnet,  we  must  not 
have  all  the  atoms  perfectly  aligned,  because  that  would  cause 
more  spectacular  magnetic  effects  than  we  really  observe.  Ap¬ 
parently,  nearly  all  the  atoms  in  such  a  magnet  are  oriented  ran¬ 
domly,  and  do  not  contribute  to  the  magnet’s  dipole  moment. 

Discussion  Questions 

A  The  physical  situation  shown  in  figure  c  on  page  654  was  analyzed 
entirely  in  terms  of  forces.  Now  let’s  go  back  and  think  about  it  in  terms  of 
fields.  The  charge  by  itself  up  above  the  wire  is  like  a  test  charge,  being 
used  to  determine  the  magnetic  and  electric  fields  created  by  the  wire. 

In  figures  c/1  and  c/2,  are  there  fields  that  are  purely  electric  or  purely 
magnetic?  Are  there  fields  that  are  a  mixture  of  E  and  B?  How  does  this 
compare  with  the  forces? 

B  Continuing  the  analysis  begun  in  discussion  question  A,  can  we 
come  up  with  a  scenario  involving  some  charged  particles  such  that  the 
fields  are  purely  magnetic  in  one  frame  of  reference  but  a  mixture  of  E  and 
B  in  another  frame?  How  about  an  example  where  the  fields  are  purely 
electric  in  one  frame,  but  mixed  in  another?  Or  an  example  where  the 
fields  are  purely  electric  in  one  frame,  but  purely  magnetic  in  another? 

1 1 .1 .3  Some  applications 

Magnetic  levitation  example  2 

In  figure  n,  a  small,  disk-shaped  permanent  magnet  is  stuck  on 
the  side  of  a  battery,  and  a  wire  is  clasped  loosely  around  the 
battery,  shorting  it.  A  large  current  flows  through  the  wire.  The 
electrons  moving  through  the  wire  feel  a  force  from  the  magnetic 
field  made  by  the  permanent  magnet,  and  this  force  levitates  the 
wire. 

From  the  photo,  it’s  possible  to  find  the  direction  of  the  magnetic 
field  made  by  the  permanent  magnet.  The  electrons  in  the  copper 
wire  are  negatively  charged,  so  they  flow  from  the  negative  (flat) 
terminal  of  the  battery  to  the  positive  terminal  (the  one  with  the 
bump,  in  front).  As  the  electrons  pass  by  the  permanent  magnet, 
we  can  imagine  that  they  would  experience  a  field  either  toward 
the  magnet,  or  away  from  it,  depending  on  which  way  the  magnet 
was  flipped  when  it  was  stuck  onto  the  battery.  By  the  right-hand 
rule  (figure  d  on  page  657),  the  field  must  be  toward  the  battery. 

Nervous-system  effects  during  an  MR!  scan  example  3 

During  an  MRI  scan  of  the  head,  the  patient’s  nervous  system 
is  exposed  to  intense  magnetic  fields,  and  there  are  ions  moving  n  /  Example  2. 
around  in  the  nerves.  The  resulting  forces  on  the  ions  can  cause 
symptoms  such  as  vertigo. 
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o  /  Magnetic  forces  cause  a 
beam  of  electrons  to  move  in  a 
circle. 


p  /  You  can’t  isolate  the  poles  of  a 
magnet  by  breaking  it  in  half. 


q/A  magnetic  dipole  is  made 
out  of  other  dipoles,  not  out  of 
monopoles. 


A  circular  orbit  example  4 

The  magnetic  force  is  always  perpendicular  to  the  motion  of  the 
particle,  so  it  can  never  do  any  work,  and  a  charged  particle  mov¬ 
ing  through  a  magnetic  field  does  not  experience  any  change  in 
its  kinetic  energy:  its  velocity  vector  can  change  its  direction,  but 
not  its  magnitude.  If  the  velocity  vector  is  initially  perpendicular 
to  the  field,  then  the  curve  of  its  motion  will  remain  in  the  plane 
perpendicular  to  the  field,  so  the  magnitude  of  the  magnetic  force 
on  it  will  stay  the  same.  When  an  object  experiences  a  force  with 
constant  magnitude,  which  is  always  perpendicular  to  the  direc¬ 
tion  of  its  motion,  the  result  is  that  it  travels  in  a  circle. 

Figure  o  shows  a  beam  of  electrons  in  a  spherical  vacuum  tube. 
In  the  top  photo,  the  beam  is  emitted  near  the  right  side  of  the 
tube,  and  travels  straight  up.  In  the  bottom  photo,  a  magnetic  field 
has  been  imposed  by  an  electromagnet  surrounding  the  vacuum 
tube;  the  ammeter  on  the  right  shows  that  the  current  through  the 
electromagnet  is  now  nonzero.  We  observe  that  the  beam  is  bent 
into  a  circle. 

self-check  A 

Infer  the  direction  of  the  magnetic  field.  Don’t  forget  that  the  beam  is 
made  of  electrons,  which  are  negatively  charged!  >  Answer,  p.  966 

Homework  problem  12  is  a  quantitative  analysis  of  circular  orbits. 

A  velocity  filter  example  5 

Suppose  you  see  the  electron  beam  in  figure  o,  and  you  want  to 
determine  how  fast  the  electrons  are  going.  You  certainly  can’t 
do  it  with  a  stopwatch!  Physicists  may  also  encounter  situations 
where  they  have  a  beam  of  unknown  charged  particles,  and  they 
don’t  even  know  their  charges.  This  happened,  for  instance,  when 
alpha  and  beta  radiation  were  discovered.  One  solution  to  this 
problem  relies  on  the  fact  that  the  force  experienced  by  a  charged 
particle  in  an  electric  field,  FE  =  q E,  is  independent  of  its  veloc¬ 
ity,  but  the  force  due  to  a  magnetic  field,  F  e  =  qvx  B,  isn’t.  One 
can  send  a  beam  of  charged  particles  through  a  space  containing 
both  an  electric  and  a  magnetic  field,  setting  up  the  fields  so  that 
the  two  forces  will  cancel  out  perfectly  for  a  certain  velocity.  Note 
that  since  both  forces  are  proportional  to  the  charge  of  the  parti¬ 
cles,  the  cancellation  is  independent  of  charge.  Such  a  velocity 
filter  can  be  used  either  to  determine  the  velocity  of  an  unknown 
beam  or  particles,  or  to  select  from  a  beam  of  particles  only  those 
having  velocities  within  a  certain  desired  range.  Homework  prob¬ 
lem  7  is  an  analysis  of  this  application. 

11.1.4  No  magnetic  monopoles 

If  you  could  play  with  a  handful  of  electric  dipoles  and  a  handful 

of  bar  magnets,  they  would  appear  very  similar.  For  instance,  a  pair 

of  bar  magnets  wants  to  align  themselves  head-to-tail,  and  a  pair  of 
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electric  dipoles  does  the  same  thing.  (It  is  unfortunately  not  that 
easy  to  make  a  permanent  electric  dipole  that  can  be  handled  like 
this,  since  the  charge  tends  to  leak.) 

You  would  eventually  notice  an  important  difference  between  the 
two  types  of  objects,  however.  The  electric  dipoles  can  be  broken 
apart  to  form  isolated  positive  charges  and  negative  charges.  The 
two-ended  device  can  be  broken  into  parts  that  are  not  two-ended. 
But  if  you  break  a  bar  magnet  in  half,  p,  you  will  find  that  you  have 
simply  made  two  smaller  two-ended  objects. 

The  reason  for  this  behavior  is  not  hard  to  divine  from  our  mi¬ 
croscopic  picture  of  permanent  iron  magnets.  An  electric  dipole  has 
extra  positive  “stuff”  concentrated  in  one  end  and  extra  negative  in 
the  other.  The  bar  magnet,  on  the  other  hand,  gets  its  magnetic 
properties  not  from  an  imbalance  of  magnetic  “stuff”  at  the  two 
ends  but  from  the  orientation  of  the  rotation  of  its  electrons.  One 
end  is  the  one  from  which  we  could  look  down  the  axis  and  see  the 
electrons  rotating  clockwise,  and  the  other  is  the  one  from  which 
they  would  appear  to  go  counterclockwise.  There  is  no  difference 
between  the  “stuff”  in  one  end  of  the  magnet  and  the  other,  q. 

Nobody  has  ever  succeeded  in  isolating  a  single  magnetic  pole. 
In  technical  language,  we  say  that  magnetic  monopoles  not  seem  to 
exist.  Electric  monopoles  do  exist  —  that’s  what  charges  are. 

Electric  and  magnetic  forces  seem  similar  in  many  ways.  Both 
act  at  a  distance,  both  can  be  either  attractive  or  repulsive,  and 
both  are  intimately  related  to  the  property  of  matter  called  charge. 
(Recall  that  magnetism  is  an  interaction  between  moving  charges.) 
Physicists’s  aesthetic  senses  have  been  offended  for  a  long  time  be¬ 
cause  this  seeming  symmetry  is  broken  by  the  existence  of  elec¬ 
tric  monopoles  and  the  absence  of  magnetic  ones.  Perhaps  some 
exotic  form  of  matter  exists,  composed  of  particles  that  are  mag¬ 
netic  monopoles.  If  such  particles  could  be  found  in  cosmic  rays 
or  moon  rocks,  it  would  be  evidence  that  the  apparent  asymmetry 
was  only  an  asymmetry  in  the  composition  of  the  universe,  not  in 
the  laws  of  physics.  For  these  admittedly  subjective  reasons,  there 
have  been  several  searches  for  magnetic  monopoles.  Experiments 
have  been  performed,  with  negative  results,  to  look  for  magnetic 
monopoles  embedded  in  ordinary  matter.  Soviet  physicists  in  the 
1960’s  made  exciting  claims  that  they  had  created  and  detected  mag¬ 
netic  monopoles  in  particle  accelerators,  but  there  was  no  success 
in  attempts  to  reproduce  the  results  there  or  at  other  accelerators. 
The  most  recent  search  for  magnetic  monopoles,  done  by  reanalyz¬ 
ing  data  from  the  search  for  the  top  quark  at  Fermilab,  turned  up 
no  candidates,  which  shows  that  either  monopoles  don’t  exist  in 
nature  or  they  are  extremely  massive  and  thus  hard  to  create  in 
accelerators. 

The  nonexistence  of  magnetic  monopoles  means  that  unlike  an 
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sources  or  sinks. 
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electric  field,  a  magnetic  one,  can  never  have  sources  or  sinks.  The 
magnetic  field  vectors  lead  in  paths  that  loop  back  on  themselves, 
without  ever  converging  or  diverging  at  a  point,  as  in  the  fields 
shown  in  figure  r.  Gauss’  law  for  magnetism  is  therefore  much  sim¬ 
pler  than  Gauss’  law  for  electric  fields: 

Bi  •  Ai  =  0 

The  magnetic  flux  through  any  closed  surface  is  zero. 

self-check  B 

Draw  a  Gaussian  surface  on  the  electric  dipole  field  of  figure  r  that  has 
nonzero  electric  flux  through  it,  and  then  draw  a  similar  surface  on  the 
magnetic  field  pattern.  What  happens?  >  Answer,  p.  966 

The  field  of  a  wire  example  6 

>  On  page  656,  we  showed  that  a  long,  straight  wire  carrying 
current  /  exerts  a  magnetic  force 

2klqv 

F~  W 

on  a  particle  with  charge  q  moving  parallel  to  the  wire  with  velocity 
v.  What,  then,  is  the  magnetic  field  of  the  wire? 

>  Comparing  the  equation  above  to  the  first  definition  of  the  mag¬ 
netic  field,  F  =  v  x  B,  it  appears  that  the  magnetic  field  is  one  that 
falls  off  like  1  /R,  where  R  is  the  distance  from  the  wire.  However, 
it’s  not  so  easy  to  determine  the  direction  of  the  field  vector.  There 
are  two  other  axes  along  which  the  particle  could  have  been  mov¬ 
ing,  and  the  brute-force  method  would  be  to  carry  out  relativistic 
calculations  for  these  cases  as  well.  Although  this  would  probably 
be  enough  information  to  determine  the  field,  we  don’t  want  to  do 
that  much  work. 

Instead,  let’s  consider  what  the  possibilities  are.  The  field  can’t 
be  parallel  to  the  wire,  because  a  cross  product  vanishes  when 
the  two  vectors  are  parallel,  and  yet  we  know  from  the  case  we 
analyzed  that  the  force  doesn’t  vanish  when  the  particle  is  moving 
parallel  to  the  wire.  The  other  two  possibilities  that  are  consistent 
with  the  symmetry  of  the  problem  are  shown  in  figure  s.  One  is 
like  a  bottle  brush,  and  the  other  is  like  a  spool  of  thread.  The 
bottle  brush  pattern,  however,  violates  Gauss’  law  for  magnetism. 
If  we  made  a  cylindrical  Gaussian  surface  with  its  axis  coinciding 
with  the  wire,  the  flux  through  it  would  not  be  zero.  We  there¬ 
fore  conclude  that  the  spool-of-thread  pattern  is  the  correct  one.4 

4Strictly  speaking,  there  is  a  hole  in  this  logic,  since  I’ve  only  ruled  out  a 
field  that  is  purely  along  one  of  these  three  perpendicular  directions.  What  if  it 
has  components  along  more  than  one  of  them?  A  little  more  work  is  required 
to  eliminate  these  mixed  possibilities.  For  example,  we  can  rule  out  a  field 
with  a  nonzero  component  parallel  to  the  wire  based  on  the  following  symmetry 
argument.  Suppose  a  charged  particle  is  moving  in  the  plane  of  the  page  directly 
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Since  the  particle  in  our  example  was  moving  perpendicular  to 
the  field,  we  have  \F\  =  jq||vj|6j,  so 


\q\W 

2k  I 
c2 R 


1 1 .1 .5  Symmetry  and  handedness 

Imagine  that  you  establish  radio  contact  with  an  alien  on  another 
planet.  Neither  of  you  even  knows  where  the  other  one’s  planet 
is,  and  you  aren’t  able  to  establish  any  landmarks  that  you  both 
recognize.  You  manage  to  learn  quite  a  bit  of  each  other’s  languages, 
but  you’re  stumped  when  you  try  to  establish  the  definitions  of  left 
and  right  (or,  equivalently,  clockwise  and  counterclockwise).  Is  there 
any  way  to  do  it? 


/  A 
\  V 


left-handed  definitions 


F 


\ 

i):::: 

/ 


right-handed  definitions 


t  /  Left-handed  and  right-handed  definitions. 


If  there  was  any  way  to  do  it  without  reference  to  external  land¬ 
marks,  then  it  would  imply  that  the  laws  of  physics  themselves  were 
asymmetric,  which  would  be  strange.  Why  should  they  distinguish 
left  from  right?  The  gravitational  field  pattern  surrounding  a  star 
or  planet  looks  the  same  in  a  mirror,  and  the  same  goes  for  electric 

toward  the  wire.  If  the  field  had  a  component  parallel  to  the  wire,  then  the 
particle  would  feel  a  force  into  or  out  of  the  page,  but  such  a  force  is  impossible 
based  on  symmetry,  since  the  whole  arrangement  is  symmetric  with  respect  to 
mirror-reflection  across  the  plane  of  the  page. 
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u  /  In  this  scene  from  Swan 
Lake,  the  choreography  has  a 
symmetry  with  respect  to  left  and 
right. 


v  /  C.S.  Wu 


fields.  However,  the  magnetic  field  patterns  shown  in  figure  s  seems 
to  violate  this  principle.  Could  you  use  these  patterns  to  explain  left 
and  right  to  the  alien?  No.  If  you  look  back  at  the  definition  of  the 
magnetic  field,  it  also  contains  a  reference  to  handedness:  the  di¬ 
rection  of  the  vector  cross  product.  The  aliens  might  have  reversed 
their  definition  of  the  magnetic  field,  in  which  case  their  drawings 
of  field  patterns  would  look  like  mirror  images  of  ours,  as  in  the  left 
panel  of  figure  t. 

Until  the  middle  of  the  twentieth  century,  physicists  assumed 
that  any  reasonable  set  of  physical  laws  would  have  to  have  this 
kind  of  symmetry  between  left  and  right.  An  asymmetry  would 
be  grotesque.  Whatever  their  aesthetic  feelings,  they  had  to  change 
their  opinions  about  reality  when  experiments  by  C.S.  Wu  et  al.  showed 
that  the  weak  nuclear  force  violates  right-left  symmetry!  It  is  still 
a  mystery  why  right-left  symmetry  is  observed  so  scrupulously  in 
general,  but  is  violated  by  one  particular  type  of  physical  process. 


666 


Chapter  1 1 


Electromagnetism 


11.2  Magnetic  Fields  by  Superposition 


/  A  \ 


1 1 .2.1  Superposition  of  straight  wires 


In  chapter  10,  one  of  the  most  important  goals  was  to  learn 
how  to  calculate  the  electric  field  for  a  given  charge  distribution. 
The  corresponding  problem  for  magnetism  would  be  to  calculate  the 
magnetic  field  arising  from  a  given  set  of  currents.  So  far,  however, 
we  only  know  how  to  calculate  the  magnetic  field  of  a  long,  straight 
wire, 


B  = 


2  kl 


with  the  geometry  shown  in  figure  a.  Whereas  a  charge  distribution 
can  be  broken  down  into  individual  point  charges,  most  currents 
cannot  be  broken  down  into  a  set  of  straight-line  currents.  Never¬ 
theless,  let’s  see  what  we  can  do  with  the  tools  that  we  have. 

A  ground  fault  interrupter  example  7 

Electric  current  in  your  home  is  supposed  to  flow  out  of  one  side 
of  the  outlet,  through  an  appliance,  and  back  into  the  wall  through 
the  other  side  of  the  outlet.  If  that’s  not  what  happens,  then  we 
have  a  problem  —  the  current  must  be  finding  its  way  to  ground 
through  some  other  path,  perhaps  through  someone’s  body.  If 
you  have  outlets  in  your  home  that  have  “test”  and  “reset”  but¬ 
tons  on  them,  they  have  a  safety  device  built  into  them  that  is 
meant  to  protect  you  in  this  situation.  The  ground  fault  interrupter 
(GFI)  shown  in  figure  b,  routes  the  outgoing  and  returning  cur¬ 
rents  through  two  wires  that  lie  very  close  together.  The  clockwise 
and  counterclockwise  fields  created  by  the  two  wires  combine  by 
vector  addition,  and  normally  cancel  out  almost  exactly.  However, 
if  current  is  not  coming  back  through  the  circuit,  a  magnetic  field 
is  produced.  The  doughnut-shaped  collar  detects  this  field  (us¬ 
ing  an  effect  called  induction,  to  be  discussed  in  section  11.5), 
and  sends  a  signal  to  a  logic  chip,  which  breaks  the  circuit  within 
about  25  milliseconds. 

An  example  with  vector  addition  example  8 

>  Two  long,  straight  wires  each  carry  current  /  parallel  to  the  y 
axis,  but  in  opposite  directions.  They  are  separated  by  a  gap 
2/7  in  the  x  direction.  Find  the  magnitude  and  direction  of  the 
magnetic  field  at  a  point  located  at  a  height  z  above  the  plane  of 
the  wires,  directly  above  the  center  line. 


-^i  i  i  i) . 
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a /The  magnetic  field  of  a 
long,  straight  wire. 


b/A  ground  fault  interrupter. 


perspective  view 


B1  B2 
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view  along  the  y  axis 


0  The  magnetic  fields  contributed  by  the  two  wires  add  like  vec-  c  /  Example  8. 
tors,  which  means  we  can  add  their  x  and  z  components.  The  x 
components  cancel  by  symmetry.  The  magnitudes  of  the  individ¬ 
ual  fields  are  equal, 


=  B2  = 
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so  the  total  field  in  the  z  direction  is 


Bz  =  2 


2  kl 
c*R 


sin  0, 


where  0  is  the  angle  the  field  vectors  make  above  the  x  axis.  The 
sine  of  this  angle  equals  h/R,  so 


4  klh 

(Putting  this  explicitly  in  terms  of  z  gives  the  less  attractive  form 
Bz  =  4klh/c2(h2  +  z2).) 

At  large  distances  from  the  wires,  the  individual  fields  are  mostly 
in  the  ±x  direction,  so  most  of  their  strength  cancels  out.  It’s  not 
surprising  that  the  fields  tend  to  cancel,  since  the  currents  are 
in  opposite  directions.  What’s  more  interesting  is  that  not  only 
is  the  field  weaker  than  the  field  of  one  wire,  it  also  falls  off  as 
R~2  rather  than  R _1 .  If  the  wires  were  right  on  top  of  each  other, 
their  currents  would  cancel  each  other  out,  and  the  field  would  be 
zero.  From  far  away,  the  wires  appear  to  be  almost  on  top  of  each 
other,  which  is  what  leads  to  the  more  drastic  R~2  dependence 
on  distance. 


self-check  C 

In  example  8,  what  is  the  field  right  between  the  wires,  at  z  =  0,  and 
how  does  this  simpler  result  follow  from  vector  addition?  >  Answer, 
p.  967 


An  alarming  infinity 

An  interesting  aspect  of  the  R~2  dependence  of  the  held  in  exam¬ 
ple  8  is  the  energy  of  the  held.  We’ve  already  established  on  p.  593 
that  the  energy  density  of  the  magnetic  held  must  be  proportional 
to  the  square  of  the  held  strength,  B 2,  the  same  as  for  the  gravi¬ 
tational  and  electric  helds.  Suppose  we  try  to  calculate  the  energy 
per  unit  length  stored  in  the  held  of  a  single  wire.  We  haven’t  yet 
found  the  proportionality  factor  that  goes  in  front  of  the  B2,  but 
that  doesn’t  matter,  because  the  energy  per  unit  length  turns  out 
to  be  infinite!  To  see  this,  we  can  construct  concentric  cylindrical 
shells  of  length  L,  with  each  shell  extending  from  R  to  R  +  d R. 
The  volume  of  the  shell  equals  its  circumference  times  its  thickness 
times  its  length,  dv  =  (2irR)(dR)(L)  =  2irLdR.  For  a  single  wire, 
we  have  B  ~  Rl,  so  the  energy  density  is  proportional  to  R~2, 
and  the  energy  contained  in  each  shell  varies  as  R~2  du  ~  R -1  dr. 
Integrating  this  gives  a  logarithm,  and  as  we  let  R  approach  infinity, 
we  get  the  logarithm  of  infinity,  which  is  infinite. 

Taken  at  face  value,  this  result  would  imply  that  electrical  cur¬ 
rents  could  never  exist,  since  establishing  one  would  require  an  in¬ 
finite  amount  of  energy  per  unit  length!  In  reality,  however,  we 
would  be  dealing  with  an  electric  circuit ,  which  would  be  more  like 
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the  two  wires  of  example  8:  current  goes  out  one  wire,  but  comes 
back  through  the  other.  Since  the  field  really  falls  off  as  R~2,  we 
have  an  energy  density  that  varies  as  i?  ,  which  does  not  give  in¬ 
finity  when  integrated  out  to  infinity.  (There  is  still  an  infinity  at 
R  =  0,  but  this  doesn’t  occur  for  a  real  wire,  which  has  a  finite 
diameter.) 

Still,  one  might  worry  about  the  physical  implications  of  the 
single-wire  result.  For  instance,  suppose  we  turn  on  an  electron 
gun,  like  the  one  in  a  TV  tube.  It  takes  perhaps  a  microsecond  for 
the  beam  to  progress  across  the  tube.  After  it  hits  the  other  side 
of  the  tube,  a  return  current  is  established,  but  at  least  for  the  first 
microsecond,  we  have  only  a  single  current,  not  two.  Do  we  have 
infinite  energy  in  the  resulting  magnetic  held?  No.  It  takes  time  for 
electric  and  magnetic  Held  disturbances  to  travel  outward  through 
space,  so  during  that  microsecond,  the  Held  spreads  only  to  some 
finite  value  of  R,  not  R  =  oo. 

This  reminds  us  of  an  important  fact  about  our  study  of  mag¬ 
netism  so  far:  we  have  only  been  considering  situations  where  the 
currents  and  magnetic  fields  are  constant  over  time.  The  equation 
B  =  2kI/c2R  was  derived  under  this  assumption.  This  equation  is 
only  valid  if  we  assume  the  current  has  been  established  and  flowing 
steadily  for  a  long  time,  and  if  we  are  talking  about  the  field  at 
a  point  in  space  at  which  the  field  has  been  established  for  a  long 
time.  The  generalization  to  time-varying  fields  is  nontrivial,  and 
qualitatively  new  effects  will  crop  up.  We  have  already  seen  one 
example  of  this  on  page  603,  where  we  inferred  that  an  inductor’s 
time-varying  magnetic  field  creates  an  electric  field  —  an  electric 
field  which  is  not  created  by  any  charges  anywhere.  Effects  like 
these  will  be  discussed  in  section  11.5 

A  sheet  of  current 

There  is  a  saying  that  in  computer  science,  there  are  only  three 
nice  numbers:  zero,  one,  and  however  many  you  please.  In  other 
words,  computer  software  shouldn’t  have  arbitrary  limitations  like 
a  maximum  of  16  open  files,  or  256  e-mail  messages  per  mailbox. 
When  superposing  the  fields  of  long,  straight  wires,  the  really  inter¬ 
esting  cases  are  one  wire,  two  wires,  and  infinitely  many  wires.  With 
an  infinite  number  of  wires,  each  carrying  an  infinitesimal  current, 
we  can  create  sheets  of  current,  as  in  figure  d.  Such  a  sheet  has  a 
certain  amount  of  current  per  unit  length,  which  we  notate  r/  (Greek 
letter  eta).  The  setup  is  similar  to  example  8,  except  that  all  the 
currents  are  in  the  same  direction,  and  instead  of  adding  up  two 
fields,  we  add  up  an  infinite  number  of  them  by  doing  an  integral. 


views  along  the  x  axis 
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the  current  per  unit  length,  r| 
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b  a 
a  point  off  the  sheet 


d  /  A  sheet  of  charge. 
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For  the  y  component,  we  have 


a  sheet  of  charge 
Ex  =  ±2nka 


-  I+  +  +  +  +  +  +  +  +1  -> 


a  sheet  of  current 

B||=  ±2jikr|/c2 


e  /  A  sheet  of  charge  and  a 
sheet  of  current. 


R  f2kdI  0 

By  =  I  — cos  9 
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2  ky 

~f 

2ky^ 


' -a 
rb 
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where  in  the  last  step  we  have  used  the  identity  tan-1(— x)  = 
—  tan-1  a;,  combined  with  the  relation  tan-1  b/z  +  tan^1  a/z  =  7, 
which  can  be  verified  with  a  little  geometry  and  trigonometry.  The 
calculation  of  Bz  is  left  as  an  exercise  (problem  23).  More  interest¬ 
ing  is  what  happens  underneath  the  sheet:  by  the  right-hand  rule, 
all  the  currents  make  rightward  contributions  to  the  field  there,  so 
By  abruptly  reverses  itself  as  we  pass  through  the  sheet. 

Close  to  the  sheet,  the  angle  7  approaches  7 r,  so  we  have 


By  — 


2irky 


Figure  e  shows  the  similarity  between  this  result  and  the  result  for 
a  sheet  of  charge.  In  one  case  the  sources  are  charges  and  the  held 
is  electric;  in  the  other  case  we  have  currents  and  magnetic  fields. 
In  both  cases  we  find  that  the  held  changes  suddenly  when  we  pass 
through  a  sheet  of  sources,  and  the  amount  of  this  change  doesn’t 
depend  on  the  size  of  the  sheet.  It  was  this  type  of  reasoning  that 
eventually  led  us  to  Gauss’  law  in  the  case  of  electricity,  and  in 
section  11.3  we  will  see  that  a  similar  approach  can  be  used  with 
magnetism.  The  difference  is  that,  whereas  Gauss’  law  involves  the 
flux,  a  measure  of  how  much  the  held  spreads  out,  the  corresponding 
law  for  magnetism  will  measure  how  much  the  held  curls. 

Is  it  just  dumb  luck  that  the  magnetic-field  case  came  out  so 
similar  to  the  electric  held  case?  Not  at  all.  We’ve  already  seen 
that  what  one  observer  perceives  as  an  electric  held,  another  ob¬ 
server  may  perceive  as  a  magnetic  held.  An  observer  hying  along 
above  a  charged  sheet  will  say  that  the  charges  are  in  motion,  and 
will  therefore  say  that  it  is  both  a  sheet  of  current  and  a  sheet  of 
charge.  Instead  of  a  pure  electric  held,  this  observer  will  experience 
a  combination  of  an  electric  held  and  a  magnetic  one.  (We  could 
also  construct  an  example  like  the  one  in  hgure  c  on  page  654,  in 
which  the  held  was  purely  magnetic.) 
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1 1 .2.2  Energy  in  the  magnetic  field 

In  section  10.4,  I’ve  already  argued  that  the  energy  density  of 
the  magnetic  field  must  be  proportional  to  |  B  | 2 ,  which  we  can  write 
as  B2  for  convenience.  To  pin  down  the  constant  of  proportionality, 
we  now  need  to  do  something  like  the  argument  on  page  587:  find 
one  example  where  we  can  calculate  the  mechanical  work  done  by 
the  magnetic  field,  and  equate  that  to  the  amount  of  energy  lost  by 
the  field  itself.  The  easiest  example  is  two  parallel  sheets  of  charge, 
with  their  currents  in  opposite  directions.  Homework  problem  53  is 
such  a  calculation,  which  gives  the  result 


d  Um  = 


8irk 


B2  dm 


1 1 .2.3  Superposition  of  dipoles 

To  understand  this  subsection,  you’ll  have  to  have  studied  sec¬ 
tion  4.2.4,  on  iterated  integrals. 

The  distant  field  of  a  dipole,  in  its  midplane 

Most  current  distributions  cannot  be  broken  down  into  long, 
straight  wires,  and  subsection  11.2.1  has  exhausted  most  of  the  in¬ 
teresting  cases  we  can  handle  in  this  way.  A  much  more  useful 
building  block  is  a  square  current  loop.  We  have  already  seen  how 
the  dipole  moment  of  an  irregular  current  loop  can  be  found  by 
breaking  the  loop  down  into  square  dipoles  (figure  1  on  page  660), 
because  the  currents  in  adjoining  squares  cancel  out  on  their  shared 
edges.  Likewise,  as  shown  in  figure  f,  if  we  could  find  the  magnetic 
field  of  a  square  dipole,  then  we  could  find  the  field  of  any  planar 
loop  of  current  by  adding  the  contributions  to  the  field  from  all  the 
squares. 

The  field  of  a  square- loop  dipole  is  very  complicated  close  up, 
but  luckily  for  us,  we  only  need  to  know  the  current  at  distances 
that  are  large  compared  to  the  size  of  the  loop,  because  we’re  free 
to  make  the  squares  on  our  grid  as  small  as  we  like.  The  distant  field 
of  a  square  dipole  turns  out  to  be  simple,  and  is  no  different  from  the 
distant  field  of  any  other  dipole  with  the  same  dipole  moment.  We 
can  also  save  ourselves  some  work  if  we  only  worry  about  finding  the 
field  of  the  dipole  in  its  own  plane,  i.e.,  the  plane  perpendicular  to 
its  dipole  moment.  By  symmetry,  the  field  in  this  plane  cannot  have 
any  component  in  the  radial  direction  (inward  toward  the  dipole,  or 
outward  away  from  it);  it  is  perpendicular  to  the  plane,  and  in  the 
opposite  direction  compared  to  the  dipole  vector.  (The  field  inside 
the  loop  is  in  the  same  direction  as  the  dipole  vector,  but  we’re 
interested  in  the  distant  field.)  Letting  the  dipole  vector  be  along 
the  z  axis,  we  find  that  the  field  in  the  x  —  y  plane  is  of  the  form 
B~  =  f(r),  where  /(r)  is  some  function  that  depends  only  on  r,  the 
distance  from  the  dipole. 
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f  /  The  field  of  any  planar  current 
loop  can  be  found  by  breaking  it 
down  into  square  dipoles. 
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We  can  pin  down  the  result  even  more  without  any  math.  We 
know  that  the  magnetic  field  made  by  a  current  always  contains  a 
factor  of  k/c2,  which  is  the  coupling  constant  for  magnetism.  We 
also  know  that  the  field  must  be  proportional  to  the  dipole  moment, 
m  =  I  A.  Fields  are  always  directly  proportional  to  currents,  and 
the  proportionality  to  area  follows  because  dipoles  add  according 
to  their  area.  For  instance,  a  square  dipole  that  is  2  micrometers 
by  2  micrometers  in  size  can  be  cut  up  into  four  dipoles  that  are 
1  micrometer  on  a  side.  This  tells  us  that  our  result  must  be  of 
the  form  Bz  =  ( k/c2)(IA)g(r ).  Now  if  we  multiply  the  quantity 
( k/c2)(IA )  by  the  function  g(r),  we  have  to  get  units  of  teslas,  and 
this  only  works  out  if  g(r)  has  units  of  nr-3  (homework  problem  15), 
so  our  result  must  be  of  the  form 


Bz 


j3kIA 
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g/A  long,  straight  current- 
carrying  wire  can  be  constructed 
by  filling  half  of  a  plane  with 
square  dipoles. 
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h  /  Setting  up  the  integral. 


where  /?  is  a  unitless  constant.  Thus  our  only  task  is  to  determine 
/3,  and  we  will  have  determined  the  field  of  the  dipole  (in  the  plane 
of  its  current,  i.e.,  the  midplane  with  respect  to  its  dipole  moment 
vector) . 

If  we  wanted  to,  we  could  simply  build  a  dipole,  measure  its 
field,  and  determine  /3  empirically.  Better  yet,  we  can  get  an  exact 
result  if  we  take  a  current  loop  whose  field  we  know  exactly,  break 
it  down  into  infinitesimally  small  squares,  integrate  to  find  the  total 
field,  set  this  result  equal  to  the  known  expression  for  the  field  of  the 
loop,  and  solve  for  (3.  There’s  just  one  problem  here.  We  don’t  yet 
know  an  expression  for  the  field  of  any  current  loop  of  any  shape  — 
all  we  know  is  the  field  of  a  long,  straight  wire.  Are  we  out  of  luck? 
No,  because,  as  shown  in  figure  g,  we  can  make  a  long,  straight 
wire  by  putting  together  square  dipoles!  Any  square  dipole  away 
from  the  edge  has  all  four  of  its  currents  canceled  by  its  neighbors. 
The  only  currents  that  don’t  cancel  are  the  ones  on  the  edge,  so  by 
superimposing  all  the  square  dipoles,  we  get  a  straight-line  current. 

This  might  seem  strange.  If  the  squares  on  the  interior  have  all 
their  currents  canceled  out  by  their  neighbors,  why  do  we  even  need 
them?  Well,  we  need  the  squares  on  the  edge  in  order  to  make  the 
straight-line  current.  We  need  the  second  row  of  squares  to  cancel 
out  the  currents  at  the  top  of  the  first  row  of  squares,  and  so  on. 

Integrating  as  shown  in  figure  h,  we  have 


Bz 


d  B 
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where  d Bz  is  the  contribution  to  the  total  magnetic  field  at  our  point 
of  interest,  which  lies  a  distance  R  from  the  wire. 


This  can  be  simplified  with  the  substitutions  x  =  Ru ,  y  =  Rv,  and 
dx  d y  =  R2  d u  du: 


Bz 


Pkl  1 

-f, n -  /  /  - du  dv 

c  R  Jv= 0  Ju=- oo  [it2  +  (1  +  v)2]  ! 


The  u  integral  is  of  the  form  f^°  ( u 2  +  b)  3/2  du  =  2/b2,  so 


B,  = 


f3kl 


1 


du, 


c2R  Jv= o  (1  +  v)2 
and  the  remaining  v  integral  is  equals  2,  so 
2  fikl 


This  is  the  field  of  a  wire,  which  we  already  know  equals  2 kI/c2R, 
so  we  have  /3= 1.  Remember,  the  point  of  this  whole  calculation 
was  not  to  find  the  field  of  a  wire,  which  we  already  knew,  but 
to  find  the  unitless  constant  (3  in  the  expression  for  the  held  of  a 
dipole.  The  distant  held  of  a  dipole,  in  its  midplane,  is  therefore 
Bz  =  f3k!A/c2r 3  =  kIA/c2r 3,  or,  in  terms  of  the  dipole  moment, 


The  distant  Geld  of  a  dipole,  out  of  its  midplane 

What  about  the  held  of  a  magnetic  dipole  outside  of  the  dipole’s 
midplane?  Let’s  compare  with  an  electric  dipole.  An  electric  dipole, 
unlike  a  magnetic  one,  can  be  built  out  of  two  opposite  monopoles, 
i.e.,  charges,  separated  by  a  certain  distance,  and  it  is  then  straight¬ 
forward  to  show  by  vector  addition  that  the  held  of  an  electric  dipole 
is 


Ez  =  kD  (3  cos2  6  —  l)  r  3 
Eji  =  kD  (3  sin  9  cos  6)  r~3, 
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i  /  The  field  of  a  dipole. 


where  r  is  the  distance  from  the  dipole  to  the  point  of  interest,  8 
is  the  angle  between  the  dipole  vector  and  the  line  connecting  the 
dipole  to  this  point,  and  Ez  and  Er  are,  respectively,  the  compo¬ 
nents  of  the  field  parallel  to  and  perpendicular  to  the  dipole  vector. 

In  the  midplane,  8  equals  7r/2,  which  produces  Ez  =  — kDr~ 3 
and  Er  =  0.  This  is  the  same  as  the  field  of  a  magnetic  dipole  in  its 
midplane,  except  that  the  electric  coupling  constant  k  replaces  the 
magnetic  version  k/c2,  and  the  electric  dipole  moment  D  is  substi¬ 
tuted  for  the  magnetic  dipole  moment  m.  It  is  therefore  reasonable 
to  conjecture  that  by  using  the  same  presto-change-o  recipe  we  can 
find  the  field  of  a  magnetic  dipole  outside  its  midplane: 

BZ  =  ~E~  (3  cos2  8—1)  r-3 

Br  =  (3  sin  8  cos8)r~3. 

cz 


This  turns  out  to  be  correct.  5 

Concentric,  counterrotating  currents  example  9 

t>  Two  concentric  circular  current  loops,  with  radii  a  and  b,  carry 
the  same  amount  of  current  /,  but  in  opposite  directions.  What  is 
the  field  at  the  center? 

>  We  can  produce  these  currents  by  tiling  the  region  between  the 
circles  with  square  current  loops,  whose  currents  all  cancel  each 
other  except  at  the  inner  and  outer  edges.  The  flavor  of  the  calcu¬ 
lation  is  the  same  as  the  one  in  which  we  made  a  line  of  current 
by  filling  a  half-plane  with  square  loops.  The  main  difference  is 
that  this  geometry  has  a  different  symmetry,  so  it  will  make  more 
sense  to  use  polar  coordinates  instead  of  x  and  y.  The  field  at 
the  center  is 


j  /  Example  9. 
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The  positive  sign  indicates  that  the  field  is  out  of  the  page. 


Tf  you’ve  taken  a  course  in  differential  equations,  this  won’t  seem  like  a  very 
surprising  assertion.  The  differential  form  of  Gauss’  law  is  a  differential  equation, 
and  by  giving  the  value  of  the  field  in  the  midplane,  we’ve  specified  a  boundary 
condition  for  the  differential  equation.  Normally  if  you  specify  the  boundary 
conditions,  then  there  is  a  unique  solution  to  the  differential  equation.  In  this 
particular  case,  it  turns  out  that  to  ensure  uniqueness,  we  also  need  to  demand 
that  the  solution  satisfy  the  differential  form  of  Ampere’s  law,  which  is  discussed 
in  section  11.4. 
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Field  at  the  center  of  a  circular  loop  example  1 0 

o  What  is  the  magnetic  field  at  the  center  of  a  circular  current  loop 
of  radius  a,  which  carries  a  current  /? 

>  This  is  like  example  9,  but  with  the  outer  loop  being  very  large, 
and  therefore  too  distant  to  make  a  significant  field  at  the  center. 
Taking  the  limit  of  that  result  as  b  approaches  infinity,  we  have 


Comparing  the  results  of  examples  9  and  10,  we  see  that  the 
directions  of  the  fields  are  both  out  of  the  page.  In  example  9,  the 
outer  loop  has  a  current  in  the  opposite  direction,  so  it  contributes 
a  field  that  is  into  the  page.  This,  however,  is  weaker  than  the  field 
due  to  the  inner  loop,  which  dominates  because  it  is  less  distant. 

11.2.4  The  Biot-Savart  law  (optional) 

In  section  11.2.3  we  developed  a  method  for  finding  the  field  due 
to  a  given  current  distribution  by  tiling  a  plane  with  square  dipoles. 
This  method  has  several  disadvantages: 


•  The  currents  all  have  to  lie  in  a  single  plane,  and  the  point  at 
which  we’re  computing  the  field  must  be  in  that  plane  as  well. 

•  We  need  to  do  integral  over  an  area,  which  means  one  integral 
inside  another,  e.g.,  f  f  . .  .dxdy.  That  can  get  messy. 

•  It’s  physically  bizarre  to  have  to  construct  square  dipoles  in 
places  where  there  really  aren’t  any  currents. 


Figure  k  shows  the  first  step  in  eliminating  these  defects:  instead 
of  spreading  our  dipoles  out  in  a  plane,  we  bring  them  out  along  an 
axis.  As  shown  in  figure  1,  this  eliminates  the  restriction  to  currents 
that  lie  in  a  plane.  Now  we  have  to  use  the  general  equations  for 
a  dipole  field  from  page  674,  rather  than  the  simpler  expression  for 
the  field  in  the  midplane  of  a  dipole.  This  increase  in  complication 
is  more  than  compensated  for  by  a  fortunate  feature  of  the  new 
geometry,  which  is  that  the  infinite  tube  can  be  broken  down  into 
strips,  and  we  can  find  the  field  of  such  a  strip  for  once  and  for  all. 
This  means  that  we  no  longer  have  to  do  one  integral  inside  another. 
The  derivation  of  the  most  general  case  is  a  little  messy,  so  I’ll  just 
present  the  case  shown  in  figure  m,  where  the  point  of  interest  is 
assumed  to  lie  in  the  y  —  z  plane.  Intuitively,  what  we’re  really 
finding  is  the  field  of  the  short  piece  of  length  df?  on  the  end  of  the 
U;  the  two  long  parallel  segments  are  going  to  be  canceled  out  by 
their  neighbors  when  we  assemble  all  the  strips  to  make  the  tube. 
We  expect  that  the  field  of  this  end-piece  will  form  a  pattern  that 
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k  /  Two  ways  of  making  a  current 
loop  out  of  square  dipoles. 


I /The  new  method  can  han¬ 
dle  non-planar  currents. 
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circulates  around  the  y  axis,  so  at  the  point  of  interest,  it’s  really 
the  x  component  of  the  field  that  we  want  to  compute: 


.'s!'/  a/*-,  \ 


X 

m/The  field  of  an  infinite  U. 


n/The  geometry  of  the  Biot- 
Savart  law.  The  small  arrows 
show  the  result  of  the  Biot-Savart 
law  at  various  positions  relative 
to  the  current  segment  til.  The 
Biot-Savart  law  involves  a  cross 
product,  and  the  right-hand 
rule  for  this  cross  product  is 
demonstrated  for  one  case. 
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In  the  more  general  case,  1,  the  current  loop  is  not  planar,  the  point 
of  interest  is  not  in  the  end-planes  of  the  U’s,  and  the  U  shapes 
have  their  ends  staggered,  so  the  end-piece  d l  is  not  the  only  part  of 
each  U  whose  current  is  not  canceled.  Without  going  into  the  gory 
details,  the  correct  general  result  is  as  follows: 


dB 


kl  dl  x  r 

c2r3  ’ 


which  is  known  as  the  Biot-Savart  law.  (It  rhymes  with  “leo  bazaar.” 
Both  t’s  are  silent.)  The  distances  dl  and  r  are  now  defined  as 
vectors,  dl  and  r,  which  point,  respectively,  in  the  direction  of  the 
current  in  the  end-piece  and  the  direction  from  the  end-piece  to 
the  point  of  interest.  The  new  equation  looks  different,  but  it  is 
consistent  with  the  old  one.  The  vector  cross  product  dl  x  r  has  a 
magnitude  r  dl  sin  cf),  which  cancels  one  of  r’s  in  the  denominator 
and  makes  the  dl  x  r /r3  into  a  vector  with  magnitude  dl  sin  <f>/r2. 

The  field  at  the  center  of  a  circular  loop  example  1 1 

Previously  we  had  to  do  quite  a  bit  of  work  (examples  9  and  10), 
to  calculate  the  field  at  the  center  of  a  circular  loop  of  current  of 
radius  a.  It’s  much  easier  now.  Dividing  the  loop  into  many  short 
segments,  each  d l  is  perpendicular  to  the  r  vector  that  goes  from 
it  to  the  center  of  the  circle,  and  every  r  vector  has  magnitude  a. 
Therefore  every  cross  product  til  x  r  has  the  same  magnitude, 
a  til,  as  well  as  the  same  direction  along  the  axis  perpendicular 
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to  the  loop.  The  field  is 
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Out-of-the-plane  field  of  a  circular  loop  example  12 

>  What  is  the  magnetic  field  of  a  circular  loop  of  current  at  a  point 
on  the  axis  perpendicular  to  the  loop,  lying  a  distance  z  from  the 
loop’s  center? 

>  Again,  let’s  write  a  for  the  loop’s  radius.  The  r  vector  now  has 
magnitude  Va2  +  z2,  but  it  is  still  perpendicular  to  the  61  vector. 
By  symmetry,  the  only  nonvanishing  component  of  the  field  is 
along  the  z  axis, 
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Is  it  the  field  of  a  particle? 

We  have  a  simple  equation,  based  on  Coulomb’s  law,  for  the  elec¬ 
tric  field  surrounding  a  charged  particle.  Looking  at  figure  n,  we  can 
imagine  that  if  the  current  segment  d£  was  very  short,  then  it  might 
only  contain  one  electron.  It’s  tempting,  then,  to  interpret  the  Biot- 
Savart  law  as  a  similar  equation  for  the  magnetic  field  surrounding  a 
moving  charged  particle.  Tempting  but  wrong!  Suppose  you  stand 
at  a  certain  point  in  space  and  watch  a  charged  particle  move  by. 
It  has  an  electric  field,  and  since  it’s  moving,  you  will  also  detect 
a  magnetic  field  on  top  of  that.  Both  of  these  Helds  change  over 
time,  however.  Not  only  do  they  change  their  magnitudes  and  di¬ 
rections  due  to  your  changing  geometric  relationship  to  the  particle, 
but  they  are  also  time-delayed,  because  disturbances  in  the  electro¬ 
magnetic  field  travel  at  the  speed  of  light,  which  is  finite.  The  fields 
you  detect  are  the  ones  corresponding  to  where  the  particle  used 
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to  be,  not  where  it  is  now.  Coulomb’s  law  and  the  Biot-Savart  law 
are  both  false  in  this  situation,  since  neither  equation  includes  time 
as  a  variable.  It’s  valid  to  think  of  Coulomb’s  law  as  the  equation 
for  the  field  of  a  stationary  charged  particle,  but  not  a  moving  one. 
The  Biot-Savart  law  fails  completely  as  a  description  of  the  field  of 
a  charged  particle,  since  stationary  particles  don’t  make  magnetic 
fields,  and  the  Biot-Savart  law  fails  in  the  case  where  the  particle  is 
moving. 

If  you  look  back  at  the  long  chain  of  reasoning  that  led  to  the 
Biot-Savart  law,  it  all  started  from  the  relativistic  arguments  at  the 
beginning  of  this  chapter,  where  we  assumed  a  steady  current  in  an 
infinitely  long  wire.  Everything  that  came  later  was  built  on  this 
foundation,  so  all  our  reasoning  depends  on  the  assumption  that  the 
currents  are  steady.  In  a  steady  current,  any  charge  that  moves  away 
from  a  certain  spot  is  replaced  by  more  charge  coming  up  behind  it, 
so  even  though  the  charges  are  all  moving,  the  electric  and  magnetic 
fields  they  produce  are  constant.  Problems  of  this  type  are  called 
electrostatics  and  magnetostatics  problems,  and  it  is  only  for  these 
problems  that  Coulomb’s  law  and  the  Biot-Savart  law  are  valid. 

You  might  think  that  we  could  patch  up  Coulomb’s  law  and  the 
Biot-Savart  law  by  inserting  the  appropriate  time  delays.  However, 
we’ve  already  seen  a  clear  example  of  a  phenomenon  that  wouldn’t 
be  fixed  by  this  patch:  on  page  603,  we  found  that  a  changing 
magnetic  field  creates  an  electric  field.  Induction  effects  like  these 
also  lead  to  the  existence  of  light,  which  is  a  wave  disturbance  in  the 
electric  and  magnetic  fields.  We  could  try  to  apply  another  band- 
aid  fix  to  Coulomb’s  law  and  the  Biot-Savart  law  to  make  them  deal 
with  induction,  but  it  won’t  work. 

So  what  are  the  fundamental  equations  that  describe  how  sources 
give  rise  to  electromagnetic  fields?  We’ve  already  encountered  two 
of  them:  Gauss’  law  for  electricity  and  Gauss’  law  for  magnetism. 
Experiments  show  that  these  are  valid  in  all  situations,  not  just 
static  ones.  But  Gauss’  law  for  magnetism  merely  says  that  the 
magnetic  flux  through  a  closed  surface  is  zero.  It  doesn’t  tell  us 
how  to  make  magnetic  fields  using  currents.  It  only  tells  us  that  we 
can’t  make  them  using  magnetic  monopoles.  The  following  section 
develops  a  new  equation,  called  Ampere’s  law,  which  is  equivalent 
to  the  Biot-Savart  law  for  magnetostatics,  but  which,  unlike  the 
Biot-Savart  law,  can  easily  be  extended  to  nonstatic  situations. 
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11.3  Magnetic  Fields  by  Ampere’s  Law 

11.3.1  Ampere’s  law 

As  discussed  at  the  end  of  subsection  11.2.4,  our  goal  now  is  to 
find  an  equation  for  magnetism  that,  unlike  the  Biot-Savart  law,  will 
not  end  up  being  a  dead  end  when  we  try  to  extend  it  to  nonstatic 
situations.6  Experiments  show  that  Gauss’  law  is  valid  in  both  static 
and  nonstatic  situations,  so  it  would  be  reasonable  to  look  for  an 
approach  to  magnetism  that  is  similar  to  the  way  Gauss’  law  deals 
with  electricity. 

How  can  we  do  this?  Figure  a,  reproduced  from  page  670,  is  our 
roadmap.  Electric  fields  spread  out  from  charges.  Magnetic  fields 
curl  around  currents.  In  figure  b/1,  we  define  a  Gaussian  surface, 
and  we  define  the  flux  in  terms  of  the  electric  field  pointing  out 
through  this  surface.  In  the  magnetic  case,  b/2,  we  define  a  surface, 
called  an  Amperian  surface,  and  we  define  a  quantity  called  the 
circulation,  T  (uppercase  Greek  gamma),  in  terms  of  the  magnetic 
field  that  points  along  the  edge  of  the  Amperian  surface,  c.  We 
break  the  edge  into  tiny  parts  s j,  and  for  each  of  these  parts,  we 
define  a  contribution  to  the  circulation  using  the  dot  product  of  ds 
with  the  magnetic  field: 

T  =  ^  sj  ' 

The  circulation  is  a  measure  of  how  curly  the  field  is.  Like  a  Gaus¬ 
sian  surface,  an  Amperian  surface  is  purely  a  mathematical  con¬ 
struction.  It  is  not  a  physical  object. 

In  figure  b/2,  the  field  is  perpendicular  to  the  edges  on  the  ends, 
but  parallel  to  the  top  and  bottom  edges.  A  dot  product  is  zero 
when  the  vectors  are  perpendicular,  so  only  the  top  and  bottom 
edges  contribute  to  T.  Let  these  edges  have  length  s.  Since  the 
field  is  constant  along  both  of  these  edges,  we  don’t  actually  have 
to  break  them  into  tiny  parts;  we  can  just  have  si  on  the  top  edge, 
pointing  to  the  left,  and  S2  on  the  bottom  edge,  pointing  to  the  right. 
The  vector  si  is  in  the  same  direction  as  the  field  Bi,  and  S2  is  in 
the  same  direction  as  B2,  so  the  dot  products  are  simply  equal  to 
the  products  of  the  vectors’  magnitudes.  The  resulting  circulation 
is 


r  =  |s1||B1|  +  |s2||B2 

2-Kkijs  2irki]s 
=  +  — 

A.'Kkiqs 


But  77s  is  (current/length) (length),  i.e.,  it  is  the  amount  of  current 
that  pierces  the  Amperian  surface.  We’ll  call  this  current  Ithrough- 

6If  you  didn’t  read  this  optional  subsection,  don’t  worry,  because  the  point  is 
that  we  need  to  try  a  whole  new  approach  anyway. 


a  sheet  of  charge 

Ex  =  ±27tko 

\  \  f  f  |  |  f  f  / 

■*-  I+  +  +  +  +  +  +  +  ~+1  -* 

a  sheet  of  current 

B||=  ±27ikr|/c2 

+  1©©®©®©©©©®!  + 


a  /  The  electric  field  of  a  sheet  of 
charge,  and  the  magnetic  field  of 
a  sheet  of  current. 


b  /  A  Gaussian  surface  and 
an  Amperian  surface. 


c/The  definition  of  the  circu¬ 
lation,  F. 
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(1)  r >0,  Ithrough^® 


(2)  r<0,  (through 

d  /  Positive  and  negative  signs  in 
Ampere’s  law. 


(1) 


(3) 


e  /  Example  13:  a  cutaway 

view  of  a  solenoid. 


We  have  found  one  specific  example  of  the  general  law  of  nature 
known  as  Ampere’s  law: 


47T&: 

^2  1  through 


Positive  and  negative  signs 

Figures  d/1  and  d/2  show  what  happens  to  the  circulation  when 
we  reverse  the  direction  of  the  current  I  through-  Reversing  the  cur¬ 
rent  causes  the  magnetic  field  to  reverse  itself  as  well.  The  dot 
products  occurring  in  the  circulation  are  all  negative  in  d/2,  so  the 
total  circulation  is  now  negative.  To  preserve  Ampere’s  law,  we  need 
to  define  the  current  in  d/2  as  a  negative  number.  In  general,  deter¬ 
mine  these  plus  and  minus  signs  using  the  right-hand  rule  shown  in 
the  figure.  As  the  fingers  of  your  hand  sweep  around  in  the  direction 
of  the  s  vectors,  your  thumb  defines  the  direction  of  current  which  is 
positive.  Choosing  the  direction  of  the  thumb  is  like  choosing  which 
way  to  insert  an  ammeter  in  a  circuit:  on  a  digital  meter,  reversing 
the  connections  gives  readings  which  are  opposite  in  sign. 


A  solenoid  example  13 

>  What  is  the  field  inside  a  long,  straight  solenoid  of  length  £  and 
radius  a,  and  having  N  loops  of  wire  evenly  wound  along  it,  which 
carry  a  current  /? 

>  This  is  an  interesting  example,  because  it  allows  us  to  get  a 
very  good  approximation  to  the  field,  but  without  some  experi¬ 
mental  input  it  wouldn’t  be  obvious  what  approximation  to  use. 
Figure  e/1  shows  what  we’d  observe  by  measuring  the  field  of  a 
real  solenoid.  The  field  is  nearly  constant  inside  the  tube,  as  long 
as  we  stay  far  away  from  the  mouths.  The  field  outside  is  much 
weaker.  For  the  sake  of  an  approximate  calculation,  we  can  ideal¬ 
ize  this  field  as  shown  in  figure  e/2.  Of  the  edges  of  the  Amperian 
surface  shown  in  e/3,  only  AB  contributes  to  the  flux  —  there  is 
zero  field  along  CD,  and  the  field  is  perpendicular  to  edges  BC 
and  DA.  Ampere’s  law  gives 


r  = 

(8)(length  of  AB)  = 
B  = 


4nk 

c2 

4nk 


through 


(r))(length  of  AB) 


4nkr\ 


4nkNI 

c2l 


self-check  D 

What  direction  is  the  current  in  figure  e?  >  Answer,  p.  967 
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self-check  E 

Based  on  how  t  entered  into  the  derivation  in  example  13,  how  should 
it  be  interpreted?  Is  it  the  total  length  of  the  wire?  >  Answer,  p.  967 

self-check  F 

Surprisingly,  we  never  needed  to  know  the  radius  of  the  solenoid  in 
example  13.  Why  is  it  physically  plausible  that  the  answer  would  be 
independent  of  the  radius?  >  Answer,  p.  967 

Example  13  shows  how  much  easier  it  can  sometimes  be  to  cal¬ 
culate  a  field  using  Ampere’s  law  rather  than  the  approaches  de¬ 
veloped  previously  in  this  chapter.  However,  if  we  hadn’t  already 
known  something  about  the  field,  we  wouldn’t  have  been  able  to 
get  started.  In  situations  that  lack  symmetry,  Ampere’s  law  may 
make  things  harder,  not  easier.  Anyhow,  we  will  have  no  choice  in 
nonstatic  cases,  where  Ampere’s  law  is  true,  and  static  equations 
like  the  Biot-Savart  law  are  false. 

11.3.2  A  quick  and  dirty  proof 

Here’s  an  informal  sketch  for  a  proof  of  Ampere’s  law,  with  no 
pretensions  to  rigor.  Even  if  you  don’t  care  much  for  proofs,  it  would 
be  a  good  idea  to  read  it,  because  it  will  help  to  build  your  ability 
to  visualize  how  Ampere’s  law  works. 

First  we  establish  by  a  direct  computation  (homework  problem 
26)  that  Ampere’s  law  holds  for  the  geometry  shown  in  figure  f/1, 
a  circular  Amperian  surface  with  a  wire  passing  perpendicularly 
through  its  center.  If  we  then  alter  the  surface  as  in  figure  f/2, 
Ampere’s  law  still  works,  because  the  straight  segments,  being  per¬ 
pendicular  to  the  field,  don’t  contribute  to  the  circulation,  and  the 
new  arc  makes  the  same  contribution  to  the  circulation  as  the  old 
one  it  replaced,  because  the  weaker  field  is  compensated  for  by  the 
greater  length  of  the  arc.  It  is  clear  that  by  a  series  of  such  modifi¬ 
cations,  we  could  mold  the  surface  into  any  shape,  f/3. 

Next  we  prove  Ampere’s  law  in  the  case  shown  in  figure  f/4: 
a  small,  square  Amperian  surface  subject  to  the  Held  of  a  distant 
square  dipole.  This  part  of  the  proof  can  be  most  easily  accom¬ 
plished  by  the  methods  of  section  11.4.  It  should,  for  example,  be 
plausible  in  the  case  illustrated  here.  The  field  on  the  left  edge  is 
stronger  than  the  field  on  the  right,  so  the  overall  contribution  of 
these  two  edges  to  the  circulation  is  slightly  counterclockwise.  How¬ 
ever,  the  field  is  not  quite  perpendicular  to  the  top  and  bottoms 
edges,  so  they  both  make  small  clockwise  contributions.  The  clock¬ 
wise  and  counterclockwise  parts  of  the  circulation  end  up  canceling 
each  other  out.  Once  Ampere’s  law  is  established  for  a  square  sur¬ 
face  like  f/4,  it  follows  that  it  is  true  for  an  irregular  surface  like  f/5, 
since  we  can  build  such  a  shape  out  of  squares,  and  the  circulations 
are  additive  when  we  paste  the  surfaces  together  this  way. 

By  pasting  a  square  dipole  onto  the  wire,  f/6,  like  a  flag  attached 


f  /  A  proof  of  Ampere’s  law. 


Section  1 1 .3  Magnetic  Fields  by  Ampere’s  Law 


681 


to  a  flagpole,  we  can  cancel  out  a  segment  of  the  wire’s  current 
and  create  a  detour.  Ampere’s  law  is  still  true  because,  as  shown 
in  the  last  step,  the  square  dipole  makes  zero  contribution  to  the 
circulation.  We  can  make  as  many  detours  as  we  like  in  this  manner, 
thereby  morphing  the  wire  into  an  arbitrary  shape  like  f/7. 

What  about  a  wire  like  f/8?  It  doesn’t  pierce  the  Amperian  sur¬ 
face,  so  it  doesn’t  add  anything  to  I  through >  and  we  need  to  show  that 
it  likewise  doesn’t  change  the  circulation.  This  wire,  however,  could 
be  built  by  tiling  the  half-plane  on  its  right  with  square  dipoles,  and 
we’ve  already  established  that  the  field  of  a  distant  dipole  doesn’t 
contribute  to  the  circulation.  (Note  that  we  couldn’t  have  done  this 
with  a  wire  like  f/7,  because  some  of  the  dipoles  would  have  been 
right  on  top  of  the  Amperian  surface.) 

If  Ampere’s  law  holds  for  cases  like  f/7  and  f/8,  then  it  holds 
for  any  complex  bundle  of  wires,  including  some  that  pass  through 
the  Amperian  surface  and  some  that  don’t.  But  we  can  build  just 
about  any  static  current  distribution  we  like  using  such  a  bundle  of 
wires,  so  it  follows  that  Ampere’s  law  is  valid  for  any  static  current 
distribution. 

1 1 .3.3  Maxwell’s  equations  for  static  fields 

Static  electric  fields  don’t  curl  the  way  magnetic  fields  do,  so  we 
can  state  a  version  of  Ampere’s  law  for  electric  fields,  which  is  that 
the  circulation  of  the  electric  field  is  zero.  Summarizing  what  we 
know  so  far  about  static  fields,  we  have 

=  47T kqin 

=  0 
Tg  =  0 

_  4vrA: 

B  ^2  -‘■through' 


This  set  of  equations  is  the  static  case  of  the  more  general  relations 
known  as  Maxwell’s  equations.  On  the  left  side  of  each  equation,  we 
have  information  about  a  field.  On  the  right  is  information  about 
the  field’s  sources. 

It  is  vitally  important  to  realize  that  these  equations  are  only 
true  for  statics.  They  are  incorrect  if  the  distribution  of  charges 
or  currents  is  changing  over  time.  For  example,  we  saw  on  page 
603  that  the  changing  magnetic  field  in  an  inductor  gives  rise  to 
an  electric  field.  Such  an  effect  is  completely  inconsistent  with  the 
static  version  of  Maxwell’s  equations;  the  equations  don’t  even  refer 
to  time,  so  if  the  magnetic  field  is  changing  over  time,  they  will 
not  do  anything  special.  The  extension  of  Maxwell’s  equations  to 
nonstatic  fields  is  discussed  in  section  11.6. 
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Discussion  Questions 

A  Figure  g/1  shows  a  wire  with  a  circular  Amperian  surface  drawn 
around  its  waist;  in  this  situation,  Ampere’s  law  can  be  verified  easily 
based  on  the  equation  for  the  field  of  a  wire.  In  panel  2,  a  second  wire  has 
been  added.  Explain  why  it’s  plausible  that  Ampere’s  law  still  holds. 

B  Fig  ure  h  is  like  figure  g,  but  now  the  second  wire  is  perpendicular 
to  the  first,  and  lies  in  the  plane  of,  and  outside  of,  the  Amperian  surface. 
Carry  out  a  similar  analysis. 

C  This  discussion  question  is  similar  to  questions  A  and  B,  but  now  the 
Amperian  surface  has  been  moved  off  center. 

D  The  left-hand  wire  has  been  nudged  over  a  little.  Analyze  as  before. 

E  You  know  what  to  do. 


g  /  Discussion  question  A. 


h  /  Discussion  question  B. 


i  /  Discussion  question  C. 


j  /  Discussion  question  D. 


k/ Discussion  question  E. 
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a /The  div-meter,  1,  and  the 
curl-meter,  2  and  3. 


1 1 .4  Ampere’s  Law  In  Differential  Form  (Optional) 

11.4.1  The  curl  operator 

The  differential  form  of  Gauss’  law  is  more  physically  satisfying 
than  the  integral  form,  because  it  relates  the  charges  that  are  present 
at  some  point  to  the  properties  of  the  electric  field  at  the  same  point. 
Likewise,  it  would  be  more  attractive  to  have  a  differential  version 
of  Ampere’s  law  that  would  relate  the  currents  to  the  magnetic  field 
at  a  single  point,  intuitively,  the  divergence  was  based  on  the  idea 
of  the  div-meter,  a/1.  The  corresponding  device  for  measuring  the 
curliness  of  a  field  is  the  curl-meter,  a/2.  If  the  field  is  curly,  then 
the  torques  on  the  charges  will  not  cancel  out,  and  the  wheel  will 
twist  against  the  resistance  of  the  spring.  If  your  intuition  tells  you 
that  the  curlmeter  will  never  do  anything  at  all,  then  your  intuition 
is  doing  a  beautiful  job  on  static  fields;  for  nonstatic  fields,  however, 
it  is  perfectly  possible  to  get  a  curly  electric  field. 

Gauss’  law  in  differential  form  relates  div  E,  a  scalar,  to  the 
charge  density,  another  scalar.  Ampere’s  law,  however,  deals  with 
directions  in  space:  if  we  reverse  the  directions  of  the  currents,  it 
makes  a  difference.  We  therefore  expect  that  the  differential  form  of 
Ampere’s  law  will  have  vectors  on  both  sides  of  the  equal  sign,  and 
we  should  be  thinking  of  the  curl-meter’s  result  as  a  vector.  First 
we  find  the  orientation  of  the  curl-meter  that  gives  the  strongest 
torque,  and  then  we  define  the  direction  of  the  curl  vector  using  the 
right-hand  rule  shown  in  figure  a/3. 

To  convert  the  div-meter  concept  to  a  mathematical  definition, 
we  found  the  infinitesimal  flux,  d<L  through  a  tiny  cubical  Gaussian 
surface  containing  a  volume  du.  By  analogy,  we  imagine  a  tiny 
square  Amperian  surface  with  infinitesimal  area  dA.  We  assume  this 
surface  has  been  oriented  in  order  to  get  the  maximum  circulation. 
The  area  vector  dA  will  then  be  in  the  same  direction  as  the  one 
defined  in  figure  a/3.  Ampere’s  law  is 

_  4irk 

Ul  —  ^2  Ui  through' 

We  define  a  current  density  per  unit  area,  j,  which  is  a  vector 
pointing  in  the  direction  of  the  current  and  having  magnitude  j  = 
dI/\  dA|.  In  terms  of  this  quantity,  we  have 

dT 

dr 

|  dAj 

With  this  motivation,  we  define  the  magnitude  of  the  curl  as 


|  curl  B 


dr 

dAj' 
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Note  that  the  curl,  just  like  a  derivative,  has  a  differential  divided 
by  another  differential.  In  terms  of  this  definition,  we  find  Ampere’s 
law  in  differential  form: 


b  /  The  coordinate  system 
used  in  the  following  examples. 


The  complete  set  of  Maxwell’s  equations  in  differential  form  is  col¬ 
lected  on  page  954. 

1 1 .4.2  Properties  of  the  curl  operator 

The  curl  is  a  derivative. 

As  an  example,  let’s  calculate  the  curl  of  the  field  x  shown  in 
figure  c.  For  our  present  purposes,  it  doesn’t  really  matter  whether 
this  is  an  electric  or  a  magnetic  field;  we’re  just  getting  out  feet  wet 
with  the  curl  as  a  mathematical  definition.  Applying  the  definition 
of  the  curl  directly,  we  construct  an  Amperian  surface  in  the  shape 
of  an  infinitesimally  small  square.  Actually,  since  the  field  is  uni¬ 
form,  it  doesn’t  even  matter  very  much  whether  we  make  the  square 
finite  or  infinitesimal.  The  right  and  left  edges  don’t  contribute  to 
the  circulation,  since  the  field  is  perpendicular  to  these  edges.  The 
top  and  bottom  do  contribute,  but  the  top’s  contribution  is  clock¬ 
wise,  i.e.,  into  the  page  according  to  the  right-hand  rule,  while  the 
bottom  contributes  an  equal  amount  in  the  counterclockwise  direc¬ 
tion,  which  corresponds  to  an  out-of-the-page  contribution  to  the 
curl.  They  cancel,  and  the  circulation  is  zero.  We  could  also  have 
determined  this  by  imagining  a  curl-meter  inserted  in  this  field:  the 
torques  on  it  would  have  canceled  out. 

It  makes  sense  that  the  curl  of  a  constant  field  is  zero,  because 
the  curl  is  a  kind  of  derivative.  The  derivative  of  a  constant  is  zero. 

The  curl  is  rotationally  invariant. 

Figure  c  looks  just  like  figure  c,  but  rotated  by  90  degrees.  Phys¬ 
ically,  we  could  be  viewing  the  same  field  from  a  point  of  view  that 
was  rotated.  Since  the  laws  of  physics  are  the  same  regardless  of 
rotation,  the  curl  must  be  zero  here  as  well.  In  other  words,  the  curl 
is  rotationally  invariant.  If  a  certain  field  has  a  certain  curl  vector, 
then  viewed  from  some  other  angle,  we  simply  see  the  same  field  and 
the  same  curl  vector,  viewed  from  a  different  angle.  A  zero  vector 
viewed  from  a  different  angle  is  still  a  zero  vector. 

As  a  less  trivial  example,  let’s  compute  the  curl  of  the  field  F  = 
xy  shown  in  figure  e,  at  the  point  (x  =  0,  y  =  0).  The  circulation 
around  a  square  of  side  s  centered  on  the  origin  can  be  approximated 


c  /  The  field  x. 

d  /  The  field  y. 


e  /  The  field  xy. 
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by  evaluating  the  field  at  the  midpoints  of  its  sides 


x  =  s/2 

y  =  o 

F  =  (s/2)y 

x  =  0 

y  =  s/2 

F  =  0 

x  =  —s/2 

y  =  o 

F  =  -(*/ 2)y 

x  =  0 

y  =  - s/2 

F  =  0 

si  •  F  =  s2  / 2 
s2  •  F  =  0 
s3  •  F  =  s2/ 2 
S4  •  F  =  0, 


which  gives  a  circulation  of  s2,  and  a  curl  with  a  magnitude  of 
s2/area  =  s2 / s2  =  1.  By  the  right-hand  rule,  the  curl  points  out  of 
the  page,  i.e. ,  along  the  positive  z  axis,  so  we  have 


curl  xy  =  z. 


f  /  The  field  -yx. 


g  /  Example  14. 


Now  consider  the  field  —  yx,  shown  in  figure  f.  This  is  the  same 
as  the  previous  field,  but  with  your  book  rotated  by  90  degrees  about 
the  z  axis.  Rotating  the  result  of  the  first  calculation,  z,  about  the 
z  axis  doesn’t  change  it,  so  the  curl  of  this  field  is  also  z. 


Scaling 

When  you’re  taking  an  ordinary  derivative,  you  have  the  rule 


_d 

dx 


[c/O)] 


/(*)• 


In  other  words,  multiplying  a  function  by  a  constant  results  in  a 
derivative  that  is  multiplied  by  that  constant.  The  curl  is  a  kind  of 
derivative  operator,  and  the  same  is  true  for  a  curl. 

Multiplying  the  field  by  - 1.  example  14 

t>  What  is  the  curl  of  the  field  -xy  at  the  origin? 

>  Using  the  scaling  property  just  discussed,  we  can  make  this  into 
a  curl  that  we’ve  already  calculated: 


curl  (-xy)  =  -curl  (xy) 

=  -z 


This  is  in  agreement  with  the  right-hand  rule. 

The  curl  is  additive. 

We  have  only  calculated  each  field’s  curl  at  the  origin,  but  each 
of  these  fields  actually  has  the  same  curl  everywhere.  In  example 
14,  for  instance,  it  is  obvious  that  the  curl  is  constant  along  any 
vertical  line.  But  even  if  we  move  along  the  x  axis,  there  is  still  an 
imbalance  between  the  torques  on  the  left  and  right  sides  of  the  curl- 
meter.  More  formally,  suppose  we  start  from  the  origin  and  move 
to  the  left  by  one  unit.  We  find  ourselves  in  a  region  where  the  field 
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is  very  much  as  it  was  before,  except  that  all  the  held  vectors  have 
had  one  unit  worth  of  y  added  to  them.  But  what  do  we  get  if  we 
take  the  curl  of  —  xy  +  y?  The  curl,  like  any  god-fearing  derivative 
operation,  has  the  additive  property 

curl  (F  +  G)  =  curl  F  +  curl  G, 


so 

curl  (—xy  +  y)  =  curl  (—xy)  +  curl  (y). 

But  the  second  term  is  zero,  so  we  get  the  same  result  as  at  the 
origin. 

A  field  that  goes  in  a  circle  example  15 

>  What  is  the  curl  of  the  field  xy  -  yx? 


>  Using  the  linearity  of  the  curl,  and  recognizing  each  of  the  terms 
as  one  whose  curl  we  have  already  computed,  we  find  that  this 
field’s  curl  is  a  constant  2z.  This  agrees  with  the  right-hand  rule. 

The  field  inside  a  long,  straight  wire  example  16 

>  What  is  the  magnetic  field  inside  a  long,  straight  wire  in  which 
the  current  density  is  /? 

>  Let  the  wire  be  along  the  z  axis,  so  j  =  yz.  Ampere’s  law  gives 


,  r,  47T/C  .„ 

curl  B  =  — ^-y z. 

c2 


In  other  words,  we  need  a  magnetic  field  whose  curl  is  a  constant. 
We’ve  encountered  several  fields  with  constant  curls,  but  the  only 
one  that  has  the  same  symmetry  as  the  cylindrical  wire  is  xy -yx, 
so  the  answer  must  be  this  field  or  some  constant  multiplied  by  it, 


B  =  b  (xy  -  yx) . 
The  curl  of  this  field  is  2bz,  so 


_ ,  Ank  . 


and  thus 


_  2nk  . ,  „ 

B  =  -^j(xy-yx)- 


The  curl  in  component  form 
Now  consider  the  field 

Fx  =  ax  +  by  +  c 
Fy  =  dx  +  ey  +  f, 
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i.e. 


F  =  axx  +  byk  +  cx  +  dx  y  +  eyy  +  /  y. 


y 


X 


i  /  A  cyclic 
y,  and  z. 


tltllt 
t  It  I  l  t 
tltllt 

t  l  I  It  t 


Mill* 
Mill* 
Mill1 
*  I  |  |  1  1 


j  /  Example  17. 


The  only  terms  whose  curls  we  haven’t  yet  explicitly  computed  are 
the  a,  e,  and  /  terms,  and  their  curls  turn  out  to  be  zero  (homework 
problem  50).  Only  the  b  and  d  terms  have  nonvanishing  curls.  The 
curl  of  this  field  is 

curl  F  =  curl  (byk)  +  curl  (dx y) 

=  6  curl  (yk)  +  cicurl  ( xy )  [scaling] 

=  b(— z)  +  d(z)  [found  previously] 

=  (d  —  b)  z. 


But  any  field  in  the  x  —  y  plane  can  be  approximated  with  this 
type  of  field,  as  long  as  we  only  need  to  get  a  good  approximation 
within  a  small  region.  The  infinitesimal  Amperian  surface  occurring 
in  the  definition  of  the  curl  is  tiny  enough  to  fit  in  a  pretty  small  re¬ 
gion,  so  we  can  get  away  with  this  here.  The  d  and  b  coefficients  can 
then  be  associated  with  the  partial  derivatives  dFy/dx  and  dFx/dy. 
We  therefore  have 


curl  F 


dFy  _  dFx  \  £ 

dx  dy  J 


for  any  field  in  the  x  —  y  plane.  In  three  dimensions,  we  just  need  to 
generate  two  more  equations  like  this  by  doing  a  cyclic  permutation 
of  the  variables  x,  y ,  and  z: 


(curl  F)x 
(curl  F)y 
(curl  F)z 


dFz 

dFy 

dy 

dz 

dFx 

dFz 

dz 

dx 

dFy 

9FX 

dx 

dy 

A  sine  wave  example  1 7 

>  Find  the  curl  of  the  following  electric  field 

E  =  (sinx)y, 


and  interpret  the  result. 

>  The  only  nonvanishing  partial  derivative  occurring  in  this  curl  is 


(curl  E )z  = 


dEy 

dx 

cosx, 
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so 


curl  E  =  cosz 


This  is  visually  reasonable:  the  curl-meter  would  spin  if  we  put  its 
wheel  in  the  plane  of  the  page,  with  its  axle  poking  out  along  the 
z  axis.  In  some  areas  it  would  spin  clockwise,  in  others  counter¬ 
clockwise,  and  this  makes  sense,  because  the  cosine  is  positive 
in  some  placed  and  negative  in  others. 

This  is  a  perfectly  reasonable  field  pattern:  it  the  electric  field 
pattern  of  a  light  wave!  But  Ampere’s  law  for  electric  fields  says 
the  curl  of  E  is  supposed  to  be  zero.  What’s  going  on?  What’s 
wrong  is  that  we  can’t  assume  the  static  version  of  Ampere’s  law. 
All  we’ve  really  proved  is  that  this  pattern  is  impossible  as  a  static 
field:  we  can’t  have  a  light  wave  that  stands  still. 

Figure  k  is  a  summary  of  the  vector  calculus  presented  in  the 
optional  sections  of  this  book.  The  first  column  shows  that  one  func¬ 
tion  is  a  related  to  another  by  a  kind  of  differentiation.  The  second 
column  states  the  fundamental  theorem  of  calculus,  which  says  that 
if  you  integrate  the  derivative  over  the  interior  of  a  region,  you  get 
some  information  about  the  original  function  at  the  boundary  of 
that  region. 
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interior 


f=  ^ 


k  /  A  summary  of  the  derivative,  gradient,  curl,  and  divergence. 

11.5  Induced  Electric  Fields 

11.5.1  Faraday’s  experiment 

Nature  is  simple,  but  the  simplicity  may  not  become  evident  un¬ 
til  a  hundred  years  after  the  discovery  of  some  new  piece  of  physics. 
We’ve  already  seen,  on  page  603,  that  the  time-varying  magnetic 
field  in  an  inductor  causes  an  electric  field.  This  electric  held  is  not 
created  by  charges.  That  argument,  however,  only  seems  clear  with 
hindsight.  The  discovery  of  this  phenomenon  of  induced  electric 
fields  —  fields  that  are  not  due  to  charges  —  was  a  purely  exper¬ 
imental  accomplishment  by  Michael  Faraday  (1791-1867),  the  son 
of  a  blacksmith  who  had  to  struggle  against  the  rigid  class  struc- 
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ture  of  19th  century  England.  Faraday,  working  in  1831,  had  only  a 
vague  and  general  idea  that  electricity  and  magnetism  were  related 
to  each  other,  based  on  Oersted’s  demonstration,  a  decade  before, 
that  magnetic  fields  were  caused  by  electric  currents. 

Figure  b  is  a  simplified  drawing  of  the  following  experiment,  as 
described  in  Faraday’s  original  paper:  “Two  hundred  and  three  feet 
of  copper  wire  . . .  were  passed  round  a  large  block  of  wood;  [another] 
two  hundred  and  three  feet  of  similar  wire  were  interposed  as  a  spiral 
between  the  turns  of  the  first,  and  metallic  contact  everywhere  pre¬ 
vented  by  twine  [insulation] .  One  of  these  [coils]  was  connected  with 
a  galvanometer  [voltmeter] ,  and  the  other  with  a  battery. . .  When 
the  contact  was  made,  there  was  a  sudden  and  very  slight  effect  at 
the  galvanometer,  and  there  was  also  a  similar  slight  effect  when 
the  contact  with  the  battery  was  broken.  But  whilst  the  . . .  current 
was  continuing  to  pass  through  the  one  [coil] ,  no  . . .  effect  . . .  upon 
the  other  [coil]  could  be  perceived,  although  the  active  power  of  the 
battery  was  proved  to  be  great,  by  its  heating  the  whole  of  its  own 
coil  [through  ordinary  resistive  heating]  ...” 

From  Faraday’s  notes  and  publications,  it  appears  that  the  situ¬ 
ation  in  figure  b/3  was  a  surprise  to  him,  and  he  probably  thought 
it  would  be  a  surprise  to  his  readers,  as  well.  That’s  why  he  offered 
evidence  that  the  current  was  still  flowing:  to  show  that  the  bat¬ 
tery  hadn’t  just  died.  The  induction  effect  occurred  during  the  short 
time  it  took  for  the  black  coil’s  magnetic  held  to  be  established,  b/2. 
Even  more  counterintuitively,  we  get  an  effect,  equally  strong  but  in 
the  opposite  direction,  when  the  circuit  is  broken,  b/4.  The  effect 
occurs  only  when  the  magnetic  held  is  changing,  and  it  appears  to 
be  proportional  to  the  derivative  <9B/<9t,  which  is  in  one  direction 
when  the  held  is  being  established,  and  in  the  opposite  direction 
when  it  collapses. 

The  effect  is  proportional  to  <9B/9t,  but  what  is  the  effect?  A 
voltmeter  is  nothing  more  than  a  resistor  with  an  attachment  for 
measuring  the  current  through  it.  A  current  will  not  how  through 
a  resistor  unless  there  is  some  electric  held  pushing  the  electrons, 
so  we  conclude  that  the  changing  magnetic  field  has  produced  an 
electric  field  in  the  surrounding  space.  Since  the  white  wire  is  not 
a  perfect  conductor,  there  must  be  electric  helds  in  it  as  well.  The 
remarkable  thing  about  the  circuit  formed  by  the  white  wire  is  that 
as  the  electrons  travel  around  and  around,  they  are  always  being 
pushed  forward  by  electric  helds.  This  violates  the  loop  rule,  which 
says  that  when  an  electron  makes  a  round  trip,  there  is  supposed 
to  be  just  as  much  “uphill”  (moving  against  the  electric  held)  as 
“downhill”  (moving  with  it).  That’s  OK.  The  loop  rule  is  only  true 
for  statics.  Faraday’s  experiments  show  that  an  electron  really  can 
go  around  and  around,  and  always  be  going  “downhill,”  as  in  the 
famous  drawing  by  M.C.  Escher  shown  in  hgure  c.  That’s  just  what 
happens  when  you  have  a  curly  held. 


a  /  Faraday  on  a  British  ban¬ 
knote. 


(3) 


(5) 


b  /  Faraday’s  experiment,  sim¬ 
plified  and  shown  with  modern 
equipment. 
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c  /  Detail  from  Ascending  and 
Descending,  M.C.  Escher,  1960. 


E/  \ 

- 7 

\  / 


d/The  relationship  between 
the  change  in  the  magnetic  field, 
and  the  electric  field  it  produces. 


e/The  electric  circulation  is 
the  sum  of  the  voltmeter  read¬ 
ings. 


When  a  field  is  curly,  we  can  measure  its  curliness  using  a  circu¬ 
lation.  Unlike  the  magnetic  circulation  r^,  the  electric  circulation 
Te  is  something  we  can  measure  directly  using  ordinary  tools.  A 
circulation  is  defined  by  breaking  up  a  loop  into  tiny  segments,  ds, 
and  adding  up  the  dot  products  of  these  distance  vectors  with  the 
field.  But  when  we  multiply  electric  field  by  distance,  what  we  get 
is  an  indication  of  the  amount  of  work  per  unit  charge  done  on  a 
test  charge  that  has  been  moved  through  that  distance.  The  work 
per  unit  charge  has  units  of  volts,  and  it  can  be  measured  using  a 
voltmeter,  as  shown  in  figure  e,  where  T e  equals  the  sum  of  the  volt¬ 
meter  readings.  Since  the  electric  circulation  is  directly  measurable, 
most  people  who  work  with  circuits  are  more  familiar  with  it  than 
they  are  with  the  magnetic  circulation.  They  usually  refer  to  T^; 
using  the  synonym  “emf,”  which  stands  for  “electromotive  force,” 
and  notate  it  as  £.  (This  is  an  unfortunate  piece  of  terminology, 
because  its  units  are  really  volts,  not  newtons.)  The  term  emf  can 
also  be  used  when  the  path  is  not  a  closed  loop. 

Faraday’s  experiment  demonstrates  a  new  relationship 


where  the  negative  sign  is  a  way  of  showing  the  observed  left-handed 
relationship,  d.  This  is  similar  to  the  structure  of  of  Ampere’s  law: 


r,B  OC  Ithroughi 

which  also  relates  the  curliness  of  a  field  to  something  that  is  going 
on  nearby  (a  current,  in  this  case). 

It’s  important  to  note  that  even  though  the  emf,  T^,  has  units 
of  volts,  it  isn’t  a  voltage.  A  voltage  is  a  measure  of  the  electrical 
energy  a  charge  has  when  it  is  at  a  certain  point  in  space.  The  curly 
nature  of  nonstatic  fields  means  that  this  whole  concept  becomes 
nonsense.  In  a  curly  field,  suppose  one  electron  stays  at  home  while 
its  friend  goes  for  a  drive  around  the  block.  When  they  are  reunited, 
the  one  that  went  around  the  block  has  picked  up  some  kinetic 
energy,  while  the  one  who  stayed  at  home  hasn’t.  We  simply  can’t 
define  an  electrical  energy  Ue  =  qV  so  that  Ue+K  stays  the  same  for 
each  electron.  No  voltage  pattern,  V,  can  do  this,  because  then  it 
would  predict  the  same  kinetic  energies  for  the  two  electrons,  which 
is  incorrect.  When  we’re  dealing  with  nonstatic  fields,  we  need  to 
think  of  the  electrical  energy  in  terms  of  the  energy  density  of  the 
fields  themselves. 

It  might  sound  as  though  an  electron  could  get  a  free  lunch 
by  circling  around  and  around  in  a  curly  electric  field,  resulting  in 
a  violation  of  conservation  of  energy.  The  following  examples,  in 
addition  to  their  practical  interest,  both  show  that  energy  is  in  fact 
conserved. 
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The  generator  example  18 

A  basic  generator,  f,  consists  of  a  permanent  magnet  that  rotates 
within  a  coil  of  wire.  The  magnet  is  turned  by  a  motor  or  crank, 
(not  shown).  As  it  spins,  the  nearby  magnetic  field  changes.  This 
changing  magnetic  field  results  in  an  electric  field,  which  has  a 
curly  pattern.  This  electric  field  pattern  creates  a  current  that 
whips  around  the  coils  of  wire,  and  we  can  tap  this  current  to  light 
the  lightbulb. 

If  the  magnet  was  on  a  frictionless  bearing,  could  we  light  the  bulb 
for  free  indefinitely,  thus  violating  conservation  of  energy?  No. 
Mechanical  work  has  to  be  done  to  crank  the  magnet,  and  that’s 
where  the  energy  comes  from.  If  we  break  the  light-bulb  circuit, 
it  suddenly  gets  easier  to  crank  the  magnet!  This  is  because 
the  current  in  the  coil  sets  up  its  own  magnetic  field,  and  that  field 
exerts  a  torque  on  the  magnet.  If  we  stopped  cranking,  this  torque 
would  quickly  make  the  magnet  stop  turning. 

self-check  G 

When  you’re  driving  your  car,  the  engine  recharges  the  battery  continu¬ 
ously  using  a  device  called  an  alternator,  which  is  really  just  a  generator. 
Why  can’t  you  use  the  alternator  to  start  the  engine  if  your  car’s  battery 
is  dead?  i>  Answer,  p.  967 

The  transformer  example  19 

In  example  18  on  page  546,  we  discussed  the  advantages  of 
transmitting  power  over  electrical  lines  using  high  voltages  and 
low  currents.  However,  we  don’t  want  our  wall  sockets  to  oper¬ 
ate  at  10000  volts!  For  this  reason,  the  electric  company  uses  a 
device  called  a  transformer,  g,  to  convert  everything  to  lower  volt¬ 
ages  and  higher  currents  inside  your  house.  The  coil  on  the  input 
side  creates  a  magnetic  field.  Transformers  work  with  alternating 
current,  so  the  magnetic  field  surrounding  the  input  coil  is  always 
changing.  This  induces  an  electric  field,  which  drives  a  current 
around  the  output  coil. 

Since  the  electric  field  is  curly,  an  electron  can  keep  gaining  more 
and  more  energy  by  circling  through  it  again  and  again.  Thus  the 
output  voltage  can  be  controlled  by  changing  the  number  of  coils 
of  wire  on  the  output  side.  Changing  the  number  of  coils  on  the 
input  side  also  has  an  effect  (homework  problem  33). 

In  any  case,  conservation  of  energy  guarantees  that  the  amount 
of  power  on  the  output  side  must  equal  the  amount  put  in  orig¬ 
inally,  ljnVjn  =  loutVout,  so  no  matter  what  factor  the  voltage  is 
reduced  by,  the  current  is  increased  by  the  same  factor. 

Discussion  Questions 

A  Suppose  the  bar  magnet  in  figure  f  on  page  693  has  a  magnetic 
field  pattern  that  emerges  from  its  top,  circling  around  and  coming  back 
in  the  bottom.  This  field  is  created  by  electrons  orbiting  atoms  inside 


output  — 
low  voltage 
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h  /  It  doesn’t  matter  whether 
it’s  the  coil  or  the  permanent 
magnet  that  spins.  Either  way,  we 
get  a  functioning  generator. 


the  magnet.  Are  these  atomic  currents  clockwise  or  counterclockwise  as 
seen  from  above?  In  what  direction  is  the  current  flowing  in  the  circuit? 

We  have  a  circling  atomic  current  inside  the  circling  current  in  the  wires. 
When  we  have  two  circling  currents  like  this,  they  will  make  torques  on 
each  other  that  will  tend  to  align  them  in  a  certain  way.  Since  currents  in 
the  same  direction  attract  one  another,  which  way  is  the  torque  made  by 
the  wires  on  the  bar  magnet?  Verify  that  due  to  this  torque,  mechanical 
work  has  to  be  done  in  order  to  crank  the  generator. 

11.5.2  Why  induction? 

Faraday’s  results  leave  us  in  the  dark  about  several  things: 

•  They  don’t  explain  why  induction  effects  occur. 

•  The  relationship  T e  oc  —dB/dt  tells  us  that  a  changing  mag¬ 
netic  field  creates  an  electric  field  in  the  surrounding  region  of 
space,  but  the  phrase  “surrounding  region  of  space”  is  vague, 
and  needs  to  be  made  mathematical. 

•  Suppose  that  we  can  make  the  “surrounding  region  of  space” 
idea  more  well  defined.  We  would  then  want  to  know  the  pro¬ 
portionality  constant  that  has  been  hidden  by  the  oc  symbol. 
Although  experiments  like  Faraday’s  could  be  used  to  find  a 
numerical  value  for  this  constant,  we  would  like  to  know  why 
it  should  have  that  particular  value. 

We  can  get  some  guidance  from  the  example  of  a  car’s  alternator 
(which  just  means  generator),  referred  to  in  the  self-check  on  page 
693.  To  keep  things  conceptually  simple,  I  carefully  avoided  men¬ 
tioning  that  in  a  real  car’s  alternator,  it  isn’t  actually  the  permanent 
magnet  that  spins.  The  coil  is  what  spins.  The  choice  of  design  h/1 
or  h/2  is  merely  a  matter  of  engineering  convenience,  not  physics. 
All  that  matters  is  the  relative  motion  of  the  two  objects. 

This  is  highly  suggestive.  As  discussed  at  the  beginning  of  this 
chapter,  magnetism  is  a  relativistic  effect.  From  arguments  about 
relative  motion,  we  concluded  that  moving  electric  charges  create 
magnetic  fields.  Now  perhaps  we  can  use  reasoning  with  the  same 
flavor  to  show  that  changing  magnetic  fields  produce  curly  electric 
fields.  Note  that  figure  h/2  doesn’t  even  require  induction.  The 
protons  and  electrons  in  the  coil  are  moving  through  a  magnetic 
field,  so  they  experience  forces.  The  protons  can’t  flow,  because 
the  coil  is  a  solid  substance,  but  the  electrons  can,  so  a  current  is 
induced.7 

Now  if  we’re  convinced  that  figure  h/2  produces  a  current  in 
the  coil,  then  it  seems  very  plausible  that  the  same  will  happen  in 

'Note  that  the  magnetic  field  never  does  work  on  a  charged  particle,  because 
its  force  is  perpendicular  to  the  motion;  the  electric  power  is  actually  coming 
from  the  mechanical  work  that  had  to  be  done  to  spin  the  coil.  Spinning  the  coil 
is  more  difficult  due  to  the  presence  of  the  magnet. 
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figure  h/1,  which  implies  the  existence  of  induction  effects.  But 
this  example  involves  circular  motion,  so  it  doesn’t  quite  work  as  a 
way  of  proving  that  induction  exists.  When  we  say  that  motion  is 
relative,  we  only  mean  straight-line  motion,  not  circular  motion. 

A  more  ironclad  relativistic  argument  comes  from  the  arrange¬ 
ment  shown  in  figure  i.  This  is  also  a  generator  —  one  that  is 
impractical,  but  much  easier  to  understand. 

Flea  1  doesn’t  believe  in  this  modern  foolishness  about  induction. 
She’s  sitting  on  the  bar  magnet,  which  to  her  is  obviously  at  rest.  As 
the  square  wire  loop  is  dragged  away  from  her  and  the  magnet,  its 
protons  experience  a  force  out  of  the  page,  because  the  cross  product 
F  =  qv  X  B  is  out  of  the  page.  The  electrons,  which  are  negatively 
charged,  feel  a  force  into  the  page.  The  conduction  electrons  are 
free  to  move,  but  the  protons  aren’t.  In  the  front  and  back  sides  of 
the  loop,  this  force  is  perpendicular  to  the  wire.  In  the  right  and 
left  sides,  however,  the  electrons  are  free  to  respond  to  the  force. 
Note  that  the  magnetic  field  is  weaker  on  the  right  side.  It’s  as 
though  we  had  two  pumps  in  a  loop  of  pipe,  with  the  weaker  pump 
trying  to  push  in  the  opposite  direction;  the  weaker  pump  loses  the 
argument.8  We  get  a  current  that  circulates  around  the  loop.9  There 
is  no  induction  going  on  in  this  frame  of  reference;  the  forces  that 
cause  the  current  are  just  the  ordinary  magnetic  forces  experienced 
by  any  charged  particle  moving  through  a  magnetic  field. 

Flea  2  is  sitting  on  the  loop,  which  she  considers  to  be  at  rest. 
In  her  frame  of  reference,  it’s  the  bar  magnet  that  is  moving.  Like 
flea  1,  she  observes  a  current  circulating  around  the  loop,  but  unlike 
flea  1,  she  cannot  use  magnetic  forces  to  explain  this  current.  As  far 
as  she  is  concerned,  the  electrons  were  initially  at  rest.  Magnetic 
forces  are  forces  between  moving  charges  and  other  moving  charges, 
so  a  magnetic  field  can  never  accelerate  a  charged  particle  starting 
from  rest.  A  force  that  accelerates  a  charge  from  rest  can  only  be 
an  electric  force,  so  she  is  forced  to  conclude  that  there  is  an  electric 
field  in  her  region  of  space.  This  field  drives  electrons  around  and 
around  in  circles,  so  it  is  apparently  violating  the  loop  rule  —  it  is 
a  curly  field.  What  reason  can  flea  2  offer  for  the  existence  of  this 
electric  field  pattern?  Well,  she’s  been  noticing  that  the  magnetic 
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i  /  A  generator  that  works  with 
linear  motion. 


8  If  the  pump  analogy  makes  you  uneasy,  consider  what  would  happen  if  all 
the  electrons  moved  into  the  page  on  both  sides  of  the  loop.  We’d  end  up  with 
a  net  negative  charge  at  the  back  side,  and  a  net  positive  charge  on  the  front. 
This  actually  would  happen  in  the  first  nanosecond  after  the  loop  was  set  in 
motion.  This  buildup  of  charge  would  start  to  quench  both  currents  due  to 
electrical  forces,  but  the  current  in  the  right  side  of  the  wire,  which  is  driven  by 
the  weaker  magnetic  field,  would  be  the  first  to  stop.  Eventually,  an  equilibrium 
will  be  reached  in  which  the  same  amount  of  current  is  flowing  at  every  point 
around  the  loop,  and  no  more  charge  is  being  piled  up. 

9The  wire  is  not  a  perfect  conductor,  so  this  current  produces  heat.  The 
energy  required  to  produce  this  heat  comes  from  the  hands,  which  are  doing 
mechanical  work  as  they  separate  the  magnet  from  the  loop. 


Section  1 1.5  Induced  Electric  Fields 


695 


T  t  t  t 


T  t  t  t 


z 


j/A  new  version  of  figure  i 
with  a  tiny  loop.  The  point  of  view 
is  above  the  plane  of  the  loop. 
In  the  frame  of  reference  where 
the  magnetic  field  is  constant,  the 
loop  is  moving  to  the  right. 


field  in  her  region  of  space  has  been  changing,  possibly  because  that 
bar  magnet  over  there  has  been  getting  farther  away.  She  observes 
that  a  changing  magnetic  field  creates  a  curly  electric  field. 

We  therefore  conclude  that  induction  effects  must  exist  based  on 
the  fact  that  motion  is  relative.  If  we  didn’t  want  to  admit  induction 
effects,  we  would  have  to  outlaw  flea  2’s  frame  of  reference,  but 
the  whole  idea  of  relative  motion  is  that  all  frames  of  reference  are 
created  equal,  and  there  is  no  way  to  determine  which  one  is  really 
at  rest. 

This  whole  line  of  reasoning  was  not  available  to  Faraday  and 
his  contemporaries,  since  they  thought  the  relative  nature  of  mo¬ 
tion  only  applied  to  matter,  not  to  electric  and  magnetic  fields.10 
But  with  the  advantage  of  modern  hindsight,  we  can  understand 
in  fundamental  terms  the  facts  that  Faraday  had  to  take  simply  as 
mysterious  experimental  observations.  For  example,  the  geometric 
relationship  shown  in  figure  d  follows  directly  from  the  direction  of 
the  current  we  deduced  in  the  story  of  the  two  fleas. 

1 1 .5.3  Faraday’s  law 

We  can  also  answer  the  other  questions  posed  on  page  694.  The 
divide-and-conquer  approach  should  be  familiar  by  now.  We  first 
determine  the  circulation  Te  in  the  case  where  the  wire  loop  is  very 
tiny,  j.  Then  we  can  break  down  any  big  loop  into  a  grid  of  small 
ones;  we’ve  already  seen  that  when  we  make  this  kind  of  grid,  the 
circulations  add  together.  Although  we’ll  continue  to  talk  about  a 
physical  loop  of  wire,  as  in  figure  i,  the  tiny  loop  can  really  be  just 
like  the  edges  of  an  Amperian  surface:  a  mathematical  construct 
that  doesn’t  necessarily  correspond  to  a  real  object. 

In  the  close-up  view  shown  in  figure  j,  the  field  looks  simpler. 
Just  as  a  tiny  part  of  a  curve  looks  straight,  a  tiny  part  of  this 
magnetic  field  looks  like  the  field  vectors  are  just  getting  shorter  by 
the  same  amount  with  each  step  to  the  right.  Writing  dx  for  the 
width  of  the  loop,  we  therefore  have 

dB 

B(x  +  dx)  —  B(x)  =  — —  dx 
ox 

for  the  difference  in  the  strength  of  the  field  between  the  left  and 
right  sides.  In  the  frame  of  reference  where  the  loop  is  moving,  a 
charge  q  moving  along  with  the  loop  at  velocity  v  will  experience 
a  magnetic  force  F#  =  qvB y.  In  the  frame  moving  along  with 
the  loop,  this  is  interpreted  as  an  electrical  force,  F^  =  qE y.  Ob¬ 
servers  in  the  two  frames  agree  on  how  much  force  there  is,  so  in 
the  loop’s  frame,  we  have  an  electric  field  E  =  vB y.  This  field  is 

10They  can’t  be  blamed  too  much  for  this.  As  a  consequence  of  Faraday’s 
work,  it  soon  became  apparent  that  light  was  an  electromagnetic  wave,  and  to 
reconcile  this  with  the  relative  nature  of  motion  requires  Einstein’s  version  of 
relativity,  with  all  its  subversive  ideas  how  space  and  time  are  not  absolute. 
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perpendicular  to  the  front  and  back  sides  of  the  loop,  BC  and  DA, 
so  there  is  no  contribution  to  the  circulation  along  these  sides,  but 
there  is  a  counterclockwise  contribution  to  the  circulation  on  CD, 
and  smaller  clockwise  one  on  AB.  The  result  is  a  circulation  that  is 
counterclockwise,  and  has  an  absolute  value 


|rs|  =  I E{x)  d y  -  E(x  +  dx)  dy\ 
=  | v[B(x)  —  B(x  +  dx)]|  d y 
dB\ 


dx 
dx  dB 
d  t  dx 
dB 


dx  d  y 


dx  dy 


dt 


d  A. 


Using  a  right-hand  rule,  the  counterclockwise  circulation  is  repre¬ 
sented  by  pointing  one’s  thumb  up,  but  the  vector  dB/dt  is  down. 
This  is  just  a  rephrasing  of  the  geometric  relationship  shown  in  fig¬ 
ure  d  on  page  692.  We  can  represent  the  opposing  directions  using 
a  minus  sign, 


rE 


dB 

~dt 


dA. 


Although  this  derivation  was  carried  out  with  everything  aligned 
in  a  specific  way  along  the  coordinate  axes,  it  turns  out  that  this 
relationship  can  be  generalized  as  a  vector  dot  product, 


r  E 


<9B 

~dt 


•dA. 


Finally,  we  can  take  a  finite-sized  loop  and  break  down  the  cir¬ 
culation  around  its  edges  into  a  grid  of  tiny  loops.  The  circulations 
add,  so  we  have 

„  v-  dB  ,  ,  A 

This  is  known  as  Faraday’s  law.  (I  don’t  recommend  memorizing  all 
these  names.)  Mathematically,  Faraday’s  law  is  very  similar  to  the 
structure  of  Ampere’s  law:  the  circulation  of  a  field  around  the  edges 
of  a  surface  is  equal  to  the  sum  of  something  that  points  through 
the 

If  the  loop  itself  isn’t  moving,  twisting,  or  changing  shape,  then 
the  area  vectors  don’t  change  over  time,  and  we  can  move  the  deriva¬ 
tive  outside  the  sum,  and  rewrite  Faraday’s  law  in  a  slightly  more 
transparent  form: 


rE 


d  \  - 

d&B 

dt 
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<J> 


A  changing  magnetic  flux  makes  a  curly  electric  field.  You  might 
think  based  on  Gauss’  law  for  magnetic  fields  that  would  be 
identically  zero.  However,  Gauss’  law  only  applies  to  surfaces  that 
are  closed,  i.e.,  have  no  edges. 

self-check  H 

Check  that  the  units  in  Faraday’s  law  work  out.  An  easy  way  to  approach 
this  is  to  use  the  fact  that  vB  has  the  same  units  as  E,  which  can  be 
seen  by  comparing  the  equations  for  magnetic  and  electric  forces  used 
above.  >  Answer,  p.  967 

A  pathetic  generator  example  20 

t>  The  horizontal  component  of  the  earth’s  magnetic  field  varies 
from  zero,  at  a  magnetic  pole,  to  about  10~4  T  near  the  equator. 
Since  the  distance  from  the  equator  to  a  pole  is  about  107  m, 
we  can  estimate,  very  roughly,  that  the  horizontal  component  of 
the  earth’s  magnetic  field  typically  varies  by  about  1CT11  T/m  as 
you  go  north  or  south.  Suppose  you  connect  the  terminals  of 
a  one-ohm  lightbulb  to  each  other  with  a  loop  of  wire  having  an 
area  of  1  m2.  Holding  the  loop  so  that  it  lies  in  the  east-west- 
up-down  plane,  you  run  straight  north  at  a  speed  of  10  m/s,  how 
much  current  will  flow?  Next,  repeat  the  same  calculation  for  the 
surface  of  a  neutron  star.  The  magnetic  field  on  a  neutron  star  is 
typically  109  T,  and  the  radius  of  an  average  neutron  star  is  about 
104  m. 


>  Let’s  work  in  the  frame  of  reference  of  the  running  person.  In 
this  frame  of  reference,  the  earth  is  moving,  and  therefore  the 
local  magnetic  field  is  changing  in  strength  by  10~9  T/s.  This  rate 
of  change  is  almost  exactly  the  same  throughout  the  interior  of  the 
loop,  so  we  can  dispense  with  the  summation,  and  simply  write 
Faraday’s  law  as 


Since  what  we  estimated  was  the  rate  of  change  of  the  horizontal 
component,  and  the  vector  A  is  horizontal  (perpendicular  to  the 
loop),  we  can  find  this  dot  product  simply  by  multiplying  the  two 
numbers: 


rE  =  (1(T9T/s)(1  m2) 
=  1(T9  Tm2/s 
=  1(T9  V 


This  is  certainly  not  enough  to  light  the  bulb,  and  would  not  even 
be  easy  to  measure  using  the  most  sensitive  laboratory  instru¬ 
ments. 

Now  what  about  the  neutron  star?  We’ll  pretend  you’re  tough 
enough  that  its  gravity  doesn’t  instantly  crush  you.  The  spatial 
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variation  of  the  magnetic  field  is  on  the  order  of  (109  T/104  m)  = 
1 05  T/m.  If  you  can  run  north  at  the  same  speed  of  1 0  m/s,  then 
in  your  frame  of  reference  there  is  a  temporal  (time)  variation  of 
about  1 06  T/s,  and  a  calculation  similar  to  the  previous  one  results 
in  an  emf  of  106  V!  This  isn’t  just  strong  enough  to  light  the  bulb, 
it’s  sufficient  to  evaporate  it,  and  kill  you  as  well! 

It  might  seem  as  though  having  access  to  a  region  of  rapidly 
changing  magnetic  field  would  therefore  give  us  an  infinite  supply 
of  free  energy.  However,  the  energy  that  lights  the  bulb  is  actually 
coming  from  the  mechanical  work  you  do  by  running  through  the 
field.  A  tremendous  force  would  be  required  to  make  the  wire  loop 
move  through  the  neutron  star’s  field  at  any  significant  speed. 

Speed  and  power  in  a  generator  example  2 1 

>  Figure  k  shows  three  graphs  of  the  magnetic  flux  through  a 
generator’s  coils  as  a  function  of  time.  In  graph  2,  the  generator 
is  being  cranked  at  twice  the  frequency.  In  3,  a  permanent  mag¬ 
net  with  double  the  strength  has  been  used.  In  4,  the  generator 
is  being  cranked  in  the  opposite  direction.  Compare  the  power 
generated  in  figures  2-4  with  the  the  original  case,  1 . 

>  If  the  flux  varies  as  <£>  =  A  sin  u vt,  then  the  time  derivative  occur¬ 
ring  in  Faraday’s  law  is  dO/dt  =  Aw  cos  cut.  The  absolute  value 
of  this  is  the  same  as  the  absolute  value  of  the  emf,  fE.  The 
current  through  the  lightbulb  is  proportional  to  this  emf,  and  the 
power  dissipated  depends  on  the  square  of  the  current  (P  =  l2R), 
SO  P  (X  A2w2.  Figures  2  and  3  both  give  four  times  the  output 
power  (and  require  four  times  the  input  power).  Figure  4  gives 
the  same  result  as  figure  1 ;  we  can  think  of  this  as  a  negative 
amplitude,  which  gives  the  same  result  when  squared. 

An  approximate  loop  rule  example  22 

Figure  1/1  shows  a  simple  RL  circuit  of  the  type  discussed  in  the 
last  chapter.  A  current  has  already  been  established  in  the  coil, 
let’s  say  by  a  battery.  The  battery  was  then  unclipped  from  the 
coil,  and  we  now  see  the  circuit  as  the  magnetic  field  in  and 
around  the  inductor  is  beginning  to  collapse.  I’ve  already  cau¬ 
tioned  you  that  the  loop  rule  doesn’t  apply  in  nonstatic  situations, 
so  we  can’t  assume  that  the  readings  on  the  four  voltmeters  add 
up  to  zero.  The  interesting  thing  is  that  although  they  don’t  add 
up  to  exactly  zero  in  this  circuit,  they  very  nearly  do.  Why  is  the 
loop  rule  even  approximately  valid  in  this  situation? 

The  reason  is  that  the  voltmeters  are  measuring  the  emf  VE  around 
the  path  shown  in  figure  1/2,  and  the  stray  field  of  the  solenoid  is 
extremely  weak  out  there.  In  the  region  where  the  meters  are, 
the  arrows  representing  the  magnetic  field  would  be  too  small  to 
allow  me  to  draw  them  to  scale,  so  I  have  simply  omitted  them. 
Since  the  field  is  so  weak  in  this  region,  the  flux  through  the  loop 
is  nearly  zero,  and  the  rate  of  change  of  the  flux,  dOe/dt,  is  also 
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nearly  zero.  By  Faraday’s  law,  then,  the  emf  around  this  loop  is 
nearly  zero. 

Now  consider  figure  1/3.  The  flux  through  the  interior  of  this  path 
is  not  zero,  because  the  strong  part  of  the  field  passes  through 
it,  and  not  just  once  but  many  times.  To  visualize  this,  imagine 
that  we  make  a  wire  frame  in  this  shape,  dip  it  in  a  tank  of  soapy 
water,  and  pull  it  out,  so  that  there  is  a  soap-bubble  film  span¬ 
ning  its  interior.  Faraday’s  law  refers  to  the  rate  of  change  of  the 
flux  through  a  surface  such  as  this  one.  (The  soap  film  tends 
to  assume  a  certain  special  shape  which  results  in  the  minimum 
possible  surface  area,  but  Faraday’s  law  would  be  true  for  any 
surface  that  filled  in  the  loop.)  In  the  coiled  part  of  the  wire,  the 
soap  makes  a  three-dimensional  screw  shape,  like  the  shape  you 
would  get  if  you  took  the  steps  of  a  spiral  staircase  and  smoothed 
them  into  a  ramp.  The  loop  rule  is  going  to  be  strongly  violated 
for  this  path. 

We  can  interpret  this  as  follows.  Since  the  wire  in  the  solenoid 
has  a  very  low  resistance  compared  to  the  resistances  of  the  light 
bulbs,  we  can  expect  that  the  electric  field  along  the  corkscrew 
part  of  loop  1/3  will  be  very  small.  As  an  electron  passes  through 
the  coil,  the  work  done  on  it  is  therefore  essentially  zero,  and  the 
true  emf  along  the  coil  is  zero.  In  figure  1/1,  the  meter  on  top  is 
therefore  not  telling  us  the  actual  emf  experienced  by  an  electron 
that  passes  through  the  coil.  It  is  telling  us  the  emf  experienced 
by  an  electron  that  passes  through  the  meter  itself,  which  is  a 
different  quantity  entirely.  The  other  three  meters,  however,  really 
do  tell  us  the  emf  through  the  bulbs,  since  there  are  no  magnetic 
fields  where  they  are,  and  therefore  no  funny  induction  effects. 
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11.6  Maxwell’s  Equations 

11.6.1  Induced  magnetic  fields 

We  are  almost,  but  not  quite,  done  figuring  out  the  complete  set 
of  physical  laws,  called  Maxwell’s  equations,  governing  electricity 
and  magnetism.  We  are  only  missing  one  more  term.  For  clarity, 
I’ll  state  Maxwell’s  equations  with  the  missing  part  included,  and 
then  discuss  the  physical  motivation  and  experimental  evidence  for 
sticking  it  in: 


Maxwell’s  equations 

For  any  closed  surface,  the  fluxes  through  the  surface  are 

&E  =  ^kqin  and 

=  0. 

For  any  surface  that  is  not  closed,  the  circulations  around  the  edges 
of  the  surface  are  given  by 


r  E 

c2Tb 


d&s 

~dt~ 


8&e 


and 


The  equation  is  Gauss’  law:  charges  make  diverging  electric 
fields.  The  corresponding  equation  for  tells  us  that  magnetic 
“charges”  (monopoles)  don’t  exist,  so  magnetic  fields  never  diverge. 
The  third  equation  says  that  changing  magnetic  fields  induce  curly 
electric  fields,  whose  curliness  we  can  measure  using  the  emf,  Te, 
around  a  closed  loop.  The  final  equation,  for  T#,  is  the  only  one 
where  anything  new  has  been  added.  Without  the  new  time  deriva¬ 
tive  term,  this  equation  would  simply  be  Ampere’s  law.  (I’ve  chosen 
to  move  the  c2  over  to  the  left  because  it  simplifies  the  writing,  and 
also  because  it  more  clearly  demonstrates  the  analogous  roles  played 
by  charges  and  currents  in  the  &e  and  T B  equations.) 

This  new  d&E/dt  term  says  that  just  as  a  changing  magnetic 
field  can  induce  a  curly  electric  held,  a  changing  electric  held  can 
induce  a  curly  magnetic  held.  Why  should  this  be  so?  The  following 
examples  show  that  Maxwell’s  equations  would  not  make  sense  in 
general  without  it. 

Figure  b  shows  a  mysterious  curly  magnetic  held.  Magnetic  helds 
are  supposed  to  be  made  by  moving  charges,  but  there  don’t  seem 
to  be  any  moving  charges  in  this  landscape.  Where  are  they?  One 
reasonable  guess  would  be  that  they’re  behind  your  head,  where  you 
can’t  see  them.  Suppose  there’s  a  positively  charged  particle  about 
to  hit  you  in  the  back  of  the  head.  This  particle  is  like  a  current 


a /James  Clerk  Maxwell  (1831- 
1879) 


b  /  Where  is  the  moving  charge 
responsible  for  this  magnetic 
field? 
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c  /  An  Amperian  surface  su¬ 
perimposed  on  the  landscape. 


c 


d/An  electron  jumps  through  a 
hoop. 


e  /  An  alternative  Amperian 
surface. 


going  into  the  page.  We’re  used  to  dealing  with  currents  made  by 
many  charged  particles,  but  logically  we  can’t  have  some  minimum 
number  that  would  qualify  as  a  current.  This  is  not  a  static  current, 
however,  because  the  current  at  a  given  point  in  space  is  not  staying 
the  same  over  time.  If  the  particle  is  pointlike,  then  it  takes  zero 
time  to  pass  any  particular  location,  and  the  current  is  then  infinite 
at  that  point  in  space.  A  moment  later,  when  the  particle  is  passing 
by  some  other  location,  there  will  be  an  infinite  current  there,  and 
zero  current  in  the  previous  location.  If  this  single  particle  qualifies 
as  a  current,  then  it  should  be  surrounded  by  a  curly  magnetic  field, 
just  like  any  other  current.11 

This  explanation  is  simple  and  reasonable,  but  how  do  we  know 
it’s  correct?  Well,  it  makes  another  prediction,  which  is  that  the 
positively  charged  particle  should  be  making  an  electric  field  as  well. 
Not  only  that,  but  if  it’s  headed  for  the  back  of  your  head,  then 
it’s  getting  closer  and  closer,  so  the  electric  field  should  be  getting 
stronger  over  time.  But  this  is  exactly  what  Maxwell’s  equations 
require.  There  is  no  current  Ithrough  piercing  the  Amperian  surface 
shown  in  figure  c,  so  Maxwell’s  equation  for  T b  becomes  c2T b  = 
8$e/ dt-  The  only  reason  for  an  electric  field  to  change  is  if  there  are 
charged  particles  making  it,  and  those  charged  particles  are  moving. 
When  charged  particles  are  moving,  they  make  magnetic  fields  as 
well. 

Note  that  the  above  example  is  also  sufficient  to  prove  the  pos¬ 
itive  sign  of  the  d&E/dt  term  in  Maxwell’s  equations,  which  is  dif¬ 
ferent  from  the  negative  sign  of  Faraday’s  —d^s/dt  term. 

The  addition  of  the  d^E/dt  term  has  an  even  deeper  and  more 
important  physical  meaning.  With  the  inclusion  of  this  term,  Max¬ 
well’s  equations  can  describe  correctly  the  way  in  which  disturbances 
in  the  electric  and  magnetic  fields  ripple  outwards  at  the  speed  of 
light.  Indeed,  Maxwell  was  the  first  human  to  understand  that  light 
was  in  fact  an  electromagnetic  wave.  Legend  has  it  that  it  was  on  a 
starry  night  that  he  first  realized  this  implication  of  his  equations. 
He  went  for  a  walk  with  his  wife,  and  told  her  she  was  the  only  other 
person  in  the  world  who  really  knew  what  starlight  was. 

To  see  how  the  d<&E/dt  term  relates  to  electromagnetic  waves, 
let’s  look  at  an  example  where  we  would  get  nonsense  without  it. 
Figure  d  shows  an  electron  that  sits  just  on  one  side  of  an  imagi¬ 
nary  Amperian  surface,  and  then  hops  through  it  at  some  randomly 

11  One  way  to  prove  this  rigorously  is  that  in  a  frame  of  reference  where  the 
particle  is  at  rest,  it  has  an  electric  field  that  surrounds  it  on  all  sides.  If 
the  particle  has  been  moving  with  constant  velocity  for  a  long  time,  then  this 
is  just  an  ordinary  Coulomb’s-law  field,  extending  off  to  very  large  distances, 
since  disturbances  in  the  field  ripple  outward  at  the  speed  of  light.  In  a  frame 
where  the  particle  is  moving,  this  pure  electric  field  is  experienced  instead  as  a 
combination  of  an  electric  field  and  a  magnetic  field,  so  the  magnetic  field  must 
exist  throughout  the  same  vast  region  of  space. 
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chosen  moment.  Unadorned  with  the  d&E/dt  term,  Maxwell’s  equa¬ 
tion  for  T b  reads  as  c2r b  =  47t kIthroughi  which  is  Ampere’s  law.  If 
the  electron  is  a  pointlike  particle,  then  we  have  an  infinite  current 
I through  at  the  moment  when  it  pierces  the  imaginary  surface,  and 
zero  current  at  all  other  times.  An  infinite  magnetic  circulation  T b 
can  only  be  produced  by  an  infinite  magnetic  field,  so  without  the 
d&E/dt  term,  Maxwell’s  equations  predict  nonsense:  the  edge  of  the 
surface  would  experience  an  infinite  magnetic  field  at  one  instant, 
and  zero  magnetic  field  at  all  other  times.  Even  if  the  infinity  didn’t 
upset  us,  it  doesn’t  make  sense  that  anything  special  would  happen 
at  the  moment  the  electron  passed  through  the  surface,  because  the 
surface  is  an  imaginary  mathematical  construct.  We  could  just  as 
well  have  chosen  the  curved  surface  shown  in  figure  e,  which  the 
electron  never  crosses  at  all.  We  are  already  clearly  getting  non¬ 
sensical  results  by  omitting  the  d^E/dt  term,  and  this  shouldn’t 
surprise  us  because  Ampere’s  law  only  applies  to  statics.  More  to 
the  point.  Ampere’s  law  doesn’t  have  time  in  it,  so  it  predicts  that 
this  effect  is  instantaneous.  According  to  Ampere’s  law,  we  could 
send  Morse  code  signals  by  wiggling  the  electron  back  and  forth,  and 
these  signals  would  be  received  at  distant  locations  instantly,  with¬ 
out  any  time  delay  at  all.  This  contradicts  the  theory  of  relativity, 
one  of  whose  predictions  is  that  information  cannot  be  transmitted 
at  speeds  greater  than  the  speed  of  light. 

Discussion  Questions 

A  Induced  magnetic  fields  were  introduced  in  the  text  via  the  imaginary 
landscape  shown  in  figure  b  on  page  701 ,  and  I  argued  that  the  magnetic 
field  could  have  been  produced  by  a  positive  charge  coming  from  behind 
your  head.  This  is  a  specific  assumption  about  the  number  of  charges 
(one),  the  direction  of  motion,  and  the  sign  of  the  charge.  What  are  some 
other  scenarios  that  could  explain  this  field? 

11.6.2  Lightwaves 

We  could  indeed  send  signals  using  this  scheme,  and  the  signals 
would  be  a  form  of  light.  A  radio  transmitting  antenna,  for  instance, 
is  simply  a  device  for  whipping  electrons  back  and  forth  at  mega¬ 
hertz  frequencies.  Radio  waves  are  just  like  visible  light,  but  with  a 
lower  frequency.  With  the  addition  of  the  d^E/dt  term,  Maxwell’s 
equations  are  capable  of  describing  electromagnetic  waves.  It  would 
be  possible  to  use  Maxwell’s  equations  to  calculate  the  pattern  of  the 
electric  and  magnetic  fields  rippling  outward  from  a  single  electron 
that  fidgets  at  irregular  intervals,  but  let’s  pick  a  simpler  example 
to  analyze. 

The  simplest  wave  pattern  is  a  sine  wave  like  the  one  shown 
in  figure  f.  Let’s  assume  a  magnetic  field  of  this  form,  and  see 
what  Maxwell’s  equations  tell  us  about  it.  If  the  wave  is  traveling 
through  empty  space,  then  there  are  no  charges  or  currents  present, 
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f  /  A  magnetic  field  in  the  form  of 
a  sine  wave. 
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For  example,  the  circulation 
around  this  reactangle  is  nonzero 
and  counterclockwise. 
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and  Maxwell’s  equations  become 


i  /  An  impossible  wave  pattern. 
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The  equation  =  0  has  already  been  verified  for  this  type  of 
wave  pattern  in  example  37  on  page  635.  Even  if  you  haven’t  learned 
the  techniques  from  that  section,  it  should  be  visually  plausible  that 
this  field  pattern  doesn’t  diverge  or  converge  on  any  particular  point. 

Geometry  of  the  electric  and  magnetic  fields 

The  equation  c2T B  =  d<hE/dt  tells  us  that  there  can  be  no  such 
thing  as  a  purely  magnetic  wave.  The  wave  pattern  clearly  does  have 
a  nonvanishing  circulation  around  the  edge  of  the  surface  suggested 
in  figure  g,  so  there  must  be  an  electric  flux  through  the  surface. 
This  magnetic  field  pattern  must  be  intertwined  with  an  electric 
field  pattern  that  fills  the  same  space.  There  is  also  no  way  that  the 
two  sides  of  the  equation  could  stay  synchronized  with  each  other 
unless  the  electric  field  pattern  is  also  a  sine  wave,  and  one  that 
has  the  same  wavelength,  frequency,  and  velocity.  Since  the  electric 
field  is  making  a  flux  through  the  indicated  surface,  it’s  plausible 
that  the  electric  field  vectors  lie  in  a  plane  perpendicular  to  that 
of  the  magnetic  field  vectors.  The  resulting  geometry  is  shown  in 
figure  h.  Further  justification  for  this  geometry  is  given  later  in  this 
subsection. 


direction  of  motion  of  wave 


*■ 


h  /  The  geometry  of  an  electromagnetic  wave. 


One  feature  of  figure  h  that  is  easily  justified  is  that  the  electric 
and  magnetic  fields  are  perpendicular  not  only  to  each  other,  but 
also  to  the  direction  of  propagation  of  the  wave.  In  other  words,  the 
vibration  is  sideways,  like  people  in  a  stadium  “doing  the  wave,” 
not  lengthwise,  like  the  accordion  pattern  in  figure  i.  (In  standard 
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wave  terminology,  we  say  that  the  wave  is  transverse,  not  longitudi¬ 
nal.)  The  wave  pattern  in  figure  i  is  impossible,  because  it  diverges 
from  the  middle.  For  virtually  any  choice  of  Gaussian  surface,  the 
magnetic  and  electric  fluxes  would  be  nonzero,  contradicting  the 
equations  =  0  and  <&e  =  0.12 

Reflection  example  23 

The  wave  in  figure  j  hits  a  silvered  mirror.  The  metal  is  a  good 
conductor,  so  it  has  constant  voltage  throughout,  and  the  electric 
field  equals  zero  inside  it:  the  wave  doesn’t  penetrate  and  is  1 00% 
reflected.  If  the  electric  field  is  to  be  zero  at  the  surface  as  well, 
the  reflected  wave  must  have  its  electric  field  inverted  (p.  366),  so 
that  the  incident  and  reflected  fields  cancel  there. 

But  the  magnetic  field  of  the  reflected  wave  is  not  inverted.  This  is 
because  the  reflected  wave  has  to  have  the  correct  right-handed 
relationship  between  the  fields  and  the  direction  of  propagation. 

Polarization 

Two  electromagnetic  waves  traveling  in  the  same  direction  through 
space  can  differ  by  having  their  electric  and  magnetic  fields  in  dif¬ 
ferent  directions,  a  property  of  the  wave  called  its  polarization. 


j  /  Example  23.  The  incident 
and  reflected  waves  are  drawn 
offset  from  each  other  for  clar¬ 
ity,  but  are  actually  on  top  of 
each  other  so  that  their  fields 
superpose. 


The  speed  of  light 

What  is  the  velocity  of  the  waves  described  by  Maxwell’s  equa¬ 
tions?  Maxwell  convinced  himself  that  light  was  an  electromagnetic 
wave  partly  because  his  equations  predicted  waves  moving  at  the  ve¬ 
locity  of  light,  c.  The  only  velocity  that  appears  in  the  equations  is 
c,  so  this  is  fairly  plausible,  although  a  real  calculation  is  required  in 
order  to  prove  that  the  velocity  of  the  waves  isn’t  something  like  2c 
or  c/7 r  —  or  zero,  which  is  also  c  multiplied  by  a  constant!  The  fol¬ 
lowing  discussion,  leading  up  to  a  proof  that  electromagnetic  waves 
travel  at  c,  is  meant  to  be  understandable  even  if  you’re  reading  this 
book  out  of  order,  and  haven’t  yet  learned  much  about  waves.  As 
always  with  proofs  in  this  book,  the  reason  to  read  it  isn’t  to  con¬ 
vince  yourself  that  it’s  true,  but  rather  to  build  your  intuition.  The 
style  will  be  visual.  In  all  the  following  figures,  the  wave  patterns 
are  moving  across  the  page  (let’s  say  to  the  right),  and  it  usually 
doesn’t  matter  whether  you  imagine  them  as  representing  the  wave’s 
magnetic  field  or  its  electric  field,  because  Maxwell’s  equations  in 
a  vacuum  have  the  same  form  for  both  fields.  Whichever  field  we 
imagine  the  figures  as  representing,  the  other  field  is  coming  in  and 
out  of  the  page. 

The  velocity  of  the  waves  is  not  zero.  If  the  wave  pattern  was 


12Even  if  the  fields  can’t  be  parallel  to  the  direction  of  propagation,  one  might 
wonder  whether  they  could  form  some  angle  other  than  90  degrees  with  it.  No. 
One  proof  is  given  on  page  709.  A  alternative  argument,  which  is  simpler  but 
more  esoteric,  is  that  if  there  was  such  a  pattern,  then  there  would  be  some  other 
frame  of  reference  in  which  it  would  look  like  figure  i. 
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standing  still  in  space,  then  the  right  sides  of  the  T  equations  would 
be  zero,  because  there  would  be  no  change  in  the  held  over  time 
at  a  particular  point.  But  the  left  sides  are  not  zero,  so  this  is 
impossible.13 

The  velocity  of  the  waves  is  a  fixed  number  for  a  given  wave  pat¬ 
tern.  Consider  a  typical  sinusoidal  wave  of  visible  light,  with  a  dis¬ 
tance  of  half  a  micrometer  from  one  peak  to  the  next  peak.  Suppose 
this  wave  pattern  provides  a  valid  solution  to  Maxwell’s  equations 
when  it  is  moving  with  a  certain  velocity.  We  then  know,  for  in¬ 
stance,  that  there  cannot  be  a  valid  solution  to  Maxwell’s  equations 
in  which  the  same  wave  pattern  moves  with  double  that  velocity. 
The  time  derivatives  on  the  right  sides  of  Maxwell’s  equations  for 
T e  and  T b  would  be  twice  as  big,  since  an  observer  at  a  certain 
point  in  space  would  see  the  wave  pattern  sweeping  past  at  twice 
the  rate.  But  the  left  sides  would  be  the  same,  so  the  equations 
wouldn’t  equate. 

The  velocity  is  the  same  for  all  wave  patterns.  In  other  words, 
it  isn’t  0.878c  for  one  wave  pattern,  and  1.067c  for  some  other  pat¬ 
tern.  This  is  surprising,  since,  for  example,  water  waves  with  dif¬ 
ferent  shapes  do  travel  at  different  speeds.  Similarly,  even  though 
we  speak  of  “the  speed  of  sound,”  sound  waves  do  travel  at  slightly 
different  speeds  depending  on  their  pitch  and  loudness,  although  the 
differences  are  small  unless  you’re  talking  about  cannon  blasts  or  ex¬ 
tremely  high  frequency  ultrasound.  To  see  how  Maxwell’s  equations 
give  a  consistent  velocity,  consider  figure  k.  Along  the  right  and 
left  edges  of  the  same  Amperian  surface,  the  more  compressed  wave 
pattern  of  blue  light  has  twice  as  strong  a  held,  so  the  circulations 
on  the  left  sides  of  Maxwell’s  equations  are  twice  as  large.14  To 
satisfy  Maxwell’s  equations,  the  time  derivatives  of  the  fields  must 
also  be  twice  as  large  for  the  blue  light.  But  this  is  true  only  if  the 
blue  light’s  wave  pattern  is  moving  to  the  right  at  the  same  speed  as 
the  red  light’s:  if  the  blue  light  pattern  is  sweeping  over  an  observer 
with  a  given  velocity,  then  the  time  between  peaks  is  half  as  much, 
like  the  clicking  of  the  wheels  on  a  train  whose  cars  are  half  the 
length.15 

We  can  also  check  that  bright  and  dim  light,  as  shown  in  figure 
1,  have  the  same  velocity.  If  you  haven’t  yet  learned  much  about 

13  A  young  Einstein  worried  about  what  would  happen  if  you  rode  a  motorcycle 
alongside  a  light  wave,  traveling  at  the  speed  of  light.  Would  the  light  wave  have 
a  zero  velocity  in  this  frame  of  reference?  The  only  solution  lies  in  the  theory  of 
relativity,  one  of  whose  consequences  is  that  a  material  object  like  a  student  or 
a  motorcycle  cannot  move  at  the  speed  of  light. 

14Actually,  this  is  only  exactly  true  of  the  rectangular  strip  is  made  infinitesi¬ 
mally  thin. 

15  You  may  know  already  that  different  colors  of  light  have  different  speeds 
when  they  pass  through  a  material  substance,  such  as  the  glass  or  water.  This 
is  not  in  contradiction  with  what  I’m  saying  here,  since  this  whole  analysis  is  for 
light  in  a  vacuum. 
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waves,  then  this  might  be  surprising.  A  material  object  with  more 
energy  goes  faster,  but  that’s  not  the  case  for  waves.  The  circulation 
around  the  edge  of  the  Amperian  surface  shown  in  the  figure  is  twice 
as  strong  for  the  light  whose  fields  are  doubled  in  strength,  so  the 
left  sides  of  Maxwell’s  T  equations  are  doubled.  The  right  sides  are 
also  doubled,  because  the  derivative  of  twice  a  function  is  twice  the 
derivative  of  the  original  function.  Thus  if  dim  light  moving  with 
a  particular  velocity  is  a  solution,  then  so  is  bright  light,  provided 
that  it  has  the  same  velocity. 

We  can  now  see  that  all  sinusoidal  waves  have  the  same  velocity. 
What  about  nonsinusoidal  waves  like  the  one  in  figure  m?  There 
is  a  mathematical  theorem,  due  to  Fourier,  that  says  any  function 
can  be  made  by  adding  together  sinusoidal  functions.  For  instance, 
3  sin  a;— 7  cos  3x  can  be  made  by  adding  together  the  functions  3  sin  a; 
and  —7  cos  3a:,  but  Fourier  proved  that  this  can  be  done  even  for 
functions,  like  figure  m,  that  aren’t  obviously  built  out  of  sines  and 
cosines  in  the  first  place.  Therefore  our  proof  that  sinusoidal  waves 
all  have  the  same  velocity  is  sufficient  to  demonstrate  that  other 
waves  also  have  this  same  velocity. 

We’re  now  ready  to  prove  that  this  universal  speed  for  all  electro¬ 
magnetic  waves  is  indeed  c.  Since  we’ve  already  convinced  ourselves 
that  all  such  waves  travel  at  the  same  speed,  it’s  sufficient  to  find 
the  velocity  of  one  wave  in  particular.  Let’s  pick  the  wave  whose 
fields  have  magnitudes 

E  =  E  sin(x  +  vt)  and 

B  =  B  sin(x  +  vt), 

which  is  about  as  simple  as  we  can  get.  The  peak  electric  field 
of  this  wave  has  a  strength  E,  and  the  peak  magnetic  field  is  B. 
The  sine  functions  go  through  one  complete  cycle  as  x  increases  by 
2tt  =  6.28 .. .,  so  the  distance  from  one  peak  of  this  wave  to  the 
next  —  its  wavelength  —  is  6.28. . .  meters.  This  means  that  it  is 
not  a  wave  of  visible  light  but  rather  a  radio  wave  (its  wavelength 
is  on  the  same  order  of  magnitude  as  the  size  of  a  radio  antenna). 
That’s  OK.  What  was  glorious  about  Maxwell’s  work  was  that  it 
unified  the  whole  electromagnetic  spectrum.  Light  is  simple.  Radio 
waves  aren’t  fundamentally  any  different  than  light  waves,  x-rays, 
or  gamma  rays.  16 

The  justification  for  putting  x  +  vt  inside  the  sine  functions  is  as 
follows.  As  the  wave  travels  through  space,  the  whole  pattern  just 
shifts  over.  The  fields  are  zero  at  x  =  0,  t  =  0,  since  the  sine  of 
zero  is  zero.  This  zero-point  of  the  wave  pattern  shifts  over  as  time 
goes  by;  at  any  time  t  its  location  is  given  by  x  +  vt  =  0.  After  one 

16What  makes  them  appear  to  be  unrelated  phenomena  is  that  we  experience 
them  through  their  interaction  with  atoms,  and  atoms  are  complicated,  so  they 
respond  to  various  kinds  of  electromagnetic  waves  in  complicated  ways. 
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n/The  magnetic  field  of  the 
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page. 


second,  the  zero-point  is  located  at  x  =  —(1  s)v.  The  distance  it 
travels  in  one  second  is  therefore  numerically  equal  to  v,  and  this 
is  exactly  the  concept  of  velocity:  how  far  something  goes  per  unit 
time. 

The  wave  has  to  satisfy  Maxwell’s  equations  for  T e  and  T b 
regardless  of  what  Amperian  surfaces  we  pick,  and  by  applying  them 
to  any  surface,  we  could  determine  the  speed  of  the  wave.  The 
surface  shown  in  figure  n  turns  out  to  result  in  an  easy  calculation: 
a  narrow  strip  of  width  2 £  and  height  h,  coinciding  with  the  position 
of  the  zero-point  of  the  field  at  t  =  0. 

Now  let’s  apply  the  equation  c2T b  =  d&E/dt  at  t  =  0.  Since 
the  strip  is  narrow,  we  can  approximate  the  magnetic  field  using 
sinx  ~  x,  which  is  valid  for  small  x.  The  magnetic  field  on  the  right 
edge  of  the  strip,  at  x  =  £,  is  then  Bi,  so  the  right  edge  of  the  strip 
contributes  Bih  to  the  circulation.  The  left  edge  contributes  the 
same  amount,  so  the  left  side  of  Maxwell’s  equation  is 


c2Tb  =  c 2  •  2 Bih. 


The  other  side  of  the  equation  is 


8&e 

~df 


where  we  can  dispense  with  the  usual  sum  because  the  strip  is  nar¬ 
row  and  there  is  no  variation  in  the  field  as  we  go  up  and  down 
the  strip.  The  derivative  equals  vE  cos(x  +  vt),  and  evaluating  the 
cosine  at  x  =  0,  t  =  0  gives 

^  =  2  WEih 
dt 


Maxwell’s  equation  for  T b  therefore  results  in 

2  c2Blh  =  2  Eihv 
c2B  =  vE. 


An  application  of  T^  =  —d^s/dt  gives  a  similar  result,  except 
that  there  is  no  factor  of  c2 


E  =  vB. 


(The  minus  sign  simply  represents  the  right-handed  relationship  of 
the  fields  relative  to  their  direction  of  propagation.) 
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Multiplying  these  last  two  equations  by  each  other,  we  get 

c2BE  =  v2EB 

2  2 
C  =  V 

V  =  ±c. 

This  is  the  desired  result.  (The  plus  or  minus  sign  shows  that  the 
wave  can  travel  in  either  direction.) 

As  a  byproduct  of  this  calculation,  we  can  find  the  relationship 
between  the  strengths  of  the  electric  and  magnetic  fields  in  an  elec¬ 
tromagnetic  wave.  If,  instead  of  multiplying  the  equations  c2B  =  vE 
and  E  =  vB,  we  divide  them,  we  can  easily  show  that  E  =  cB. 
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o  /  The  electromagnetic  spec¬ 
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Figure  o  shows  the  complete  spectrum  of  light  waves.  The  wave¬ 
length  A  (number  of  meters  per  cycle)  and  frequency  /  (number  of 
cycles  per  second)  are  related  by  the  equation  c  =  /A.  Maxwell’s 
equations  predict  that  all  light  waves  have  the  same  structure,  re¬ 
gardless  of  wavelength  and  frequency,  so  even  though  radio  and  x- 
rays,  for  example,  hadn’t  been  discovered,  Maxwell  predicted  that 
such  waves  would  have  to  exist.  Maxwell’s  1865  prediction  passed  an 
important  test  in  1888,  when  Heinrich  Hertz  published  the  results 
of  experiments  in  which  he  showed  that  radio  waves  could  be  ma¬ 
nipulated  in  the  same  ways  as  visible  light  waves.  Hertz  showed,  for 
example,  that  radio  waves  could  be  reflected  from  a  flat  surface,  and 
that  the  directions  of  the  reflected  and  incoming  waves  were  related 
in  the  same  way  as  with  light  waves,  forming  equal  angles  with  the 
surface.  Likewise,  light  waves  can  be  focused  with  a  curved,  dish- 
shaped  mirror,  and  Hertz  demonstrated  the  same  thing  with  radio 
waves  using  a  metal  dish. 

Momentum  of  light  waves 

A  light  wave  consists  of  electric  and  magnetic  fields,  and  fields 
contain  energy.  Thus  a  light  wave  carries  energy  with  it  when  it  trav¬ 
els  from  one  place  to  another.  If  a  material  object  has  kinetic  energy 
and  moves  from  one  place  to  another,  it  must  also  have  momentum, 
so  it  is  logical  to  ask  whether  light  waves  have  momentum  as  well. 
It  can  be  proved  based  on  relativity17  that  it  does,  and  that  the 

17See  problem  11  on  p.  446,  or  example  21  on  p.  424. 


Section  1 1 .6  Maxwell’s  Equations 


709 


momentum  and  energy  are  related  by  the  equation  U  =  p/c,  where 
p  is  the  magnitude  of  the  momentum  vector,  and  U  =  Ue  +  Um  is 
the  sum  of  the  energy  of  the  electric  and  magnetic  fields.  We  can 
now  demonstrate  this  without  explicitly  referring  to  relativity,  and 
connect  it  to  the  specific  structure  of  a  light  wave. 

The  energy  density  of  a  light  wave  is  related  to  the  magnitudes 
of  the  fields  in  a  specific  way  -  it  depends  on  the  squares  of  their 
magnitudes,  E2  and  B2,  which  are  the  same  as  the  dot  products 
E  •  E  and  B  B.  We  argued  on  page  587  that  since  energy  is  a 
scalar,  the  only  possible  expressions  for  the  energy  densities  of  the 
fields  are  dot  products  like  these,  multiplied  by  some  constants.  This 
is  because  the  dot  product  is  the  only  mathematically  sensible  way 
of  multiplying  two  vectors  to  get  a  scalar  result.  (Any  other  way 
violates  the  symmetry  of  space  itself.) 

How  does  this  relate  to  momentum?  Well,  we  know  that  if  we 
double  the  strengths  of  the  fields  in  a  light  beam,  it  will  have  four 
times  the  energy,  because  the  energy  depends  on  the  square  of  the 
fields.  But  we  then  know  that  this  quadruple-energy  light  beam 
must  have  quadruple  the  momentum  as  well.  If  there  wasn’t  this 
kind  of  consistency  between  the  momentum  and  the  energy,  then  we 
could  violate  conservation  of  momentum  by  combining  light  beams 
or  splitting  them  up.  We  therefore  know  that  the  momentum  den¬ 
sity  of  a  light  beam  must  depend  on  a  field  multiplied  by  a  field. 
Momentum,  however,  is  a  vector,  and  there  is  only  one  physically 
meaningful  way  of  multiplying  two  vectors  to  get  a  vector  result, 
which  is  the  cross  product  (see  page  952).  The  momentum  density 
can  therefore  only  depend  on  the  cross  products  E  x  E,  B  x  B,  and 
E  x  B.  But  the  first  two  of  these  are  zero,  since  the  cross  product 
vanishes  when  there  is  a  zero  angle  between  the  vectors.  Thus  the 
momentum  per  unit  volume  must  equal  E  X  B  multiplied  by  some 
constant, 

dp  =  (constant) E  x  B  du 

This  predicts  something  specific  about  the  direction  of  propagation 
of  a  light  wave:  it  must  be  along  the  line  perpendicular  to  the  electric 
and  magnetic  fields.  We’ve  already  seen  that  this  is  correct,  and  also 
that  the  electric  and  magnetic  fields  are  perpendicular  to  each  other. 
Therefore  this  cross  product  has  a  magnitude 

|E  x  B|  =  | E|  | B |  sin  90° 

=  |E||B| 


c 


where  in  the  last  step  the  relation  |E|  =  c|Bj  has  been  used. 

We  now  only  need  to  find  one  physical  example  in  order  to  fix  the 
constant  of  proportionality.  Indeed,  if  we  didn’t  know  relativity,  it 
would  be  possible  to  believe  that  the  constant  of  proportionality  was 
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zero!  The  simplest  example  of  which  I  know  is  as  follows.  Suppose  a 
piece  of  wire  of  length  t  is  bathed  in  electromagnetic  waves  coming 
in  sideways,  and  let’s  say  for  convenience  that  this  is  a  radio  wave, 
with  a  wavelength  that  is  large  compared  to  £,  so  that  the  fields 
don’t  change  significantly  across  the  length  of  the  wire.  Let’s  say  the 
electric  field  of  the  wave  happens  to  be  aligned  with  the  wire.  Then 
there  is  an  emf  between  the  ends  of  the  wire  which  equals  El,  and 
since  the  wire  is  small  compared  to  the  wavelength,  we  can  pretend 
that  the  field  is  uniform,  not  curly,  in  which  case  voltage  is  a  well- 
defined  concept,  and  this  is  equivalent  to  a  voltage  difference  Ay  = 
El  between  the  ends  of  the  wire.  The  wire  obeys  Ohm’s  law,  and 
a  current  flows  in  response  to  the  wave.18  Equating  the  expressions 
d U /  d t  and  IAV  for  the  power  dissipated  by  ohmic  heating,  we  have 

dU  =  I  El  dt 

for  the  energy  the  wave  transfers  to  the  wire  in  a  time  interval  dt. 

Note  that  although  some  electrons  have  been  set  in  motion  in  the 
wire,  we  haven’t  yet  seen  any  momentum  transfer,  since  the  protons 
are  experiencing  the  same  amount  of  electric  force  in  the  opposite 
direction.  However,  the  electromagnetic  wave  also  has  a  magnetic 
field,  and  a  magnetic  held  transfers  momentum  to  (exerts  a  force 
on)  a  current.  This  is  only  a  force  on  the  electrons,  because  they’re 
what  make  the  current.  The  magnitude  of  this  force  equals  II B 
(homework  problem  6),  and  using  the  definition  of  force,  dp/di,  we 
find  for  the  magnitude  of  the  momentum  transferred: 

dp  =  UBdt 

We  now  know  both  the  amount  of  energy  and  the  amount  of 
momentum  that  the  wave  has  lost  by  interacting  with  the  wire. 
Dividing  these  two  equations,  we  find 

dp  B 

dU^  E 
_  1 

c 

which  is  what  we  expected  based  on  relativity.  This  can  now  be 
restated  in  the  form  dp  =  (constant)E  x  B  du  (homework  problem 
40). 

Note  that  although  the  equations  p  =  U/c  and  dp  =  (constant) E  x 
B  du  are  consistent  with  each  other  for  a  sine  wave,  they  are  not 
consistent  with  each  other  in  general.  The  relativistic  argument 
leading  up  to  p  =  U/c  assumed  that  we  were  only  talking  about 

18This  current  will  soon  come  to  a  grinding  halt,  because  we  don’t  have  a 
complete  circuit,  but  let’s  say  we’re  talking  about  the  first  picosecond  during 
which  the  radio  wave  encounters  the  wire.  This  is  why  real  radio  antennas  are 
not  very  short  compared  to  a  wavelength! 
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p/A  classical  calculation  of 
the  momentum  of  a  light  wave. 
An  antenna  of  length  l  is  bathed 
in  an  electromagnetic  wave. 
The  black  arrows  represent  the 
electric  field,  the  white  circles  the 
magnetic  field  coming  out  of  the 
page.  The  wave  is  traveling  to  the 
right. 


q/A  simplified  drawing  of 
the  1903  experiment  by  Nichols 
and  Hull  that  verified  the  pre¬ 
dicted  momentum  of  light  waves. 
Two  circular  mirrors  were  hung 
from  a  fine  quartz  fiber,  inside 
an  evacuated  bell  jar.  A  150 
mW  beam  of  light  was  shone 
on  one  of  the  mirrors  for  6  s, 
producing  a  tiny  rotation,  which 
was  measurable  by  an  optical 
lever  (not  shown).  The  force  was 
within  0.6%  of  the  theoretically 
predicted  value  (problem  1 1  on 
p.  446)  of  0.001  pN.  For  compar¬ 
ison,  a  short  clipping  of  a  single 
human  hair  weighs  ~  1  pN. 
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a  single  thing  traveling  in  a  single  direction,  whereas  no  such  as¬ 
sumption  was  made  in  arguing  for  the  E  X  B  form.  For  instance,  if 
two  light  beams  of  equal  strength  are  traveling  through  one  another, 
going  in  opposite  directions,  their  total  momentum  is  zero,  which  is 
consistent  with  the  E  X  B  form,  but  not  with  U/c. 

Some  examples  were  given  in  chapter  3  of  situations  where  it 
actually  matters  that  light  has  momentum.  Figure  q  shows  the  first 
confirmation  of  this  fact  in  the  laboratory. 

Angular  momentum  of  light  waves 

For  completeness,  we  note  that  since  light  carries  momentum,  it 
must  also  be  possible  for  it  to  have  angular  momentum.  If  you’ve 
studied  chemistry,  here’s  an  example  of  why  this  can  be  important. 
You  know  that  electrons  in  atoms  can  exist  in  states  labeled  s,  p, 
d,  f,  and  so  on.  What  you  might  not  have  realized  is  that  these 
are  angular  momentum  labels.  The  s  state,  for  example,  has  zero 
angular  momentum.  If  light  didn’t  have  angular  momentum,  then, 
for  example,  it  wouldn’t  be  possible  for  a  hydrogen  atom  in  a  p  state 
to  change  to  the  lower-energy  s  state  by  emitting  light.  Conservation 
of  angular  momentum  requires  that  the  light  wave  carry  away  all 
the  angular  momentum  originally  possessed  by  the  electron  in  the 
p  state,  since  in  the  s  state  it  has  none. 

Discussion  Questions 

A  Positive  charges  1  and  2  are  moving  as  shown.  What  electric  and 
magnetic  forces  do  they  exert  on  each  other?  What  does  this  imply  for 
conservation  of  momentum? 

B  1.  The  figure  shows  a  line  of  charges  moving  to  the  right,  creating 
a  current  /.  An  Amperian  surface  in  the  form  of  a  disk  has  been  superim¬ 
posed.  Use  Maxwell’s  equations  to  find  the  field  B  at  point  P. 

2.  A  tiny  gap  is  chopped  out  of  the  line  of  charge.  What  happens  when 
this  gap  is  directly  underneath  the  point  P? 

C  The  diagram  shows  an  electric  field  pattern  frozen  at  one  moment  in 
time.  Let’s  imagine  that  it’s  the  electric  part  of  an  electromagnetic  wave. 
Consider  four  possible  directions  in  which  it  could  be  propagating:  left, 
right,  up,  and  down.  Determine  whether  each  of  these  is  consistent  with 
Maxwell’s  equations.  If  so,  infer  the  direction  of  the  magnetic  field. 

D  Discuss  what  happens  to  the  wave  pattern  shown  in  the  diagram  if 
we  switch  to  a  frame  of  reference  moving  along  with  the  wave. 
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11.7  Electromagnetic  Properties  of  Materials 

Different  types  of  matter  have  a  variety  of  useful  electrical  and  mag¬ 
netic  properties.  Some  are  conductors,  and  some  are  insulators. 
Some,  like  iron  and  nickel,  can  be  magnetized,  while  others  have 
useful  electrical  properties,  e.g.,  dielectrics,  discussed  qualitatively 
in  the  discussion  question  on  page  597,  which  allow  us  to  make 
capacitors  with  much  higher  values  of  capacitance  than  would  oth¬ 
erwise  be  possible.  We  need  to  organize  our  knowledge  about  the 
properties  that  materials  can  possess,  and  see  whether  this  knowl¬ 
edge  allows  us  to  calculate  anything  useful  with  Maxwell’s  equations. 

11.7.1  Conductors 

A  perfect  conductor,  such  as  a  superconductor,  has  no  DC  elec¬ 
trical  resistance.  It  is  not  possible  to  have  a  static  electric  field 
inside  it,  because  then  charges  would  move  in  response  to  that  field, 
and  the  motion  of  the  charges  would  tend  to  reduce  the  held,  con¬ 
trary  to  the  assumption  that  the  held  was  static.  Things  are  a  little 
different  at  the  surface  of  a  perfect  conductor  than  on  the  interior. 
We  expect  that  any  net  charges  that  exist  on  the  conductor  will 
spread  out  under  the  influence  of  their  mutual  repulsion,  and  settle 
on  the  surface.  As  we  saw  in  chapter  10,  Gauss’s  law  requires  that 
the  helds  on  the  two  sides  of  a  sheet  of  charge  have  |Ej_ j  —  Ej_j2| 
proportional  to  the  surface  charge  density,  and  since  the  held  inside 
the  conductor  is  zero,  we  infer  that  there  can  be  a  held  on  or  im¬ 
mediately  outside  the  conductor,  with  a  nonvanishing  component 
perpendicular  to  the  surface.  The  component  of  the  held  parallel 
to  the  surface  must  vanish,  however,  since  otherwise  it  would  cause 
the  charges  to  move  along  the  surface. 

On  a  hot  summer  day,  the  reason  the  sun  feels  warm  on  your 
skin  is  that  the  oscillating  helds  of  the  light  waves  excite  currents 
in  your  skin,  and  these  currents  dissipate  energy  by  ohmic  heating. 
In  a  perfect  conductor,  however,  this  could  never  happen,  because 
there  is  no  such  thing  as  ohmic  heating.  Since  electric  helds  can’t 
penetrate  a  perfect  conductor,  we  also  know  that  an  electromag¬ 
netic  wave  can  never  pass  into  one.  By  conservation  of  energy,  we 
know  that  the  wave  can’t  just  vanish,  and  if  the  energy  can’t  be 
dissipated  as  heat,  then  the  only  remaining  possibility  is  that  all  of 
the  wave’s  energy  is  reflected.  This  is  why  metals,  which  are  good 
electrical  conductors,  are  also  highly  rehective.  They  are  not  perfect 
electrical  conductors,  however,  so  they  are  not  perfectly  rehective. 
The  wave  enters  the  conductor,  but  immediately  excites  oscillating 
currents,  and  these  oscillating  currents  dissipate  the  energy  both  by 
ohmic  heating  and  by  reradiating  the  reflected  wave.  Since  the  parts 
of  Maxwell’s  equations  describing  radiation  have  time  derivatives  in 
them,  the  efficiency  of  this  reradiation  process  depends  strongly  on 
frequency.  When  the  frequency  is  high  and  the  material  is  a  good 
conductor,  reflection  predominates,  and  is  so  efficient  that  the  wave 
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a  /  A  capacitor  with  a  dielec¬ 
tric  between  the  plates. 


only  penetrates  to  a  very  small  depth,  called  the  skin  depth.  In 
the  limit  of  poor  conduction  and  low  frequencies,  absorption  pre¬ 
dominates,  and  the  skin  depth  becomes  much  greater.  In  a  high- 
frequency  AC  circuit,  the  skin  depth  in  a  copper  wire  is  very  small, 
and  therefore  the  signals  in  such  a  circuit  are  propagated  entirely  at 
the  surfaces  of  the  wires.  In  the  limit  of  low  frequencies,  i.e. ,  DC, 
the  skin  depth  approaches  infinity,  so  currents  are  carried  uniformly 
over  the  wires’  cross-sections. 

We  can  quantify  how  well  a  particular  material  conducts  elec¬ 
tricity.  We  know  that  the  resistance  of  a  wire  is  proportional  to  its 
length,  and  inversely  proportional  to  its  cross-sectional  area.  The 
constant  of  proportionality  is  1  /a,  where  a  (not  the  same  a  as  the 
surface  charge  density)  is  called  the  electrical  conductivity.  Exposed 
to  an  electric  field  E,  a  conductor  responds  with  a  current  per  unit 
cross-sectional  area  J  =  a E.  The  skin  depth  is  proportional  to 
I/VTct,  where  /  is  the  frequency  of  the  wave. 

11.7.2  Dielectrics 

A  material  with  a  very  low  conductivity  is  an  insulator.  Such 
materials  are  usually  composed  of  atoms  or  molecules  whose  elec¬ 
trons  are  strongly  bound  to  them;  since  the  atoms  or  molecules 
have  zero  total  charge,  their  motion  cannot  create  an  electric  cur¬ 
rent.  But  even  though  they  have  zero  charge,  they  may  not  have 
zero  dipole  moment.  Imagine  such  a  substance  filling  in  the  space 
between  the  plates  of  a  capacitor,  as  in  figure  a.  For  simplicity, 
we  assume  that  the  molecules  are  oriented  randomly  at  first,  a/1, 
and  then  become  completely  aligned  when  a  field  is  applied,  a/2. 
The  effect  has  been  to  take  all  of  the  negatively  charged  black  ands 
of  the  molecules  and  shift  them  upward,  and  the  opposite  for  the 
positively  charged  white  ends.  Where  the  black  and  white  charges 
overlap,  there  is  still  zero  net  charge,  but  we  have  a  strip  of  nega¬ 
tive  charge  at  the  top,  and  a  strip  of  positive  charge  at  the  bottom, 
a/3.  The  effect  has  been  to  cancel  out  part  of  the  charge  that  was 
deposited  on  the  plates  of  the  capacitor.  Now  this  is  very  subtle,  be¬ 
cause  Maxwell’s  equations  treat  these  charges  on  an  equal  basis,  but 
in  terms  of  practical  measurements,  they  are  completely  different. 
The  charge  on  the  plates  can  be  measured  be  inserting  an  ammeter 
in  the  circuit,  and  integrating  the  current  over  time.  But  the  charges 
in  the  layers  at  the  top  and  bottom  of  the  dielectric  never  flowed 
through  any  wires,  and  cannot  be  detected  by  an  ammeter.  In  other 
words,  the  total  charge,  q,  appearing  in  Maxwell’s  equartions  is  ac- 
tualy  q  =  (/free  —  (/bound,  where  (/free  is  the  charge  that  moves  freely 
through  wires,  and  can  be  detected  in  an  ammeter,  while  qbound  is 
the  charge  bound  onto  the  individual  molecules,  which  can’t.  We 
will,  however,  detect  the  presence  of  the  bound  charges  via  their 
electric  fields.  Since  their  electric  fields  partially  cancel  the  fields  of 
the  free  charges,  a  voltmeter  will  register  a  smaller  than  expected 
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voltage  difference  between  the  plates.  If  we  measure  q{ree/V,  we  have 
a  result  that  is  larger  than  the  capacitance  we  would  have  expected. 

Although  the  relationship  E  -H-  q  between  electric  fields  and  their 
sources  is  unalterably  locked  in  by  Gauss’s  law,  that’s  not  what  we 
see  in  practical  measurements.  In  this  example,  we  can  measure  the 
voltage  difference  between  the  plates  of  the  capacitor  and  divide 
by  the  distance  between  them  to  find  E,  and  then  integrate  an 
ammeter  reading  to  find  qa-ee,  and  we  will  find  that  Gauss’s  law 
appears  not  to  hold.  We  have  E  -B-  (?free/ (constant),  where  the 
constant  fudge  factor  is  greater  than  one.  This  constant  is  a  property 
of  the  dielectric  material,  and  tells  us  how  many  dipoles  there  are, 
how  strong  they  are,  and  how  easily  they  can  be  reoriented.  The 
conventional  notation  is  to  incorporate  this  fudge  factor  into  Gauss’s 
law  by  defining  an  altered  version  of  the  electric  field, 

D  =  cE, 


and  to  rewrite  Gauss’s  law  as 

=  Qin,  free- 

The  constant  e  is  a  property  of  the  material,  known  as  its  permittiv¬ 
ity.  In  a  vacuum,  e  takes  on  a  value  known  as  eQ,  defined  as  1/(47 xk). 
In  a  dielectric,  e  is  greater  than  eD.  When  a  dielectric  is  present  be¬ 
tween  the  plates  of  a  capacitor,  its  capacitance  is  proportional  to  e 
(problem  38).  The  following  table  gives  some  sample  values  of  the 
permittivities  of  a  few  substances. 

substance  e/eQ  at  zero  frequency 

vacuum  1 

air  1.00054 

water  80 

barium  titanate  1250 

A  capacitor  with  a  very  high  capacitance  is  potentially  a  superior 
replacement  for  a  battery,  but  until  the  1990’s  this  was  impracti¬ 
cal  because  capacitors  with  high  enough  values  couldn’t  be  made, 
even  with  dielectrics  having  the  largest  known  permittivities.  Such 
supercapactors,  some  with  values  in  the  kilofarad  range,  are  now 
available.  Most  of  them  do  not  use  dielectric  at  all;  the  very  high 
capacitance  values  are  instead  obtained  by  using  electrodes  that  are 
not  parallel  metal  plates  at  all,  but  exotic  materials  such  as  aerogels, 
which  allows  the  spacing  between  the  “electrodes”  to  be  very  small. 

Although  figure  a/2  shows  the  dipoles  in  the  dielectric  being 
completely  aligned,  this  is  not  a  situation  commonly  encountered 
in  practice.  In  such  a  situation,  the  material  would  be  as  polarized 
as  it  could  possibly  be,  and  if  the  held  was  increased  further,  it 
would  not  respond.  In  reality,  a  capacitor,  for  example,  would  nor¬ 
mally  be  operated  with  fields  that  produced  quite  a  small  amount 
of  alignment,  and  it  would  be  under  these  conditions  that  the  linear 


b/A  stud  finder  is  used  to 
locate  the  wooden  beams,  or 
studs,  that  form  the  frame  behind 
the  wallboard.  It  is  a  capacitor 
whose  capacitance  changes 
when  it  is  brought  close  to  a 
substance  with  a  particular 
permittivity.  Although  the  wall 
is  external  to  the  capacitor,  a 
change  in  capacitance  is  still 
observed,  because  the  capacitor 
has  “fringing  fields”  that  extend 
outside  the  region  between  its 
plates. 
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c  /  The  magnetic  version  of 
figure  a.  A  magnetically  perme¬ 
able  material  is  placed  at  the 
center  of  a  solenoid. 


relationship  D  =  eE  would  actually  be  a  good  approximation.  Be¬ 
fore  a  material’s  maximum  polarization  is  reached,  it  may  actually 
spark  or  burn  up. 

self-check  I 

Suppose  a  parallel-plate  capacitor  is  built  so  that  a  slab  of  dielectric 
material  can  be  slid  in  or  out.  (This  is  similar  to  the  way  the  stud  finder 
in  figure  b  works.)  We  insert  the  dielectric,  hook  the  capacitor  up  to 
a  battery  to  charge  it,  and  then  use  an  ammeter  and  a  voltmeter  to 
observe  what  happens  when  the  dielectric  is  withdrawn.  Predict  the 
changes  observed  on  the  meters,  and  correlate  them  with  the  expected 
change  in  capacitance.  Discuss  the  energy  transformations  involved, 
and  determine  whether  positive  or  negative  work  is  done  in  removing 
the  dielectric.  >  Answer,  p.  967 

11.7.3  Magnetic  materials 

Magnetic  permeability 

Atoms  and  molecules  may  have  magnetic  dipole  moments  as 
well  as  electric  dipole  moments.  Just  as  an  electric  dipole  contains 
bound  charges,  a  magnetic  dipole  has  bound  currents,  which  come 
from  the  motion  of  the  electrons  as  they  orbit  the  nucleus,  c/1.  Such 
a  substance,  subjected  to  a  magnetic  field,  tends  to  align  itself,  c/2, 
so  that  a  sheet  of  current  circulates  around  the  externally  applied 
field.  Figure  c/3  is  closely  analogous  to  figure  a/3;  in  the  central 
gray  area,  the  atomic  currents  cancel  out,  but  the  atoms  at  the 
outer  surface  form  a  sheet  of  bound  current.  However,  whereas  like 
charges  repel  and  opposite  charges  attract,  it  works  the  other  way 
around  for  currents:  currents  in  the  same  direction  attract,  and 
currents  in  opposite  directions  repel.  Therefore  the  bound  currents 
in  a  material  inserted  inside  a  solenoid  tend  to  reinforce  the  free 
currents,  and  the  result  is  to  strengthen  the  held.  The  total  current 
is  I  =  /free  +  /bound ,  and  we  define  an  altered  version  of  the  magnetic 
held, 


and  rewrite  Ampere’s  law  as 


1//  —  /through,  free' 


d  /  Example  24:  a  cutaway 

view  of  a  solenoid. 


The  constant  p  is  the  permeability,  with  a  vacuum  value  of  pQ  = 
47 rk/c2.  Here  are  the  magnetic  permeabilities  of  some  substances: 


substance  A/Ao 

vacuum  1 

aluminum  1.00002 

steel  700 

transformer  iron  4,000 
mu-metal  20,000 
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An  iron-core  electromagnet  example  24 

>  A  solenoid  has  1000  turns  of  wire  wound  along  a  cylindrical 
core  with  a  length  of  10  cm.  If  a  current  of  1.0  A  is  used,  find  the 
magnetic  field  inside  the  solenoid  if  the  core  is  air,  and  if  the  core 
is  made  of  iron  with  =  4, 000. 

o  Air  has  essentially  the  same  permability  as  vacuum,  so  using 
the  result  of  example  13  on  page  680,  we  find  that  the  field  is 
0.013  T. 

We  now  consider  the  case  where  the  core  is  filled  with  iron.  The 
original  derivation  in  example  1 3  started  from  Ampere’s  law,  which 
we  now  rewrite  as  rH  =  /through,  tree-  As  argued  previously,  the  only 
significant  contributions  to  the  circulation  come  from  line  segment 
AB.  This  segment  lies  inside  the  iron,  where  H  =  B/p.  The  H 
field  is  the  same  as  in  the  air-core  case,  since  the  new  form  of 
Ampere’s  law  only  relates  H  to  the  current  in  the  wires  (the  free 
current).  This  means  that  B  =  jllH  is  greater  by  a  factor  of  4,000 
than  in  the  air-core  case,  or  52  T.  This  is  an  extremely  intense 
field  —  so  intense,  in  fact,  that  the  iron’s  magnetic  polarization 
would  probably  become  saturated  before  we  could  actually  get 
the  field  that  high. 

The  electromagnet  of  example  24  could  also  be  used  as  an  in¬ 
ductor,  and  its  inductance  would  be  proportional  to  the  permittivity 
of  the  core.  This  makes  it  possible  to  construct  high-value  induc¬ 
tors  that  are  relatively  compact.  Permeable  cores  are  also  used  in 
transformers. 

A  transformer  or  inductor  with  a  permeable  core  does  have 
some  disadvantages,  however,  in  certain  applications.  The  oscil¬ 
lating  magnetic  field  induces  an  electric  field,  and  because  the  core 
is  typically  a  metal,  these  currents  dissipate  energy  strongly  as  heat. 
This  behaves  like  a  fairly  large  resistance  in  series  with  the  coil.  Fig¬ 
ure  f  shows  a  method  for  reducing  this  effect.  The  iron  core  of  this 
transformer  has  been  constructed  out  of  laminated  layers,  which  has 
the  effect  of  blocking  the  conduction  of  the  eddy  currents. 

A  ferrite  bead  example  25 

Cables  designed  to  carry  audio  signals  are  typically  made  with 
two  adjacent  conductors,  such  that  the  current  flowing  out  through 
one  conductor  comes  back  through  the  other  one.  Computer  ca¬ 
bles  are  similar,  but  usually  have  several  such  pairs  bundled  in¬ 
side  the  insulator.  This  paired  arrangement  is  known  as  differen¬ 
tial  mode,  and  has  the  advantage  of  cutting  down  on  the  reception 
and  transmission  of  interference.  In  terms  of  transmission,  the 
magnetic  field  created  by  the  outgoing  current  is  almost  exactly 
canceled  by  the  field  from  the  return  current,  so  electromagnetic 
waves  are  only  weakly  induced.  In  reception,  both  conductors  are 
bathed  in  the  same  electric  and  magnetic  fields,  so  an  emf  that 
adds  current  on  one  side  subtracts  current  from  the  other  side, 


e  /  Example  24:  without  the 
iron  core,  the  field  is  so  weak  that 
it  barely  deflects  the  compass. 
With  it,  the  deflection  is  nearly 
90°. 


f  /  A  transformer  with  a  lami¬ 
nated  iron  core.  The  input  and 
output  coils  are  inside  the  paper 
wrapper.  The  iron  core  is  the 
black  part  that  passes  through 
the  coils  at  the  center,  and 
also  wraps  around  them  on  the 
outside. 


g  /  Example  25:  ferrite  beads. 
The  top  panel  shows  a  clip-on 
type,  while  the  bottom  shows  one 
built  into  a  cable. 
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resulting  in  cancellation. 

The  opposite  of  differential  mode  is  called  common  mode.  In 
common  mode,  all  conductors  have  currents  flowing  in  the  same 
direction.  Even  when  a  circuit  is  designed  to  operate  in  differential 
mode,  it  may  not  have  exactly  equal  currents  in  the  two  conduc¬ 
tors  with  /-i  +  /2  =  0,  meaning  that  current  is  leaking  off  to  ground  at 
one  end  of  the  circuit  or  the  other.  Although  paired  cables  are  rel¬ 
atively  immune  to  differential-mode  interference,  they  do  not  have 
any  automatic  protection  from  common-mode  interference. 

Figure  g  shows  a  device  for  reducing  common-mode  interference 
called  a  ferrite  bead,  which  surrounds  the  cable  like  a  bead  on 
a  string.  Ferrite  is  a  magnetically  permeable  alloy.  In  this  appli¬ 
cation,  the  ohmic  properties  of  the  ferrite  actually  turn  put  to  be 
advantageous. 

Let’s  consider  common-mode  transmission  of  interference.  The 
bare  cable  has  some  DC  resistance,  but  is  also  surrounded  by  a 
magnetic  field,  so  it  has  inductance  as  well.  This  means  that  it 
behaves  like  a  series  L-R  circuit,  with  an  impedance  that  varies 
as  R  +  iwL,  where  both  R  and  L  are  very  small.  When  we  add  the 
ferrite  bead,  the  inductance  is  increased  by  orders  of  magnitude, 
but  so  is  the  resistance.  Neither  R  nor  L  is  actually  constant  with 
respect  to  frequency,  but  both  are  much  greater  than  for  the  bare 
cable. 

Suppose,  for  example,  that  a  signal  is  being  transmitted  from  a 
digital  camera  to  a  computer  via  a  USB  cable.  The  camera  has 
an  internal  impedance  that  is  on  the  order  of  10  O,  the  computer’s 
input  also  has  a  ~  10  O  impedance,  and  in  differential  mode  the 
ferrite  bead  has  no  effect,  so  the  cable’s  impedance  has  its  low, 
designed  value  (probably  also  about  10  n,  for  good  impedance 
matching).  The  signal  is  transmitted  unattenuated  from  the  cam¬ 
era  to  the  computer,  and  there  is  almost  no  radiation  from  the 
cable. 

But  in  reality  there  will  be  a  certain  amount  of  common-mode 
current  as  well.  With  respect  to  common  mode,  the  ferrite  bead 
has  a  large  impedance,  with  the  exact  value  depending  on  fre¬ 
quency,  but  typically  on  the  order  of  100  O  for  frequencies  in 
the  MHz  range.  We  now  have  a  series  circuit  consisting  of  three 
impedances:  10,  100,  and  10  O.  For  a  given  emf  applied  by  an 
external  radio  wave,  the  current  induced  in  the  circuit  has  been 
attenuated  by  an  order  of  magnitude,  relative  to  its  value  without 
the  ferrite  bead. 

Why  is  the  ferrite  is  necessary  at  all?  Why  not  just  insert  ordinary 
air-core  inductors  in  the  circuit?  We  could,  for  example,  have  two 
solenoidal  coils,  one  in  the  outgoing  line  and  one  in  the  return  line, 
interwound  with  one  another  with  their  windings  oriented  so  that 
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their  differential-mode  fields  would  cancel.  There  are  two  good 
reasons  to  prefer  the  ferrite  bead  design.  One  is  that  it  allows 
a  clip-on  device  like  the  one  in  the  top  panel  of  figure  g,  which 
can  be  added  without  breaking  the  circuit.  The  other  is  that  our 
circuit  will  inevitably  have  some  stray  capacitance,  and  will  there¬ 
fore  act  like  an  LRC  circuit,  with  a  resonance  at  some  frequency. 
At  frequencies  close  to  the  resonant  frequency,  the  circuit  would 
absorb  and  transmit  common-mode  interference  very  strongly, 
which  is  exactly  the  opposite  of  the  effect  we  were  hoping  to 
produce.  The  resonance  peak  could  be  made  low  and  broad  by 
adding  resistance  in  series,  but  this  extra  resistance  would  atten¬ 
uate  the  differential-mode  signals  as  well  as  the  common-mode 
ones.  The  ferrite’s  resistance,  however,  is  actually  a  purely  mag¬ 
netic  effect,  so  it  vanishes  in  differential  mode. 

Surprisingly,  some  materials  have  magnetic  permeabilities  less 
than  //Q.  This  cannot  be  accounted  for  in  the  model  above,  and 
although  there  are  semiclassical  arguments  that  can  explain  it  to 
some  extent,  it  is  fundamentally  a  quantum  mechanical  effect.  Ma¬ 
terials  with  fi>  fi0  are  called  paramagnetic,  while  those  with  fj,  <  /iQ 
are  referred  to  as  diamagnetic.  Diamagnetism  is  generally  a  much 
weaker  effect  than  paramagnetism,  and  is  easily  masked  if  there  is 
any  trace  of  contamination  from  a  paramagnetic  material.  Diamag¬ 
netic  materials  have  the  interesting  property  that  they  are  repelled 
from  regions  of  strong  magnetic  field,  and  it  is  therefore  possible  to 
levitate  a  diamagnetic  object  above  a  magnet,  as  in  figure  h. 

A  complete  statement  of  Maxwell’s  equations  in  the  presence  of 
electric  and  magnetic  materials  is  as  follows: 


h  /  A  frog  is  levitated  dia- 
magnetically  by  the  nonuniform 
field  inside  a  powerful  magnet. 
Evidently  frog  has  g  <  go- 


=  Qiree 


&B  =  0 


rE 


Th 


d  t 
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dt 


T  Ifree 


Comparison  with  the  vacuum  case  shows  that  the  speed  of  an 
electromagnetic  wave  moving  through  a  substance  described  by  per¬ 
mittivity  and  permeability  e  and  /j  is  1/^/eJi.  For  most  substances, 
[i  Ei  /r0,  and  e  is  highly  frequency-dependent. 

Suppose  we  have  a  boundary  between  two  substances.  By  con¬ 
structing  a  Gaussian  or  Amperian  surface  that  extends  across  the 
boundary,  we  can  arrive  at  various  constraints  on  how  the  fields 
must  behave  as  me  move  from  one  substance  into  the  other,  when 
there  are  no  free  currents  or  charges  present,  and  the  fields  are 
static.  An  interesting  example  is  the  application  of  Faraday’s  law, 
Th  =  0,  to  the  case  where  one  medium  —  let’s  say  it’s  air  —  has 


jj± 


i  /  At  a  boundary  between 
two  substances  with  p2  >  m ,  the 
H  field  has  a  continuous  com¬ 
ponent  parallel  to  the  surface, 
which  implies  a  discontinuity  in 
the  parallel  component  of  the 
magnetic  field  B. 
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a  low  permeability,  while  the  other  one  has  a  very  high  one.  We 
will  violate  Faraday’s  law  unless  the  component  of  the  H  field  par¬ 
allel  to  the  boundary  is  a  continuous  function,  Hyp  =  H||j2-  This 
means  that  if  n/n 0  is  very  high,  the  component  of  B  =  //H  par¬ 
allel  to  the  surface  will  have  an  abrupt  discontinuity,  being  much 
stronger  inside  the  high-permeability  material.  The  result  is  that 
when  a  magnetic  field  enters  a  high-permeability  material,  it  tends 
to  twist  abruptly  to  one  side,  and  the  pattern  of  the  field  tends  to 
be  channeled  through  the  material  like  water  through  a  funnel.  In 
a  transformer,  a  permeable  core  functions  to  channel  more  of  the 
magnetic  flux  from  the  input  coil  to  the  output  coil.  Figure  j  shows 
another  example,  in  which  the  effect  is  to  shield  the  interior  of  the 
sphere  from  the  externally  imposed  field.  Special  high-permeability 
alloys,  with  trade  names  like  Mu-Metal,  are  sold  for  this  purpose. 
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j/A  hollow  sphere  with  p/go  =  10,  is  immersed  in  a  uniform,  externally  imposed  magnetic  field.  The 
interior  of  the  sphere  is  shielded  from  the  field.  The  arrows  map  the  magnetic  field  B.  (See  homework  problem 
47,  page  734.) 


Ferromagnetism 

The  very  last  magnetic  phenomenon  we’ll  discuss  is  probably  the 
very  first  experience  you  ever  had  of  magnetism.  Ferromagnetism  is 
a  phenomenon  in  which  a  material  tends  to  organize  itself  so  that  it 
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has  a  nonvanishing  magnetic  field.  It  is  exhibited  strongly  by  iron 
and  nickel,  which  explains  the  origin  of  the  name. 


Figure  k/1  is  a  simple  one-dimensional  model  of  ferromagnetism. 
Each  magnetic  compass  needle  represents  an  atom.  The  compasses 
in  the  chain  are  stable  when  aligned  with  one  another,  because  each 
one’s  north  end  is  attracted  to  its  neighbor’s  south  end.  The  chain 
can  be  turned  around,  k/2,  without  disrupting  its  organization,  and 
the  compasses  do  not  realign  themselves  with  the  Earth’s  field,  be¬ 
cause  their  torques  on  one  another  are  stronger  than  the  Earth’s 
torques  on  them.  The  system  has  a  memory.  For  example,  if  I  want 
to  remind  myself  that  my  friend’s  address  is  137  Coupling  Ct.,  I  can 
align  the  chain  at  an  angle  of  137  degrees.  The  model  fails,  how¬ 
ever,  as  an  explanation  of  real  ferromagnetism,  because  in  two  or 
more  dimensions,  the  most  stable  arrangement  of  a  set  of  interacting 
magnetic  dipoles  is  something  more  like  k/3,  in  which  alternating 
rows  point  in  opposite  directions.  In  this  two-dimensional  pattern, 
every  compass  is  aligned  in  the  most  stable  way  with  all  four  of  its 
neighbors.  This  shows  that  ferromagnetism,  like  diamagnetism,  has 
no  purely  classical  explanation;  a  full  explanation  requires  quantum 
mechanics. 

Because  ferromagnetic  substances  “remember”  the  history  of 
how  they  were  prepared,  they  are  commonly  used  to  store  infor¬ 
mation  in  computers.  Figure  1  shows  16  bits  from  an  ancient  (ca. 
1970)  4-kilobyte  random-access  memory,  in  which  each  doughnut¬ 
shaped  iron  “core”  can  be  magnetized  in  one  of  two  possible  direc¬ 
tions,  so  that  it  stores  one  bit  of  information.  Today,  RAM  is  made 
of  transistors  rather  than  magnetic  cores,  but  a  remnant  of  the  old 
technology  remains  in  the  term  “core  dump,”  meaning  “memory 
dump,”  as  in  “my  girlfriend  gave  me  a  total  core  dump  about  her 
mom’s  divorce.”  Most  computer  hard  drives  today  do  store  their  in¬ 
formation  on  rotating  magnetic  platters,  but  the  platter  technology 
may  be  obsoleted  by  flash  memory  in  the  near  future. 

The  memory  property  of  ferromagnets  can  be  depicted  on  the 
type  of  graph  shown  in  figure  m,  known  as  a  hysteresis  curve.  The 
y  axis  is  the  magnetization  of  a  sample  of  the  material  —  a  mea¬ 
sure  of  the  extent  to  which  its  atomic  dipoles  are  aligned  with  one 


k  /  A  model  of  ferromagnetism. 


I  /  Magnetic  core  memory. 


m  /  A  hysteresis  curve. 
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n  /  A  fluxgate  compass. 


another.  If  the  sample  is  initially  unmagnetized,  1,  and  a  field  H 
is  externally  applied,  the  magnetization  increases,  2,  but  eventu¬ 
ally  becomes  saturated,  3,  so  that  higher  fields  do  not  result  in  any 
further  magnetization,  4.  The  external  field  can  then  be  reduced, 
5,  and  even  eliminated  completely,  but  the  material  will  retain  its 
magnetization.  It  is  a  permanent  magnet.  To  eliminate  its  magneti¬ 
zation  completely,  a  substantial  field  must  be  applied  in  the  opposite 
direction.  If  this  reversed  held  is  made  stronger,  then  the  substance 
will  eventually  become  magnetized  just  as  strongly  in  the  opposite 
direction.  Since  the  hysteresis  curve  is  nonlinear,  and  is  not  a  func¬ 
tion  (it  has  more  than  one  value  of  M  for  a  particular  value  of  B),  a 
ferromagnetic  material  does  not  have  a  single,  well-defined  value  of 
the  permeability  jit;  a  value  like  4,000  for  transformer  iron  represents 
some  kind  of  a  rough  average. 

The  fluxgate  compass  example  26 

The  fluxgate  compass  is  a  type  of  magnetic  compass  without 
moving  parts,  commonly  used  on  ships  and  aircraft.  An  AC  cur¬ 
rent  is  applied  in  a  coil  wound  around  a  ferromagnetic  core,  driv¬ 
ing  the  core  repeatedly  around  a  hysteresis  loop.  Because  the 
hysteresis  curve  is  highly  nonlinear,  the  addition  of  an  external 
field  such  as  the  Earth’s  alters  the  core’s  behavior.  Suppose,  for 
example,  that  the  axis  of  the  coil  is  aligned  with  the  magnetic 
north-south.  The  core  will  reach  saturation  more  quickly  when 
the  coil’s  field  is  in  the  same  direction  as  the  Earth’s,  but  will  not 
saturate  as  early  in  the  next  half-cycle,  when  the  two  fields  are 
in  opposite  directions.  With  the  use  of  multiple  coils,  the  com¬ 
ponents  of  the  Earth’s  field  can  be  measured  along  two  or  three 
axes,  permitting  the  compass’s  orientation  to  be  determined  in 
two  or  (for  aircraft)  three  dimensions. 

Sharp  magnet  poles  example  27 

Although  a  ferromagnetic  material  does  not  really  have  a  single 
value  of  the  magnetic  permeability,  there  is  still  a  strong  tendency 
to  have  By  «  0  just  outside  the  magnet’s  surface,  for  the  same 
reasons  as  discussed  above  for  high-permeability  substances  in 
general.  For  example,  if  we  have  a  cylindrical  bar  magnet  about 
the  size  and  shape  of  your  finger,  magnetized  lengthwise,  then 
the  field  near  the  ends  is  nearly  perpendicular  to  the  surfaces, 
while  the  field  near  the  sides,  although  it  may  be  oriented  nearly 
parallel  to  the  surface,  is  very  weak,  so  that  we  still  have  By  « 
0.  This  is  in  close  analogy  to  the  situation  for  the  electric  field 
near  the  surface  of  a  conductor  in  equilibrium,  for  which  Ey  =  0. 
This  analogy  is  close  enough  so  that  we  can  recycle  much  of  our 
knowledge  about  electrostatics. 

For  example,  we  saw  in  example  9,  p.  530,  and  problem  37, 
p.  556,  that  charge  tends  to  collect  on  the  most  highly  curved 
portions  of  a  conductor,  and  therefore  becomes  especially  dense 
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near  a  corner  or  knife-edge.  This  gives  us  a  way  of  making  es¬ 
pecially  intense  magnetic  fields.  Most  people  would  imagine  that 
a  very  intense  field  could  be  made  simply  by  using  a  very  large 
and  bulky  permanent  magnet,  but  this  doesn’t  actually  work  very 
well,  because  magnetic  dipole  fields  fall  off  as  1  /r3,  so  that  at  a 
point  near  the  surface,  nearly  all  the  field  is  contributed  by  atoms 
near  the  surface.  Our  analogy  with  electrostatics  suggests  that 
we  should  instead  construct  a  permanent  magnet  with  a  sharp 
edge. 

Figure  o  shows  the  cross-sectional  shapes  of  two  magnet  poles 
used  in  the  historic  Stern-Gerlach  experiment  (sec.  1 3.5.1 ,  p.  905). 
The  external  magnetic  field  is  represented  using  field  lines.  The 
field  lines  enter  and  exit  the  surfaces  perpendicularly,  and  they 
are  particularly  dense  near  the  corner  of  the  upper  pole,  indicat¬ 
ing  a  strong  field.  The  spreading  of  the  field  lines  indicates  that 
the  field  is  strongly  nonuniform,  becoming  much  weaker  toward 
the  bottom  of  the  gap  between  the  poles.  This  strong  nonunifor¬ 
mity  was  crucial  for  the  experiment,  in  which  the  magnets  were 
used  as  part  of  a  dipole  spectrometer.  See  example  6  and  figure 
o  on  p.  573  for  an  explanation  of  an  electric  version  of  such  a 
spectrometer. 

This  chapter  is  summarized  on  page  1003.  Notation  and  terminology 

are  tabulated  on  pages  984-985. 
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The  symbols  \/,  ^ ,  etc.  are  explained  on  page  737. 

1  A  particle  with  a  charge  of  1.0  C  and  a  mass  of  1.0  kg  is 

observed  moving  past  point  P  with  a  velocity  (1.0  m/s)x.  The 
electric  field  at  point  P  is  (1.0  V/m)y,  and  the  magnetic  field  is 
(2.0  T)y.  Find  the  force  experienced  by  the  particle.  V 

2  For  a  positively  charged  particle  moving  through  a  magnetic 
field,  the  directions  of  the  v,  B,  and  F  vectors  are  related  by  a 
right-hand  rule: 

v  along  the  fingers,  with  the  hand  flat 
B  along  the  fingers,  with  the  knuckles  bent 
F  along  the  thumb 

Make  a  three-dimensional  model  of  the  three  vectors  using  pencils 
or  rolled-up  pieces  of  paper  to  represent  the  vectors  assembled  with 
their  tails  together.  Make  all  three  vectors  perpendicular  to  each 
other.  Now  write  down  every  possible  way  in  which  the  rule  could  be 
rewritten  by  scrambling  up  the  three  symbols  v,  B,  and  F.  Referring 
to  your  model,  which  are  correct  and  which  are  incorrect? 

3  A  charged  particle  is  released  from  rest.  We  see  it  start  to 
move,  and  as  it  gets  going,  we  notice  that  its  path  starts  to  curve. 
Can  we  tell  whether  this  region  of  space  has  E  /  0,  or  B  /  0,  or 
both?  Assume  that  no  other  forces  are  present  besides  the  possible 
electrical  and  magnetic  ones,  and  that  the  fields,  if  they  are  present, 
are  uniform. 

4  A  charged  particle  is  in  a  region  of  space  in  which  there  is  a 
uniform  magnetic  field  B  =  B z.  There  is  no  electric  field,  and  no 
other  forces  act  on  the  particle.  In  each  case,  describe  the  future 
motion  of  the  particle,  given  its  initial  velocity. 

(a)  vQ  =  0 

(b)  vQ  =  (1  m/s)z 

(c)  vG  =  (1  m/s)y 

5  (a)  A  line  charge,  with  charge  per  unit  length  A,  moves  at 
velocity  v  along  its  own  length.  How  much  charge  passes  a  given 
point  in  time  df?  What  is  the  resulting  current?  t>  Answer,  p.  970 
(b)  Show  that  the  units  of  your  answer  in  part  a  work  out  correctly. 

Remark:  This  constitutes  a  physical  model  of  an  electric  current,  and  it  would 
be  a  physically  realistic  model  of  a  beam  of  particles  moving  in  a  vacuum,  such 
as  the  electron  beam  in  a  television  tube.  It  is  not  a  physically  realistic  model 
of  the  motion  of  the  electrons  in  a  current-carrying  wire,  or  of  the  ions  in  your 
nervous  system;  the  motion  of  the  charge  carriers  in  these  systems  is  much  more 
complicated  and  chaotic,  and  there  are  charges  of  both  signs,  so  that  the  total 
charge  is  zero.  But  even  when  the  model  is  physically  unrealistic,  it  still  gives  the 
right  answers  when  you  use  it  to  compute  magnetic  effects.  This  is  a  remarkable 
fact,  which  we  will  not  prove.  The  interested  reader  is  referred  to  E.M.  Purcell, 
Electricity  and  Magnetism ,  McGraw  Hill,  1963. 
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6  Two  parallel  wires  of  length  L  carry  currents  I\  and  I2 ■  They 
are  separated  by  a  distance  R,  and  we  assume  R  is  much  less  than 
L,  so  that  our  results  for  long,  straight  wires  are  accurate.  The  goal 
of  this  problem  is  to  compute  the  magnetic  forces  acting  between 
the  wires. 

(a)  Neither  wire  can  make  a  force  on  itself.  Therefore,  our  first  step 

in  computing  wire  l’s  force  on  wire  2  is  to  find  the  magnetic  field 
made  only  by  wire  1,  in  the  space  occupied  by  wire  2.  Express  this 
field  in  terms  of  the  given  quantities.  v 

(b)  Let’s  model  the  current  in  wire  2  by  pretending  that  there  is 

a  line  charge  inside  it,  possessing  density  per  unit  length  A2  and 
moving  at  velocity  v2.  Relate  A2  and  V2  to  the  current  I2 ,  using  the 
result  of  problem  5a.  Now  find  the  magnetic  force  wire  1  makes  on 
wire  2,  in  terms  of  I\,  I2,  L,  and  R.  t>  Answer,  p.  970 

(c)  Show  that  the  units  of  the  answer  to  part  b  work  out  to  be 

newtons.  - 

7  Suppose  a  charged  particle  is  moving  through  a  region  of 
space  in  which  there  is  an  electric  field  perpendicular  to  its  velocity 
vector,  and  also  a  magnetic  field  perpendicular  to  both  the  particle’s 
velocity  vector  and  the  electric  field.  Show  that  there  will  be  one 
particular  velocity  at  which  the  particle  can  be  moving  that  results 
in  a  total  force  of  zero  on  it.  Relate  this  velocity  to  the  magnitudes 
of  the  electric  and  magnetic  fields.  (Such  an  arrangement,  called  a 
velocity  filter,  is  one  way  of  determining  the  speed  of  an  unknown 
particle.) 

8  The  following  data  give  the  results  of  two  experiments  in  which 
charged  particles  were  released  from  the  same  point  in  space,  and 
the  forces  on  them  were  measured: 

qi  =  1  /jC  ,  V!  =  (1  m/s)x  ,  Fi  =  (-1  nrN)y 
q2  =  -2  fiC  ,  v2  =  (-1  m/s)x ,  F2  =  (-2  mN)y 

The  data  are  insufficient  to  determine  the  magnetic  field  vector; 
demonstrate  this  by  giving  two  different  magnetic  field  vectors,  both 
of  which  are  consistent  with  the  data.  ^ 

9  The  following  data  give  the  results  of  two  experiments  in  which 
charged  particles  were  released  from  the  same  point  in  space,  and 
the  forces  on  them  were  measured: 

qi  =  1  nC  ,  vi  =  (1  m/s)z  ,  Fi  =  (5  pN)x  +  (2  pN)y 
q2  =  1  nC  ,  v2  =  (3  m/s)z  ,  F2  =  (10  pN)x  +  (4  pN)y 

Is  there  a  nonzero  electric  field  at  this  point?  A  nonzero  magnetic 
field?  ■ 

10  This  problem  is  a  continuation  of  problem  6.  Note  that  the 
answer  to  problem  6b  is  given  on  page  970. 

(a)  Interchanging  the  l’s  and  2’s  in  the  answer  to  problem  6b,  what 
is  the  magnitude  of  the  magnetic  force  from  wire  2  acting  on  wire 
1?  Is  this  consistent  with  Newton’s  third  law? 


Problems 


725 


(b)  Suppose  the  currents  are  in  the  same  direction.  Make  a  sketch, 
and  use  the  right-hand  rule  to  determine  whether  wire  1  pulls  wire 
2  towards  it,  or  pushes  it  away. 

(c)  Apply  the  right-hand  rule  again  to  find  the  direction  of  wire  2’s 
force  on  wire  1.  Does  this  agree  with  Newton’s  third  law? 

(d)  What  would  happen  if  wire  l’s  current  was  in  the  opposite  di¬ 
rection  compared  to  wire  2’s?  ■ 

11  (a)  In  the  photo  of  the  vacuum  tube  apparatus  in  figure  o  on 
page  662,  infer  the  direction  of  the  magnetic  field  from  the  motion 
of  the  electron  beam.  (The  answer  is  given  in  the  answer  to  the 
self-check  on  that  page.) 

(b)  Based  on  your  answer  to  part  a,  find  the  direction  of  the  currents 
in  the  coils. 

(c)  What  direction  are  the  electrons  in  the  coils  going? 

(d)  Are  the  currents  in  the  coils  repelling  the  currents  consisting  of 
the  beam  inside  the  tube,  or  attracting  them?  Check  your  answer 
by  comparing  with  the  result  of  problem  10. 

12  A  charged  particle  of  mass  m  and  charge  q  moves  in  a  circle 
due  to  a  uniform  magnetic  field  of  magnitude  B,  which  points  per¬ 
pendicular  to  the  plane  of  the  circle. 

(a)  Assume  the  particle  is  positively  charged.  Make  a  sketch  show¬ 
ing  the  direction  of  motion  and  the  direction  of  the  field,  and  show 
that  the  resulting  force  is  in  the  right  direction  to  produce  circular 
motion. 

(b)  Find  the  radius,  r,  of  the  circle,  in  terms  of  m,  q ,  v,  and  B.  V 

(c)  Show  that  your  result  from  part  b  has  the  right  units. 

(d)  Discuss  all  four  variables  occurring  on  the  right-hand  side  of  your 
answer  from  part  b.  Do  they  make  sense?  For  instance,  what  should 
happen  to  the  radius  when  the  magnetic  field  is  made  stronger? 
Does  your  equation  behave  this  way? 

(e)  Restate  your  result  so  that  it  gives  the  particle’s  angular  fre¬ 

quency,  w,  in  terms  of  the  other  variables,  and  show  that  v  drops 
out.  V 

Remark:  A  charged  particle  can  be  accelerated  in  a  circular  device  called  a 
cyclotron,  in  which  a  magnetic  field  is  what  keeps  them  from  going  off  straight. 
This  frequency  is  therefore  known  as  the  cyclotron  frequency.  The  particles  are 
accelerated  by  other  forces  (electric  forces),  which  are  AC.  As  long  as  the  electric 
field  is  operated  at  the  correct  cyclotron  frequency  for  the  type  of  particles  being 
manipulated,  it  will  stay  in  sync  with  the  particles,  giving  them  a  shove  in  the 
right  direction  each  time  they  pass  by.  The  particles  are  speeding  up,  so  this 
only  works  because  the  cyclotron  frequency  is  independent  of  velocity. 

13  Each  figure  represents  the  motion  of  a  positively  charged 
particle.  The  dots  give  the  particles’  positions  at  equal  time  inter¬ 
vals.  In  each  case,  determine  whether  the  motion  was  caused  by 
an  electric  force,  a  magnetic  force,  or  a  frictional  force,  and  explain 
your  reasoning.  If  possible,  determine  the  direction  of  the  magnetic 
or  electric  field.  All  fields  are  uniform.  In  (a),  the  particle  stops  for 
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an  instant  at  the  upper  right,  but  then  comes  back  down  and  to  the 
left,  retracing  the  same  dots.  In  (b),  it  stops  on  the  upper  right  and 
stays  there.  - 


Problem  13. 

14  One  model  of  the  hydrogen  atom  has  the  electron  circling 
around  the  proton  at  a  speed  of  2.2  X  106  m/s,  in  an  orbit  with  a 
radius  of  0.05  nm.  (Although  the  electron  and  proton  really  orbit 
around  their  common  center  of  mass,  the  center  of  mass  is  very  close 
to  the  proton,  since  it  is  2000  times  more  massive.  For  this  problem, 
assume  the  proton  is  stationary.) 

(a)  Treat  the  circling  electron  as  a  current  loop,  and  calculate  the 
current. 

(b)  Estimate  the  magnetic  field  created  at  the  center  of  the  atom 

by  the  electron.  V 

(c)  Does  the  proton  experience  a  nonzero  force  from  the  electron’s 
magnetic  held?  Explain. 

(d)  Does  the  electron  experience  a  magnetic  held  from  the  proton? 
Explain. 

(e)  Does  the  electron  experience  a  magnetic  held  created  by  its  own 
current?  Explain. 

(f)  Is  there  an  electric  force  acting  between  the  proton  and  electron? 

If  so,  calculate  it.  V 

(g)  Is  there  a  gravitational  force  acting  between  the  proton  and  elec¬ 
tron?  If  so,  calculate  it. 

(h)  An  inward  force  is  required  to  keep  the  electron  in  its  orbit  - 
otherwise  it  would  obey  Newton’s  hrst  law  and  go  straight,  leaving 
the  atom.  Based  on  your  answers  to  the  previous  parts,  which  force 
or  forces  (electric,  magnetic  and  gravitational)  contributes  signifi¬ 
cantly  to  this  inward  force?  (Based  on  a  problem  by  Arnold  Arons.) 

■ 

15  The  equation  Bz  =  f3kIA/c2r 3  was  found  on  page  672  for  the 
distant  held  of  a  dipole.  Show,  as  asserted  there,  that  the  constant 
f3  must  be  unitless. 

16  The  following  data  give  the  results  of  three  experiments  in 
which  charged  particles  were  released  from  the  same  point  in  space, 
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and  the  forces  on  them  were  measured: 

q\  =  1  C  ,  vi  =  0  ,  Fi  =  (1  N)y 

qi  =  1  C  ,  v2  =  (1  m/s)x  ,  F2  =  (1  N)y 

®  =  1C,  v3  =  (1  m/s)z  ,  F3  =  0 

Determine  the  electric  and  magnetic  fields.  V  - 

17  If  you  put  four  times  more  current  through  a  solenoid,  how 
many  times  more  energy  is  stored  in  its  magnetic  field?  V 

18  A  Helmholtz  coil  is  defined  as  a  pair  of  identical  circular 
coils  lying  in  parallel  planes  and  separated  by  a  distance,  h,  equal 
to  their  radius,  b.  (Each  coil  may  have  more  than  one  turn  of  wire.) 
Current  circulates  in  the  same  direction  in  each  coil,  so  the  fields 
tend  to  reinforce  each  other  in  the  interior  region.  This  configuration 
has  the  advantage  of  being  fairly  open,  so  that  other  apparatus  can 
be  easily  placed  inside  and  subjected  to  the  field  while  remaining 
visible  from  the  outside.  The  choice  of  h  =  b  results  in  the  most 
uniform  possible  field  near  the  center.  A  photograph  of  a  Helmholtz 
coil  was  shown  in  figure  o  on  page  662. 

(a)  Find  the  percentage  drop  in  the  field  at  the  center  of  one  coil, 
compared  to  the  full  strength  at  the  center  of  the  whole  apparatus. 

V 

(b)  What  value  of  h  (not  equal  to  b)  would  make  this  percentage 

difference  equal  to  zero?  V  ^ 

19  The  figure  shows  a  nested  pair  of  circular  wire  loops  used 
to  create  magnetic  fields.  (The  twisting  of  the  leads  is  a  practical 
trick  for  reducing  the  magnetic  fields  they  contribute,  so  the  fields 
are  very  nearly  what  we  would  expect  for  an  ideal  circular  current 
loop.)  The  coordinate  system  below  is  to  make  it  easier  to  discuss 
directions  in  space.  One  loop  is  in  the  y  —  z  plane,  the  other  in  the 
x  —  y  plane.  Each  of  the  loops  has  a  radius  of  1.0  cm,  and  carries 
1.0  A  in  the  direction  indicated  by  the  arrow. 

(a)  Calculate  the  magnetic  field  that  would  be  produced  by  one  such 

loop,  at  its  center.  V 

(b)  Describe  the  direction  of  the  magnetic  field  that  would  be  pro¬ 
duced,  at  its  center,  by  the  loop  in  the  x  —  y  plane  alone. 

(c)  Do  the  same  for  the  other  loop. 

(d)  Calculate  the  magnitude  of  the  magnetic  field  produced  by  the 

two  loops  in  combination,  at  their  common  center.  Describe  its 
direction.  V  ^ 


Problem  19. 
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20  Four  long  wires  are  arranged,  as  shown,  so  that  their  cross- 

section  forms  a  square,  with  connections  at  the  ends  so  that  current 
flows  through  all  four  before  exiting.  Note  that  the  current  is  to  the 
right  in  the  two  back  wires,  but  to  the  left  in  the  front  wires.  If  the 
dimensions  of  the  cross-sectional  square  (height  and  front-to-back) 
are  b,  find  the  magnetic  field  (magnitude  and  direction)  along  the 
long  central  axis.  v  ■ 

21  In  problem  16,  the  three  experiments  gave  enough  information 

to  determine  both  fields.  Is  it  possible  to  design  a  procedure  so  that, 
using  only  two  such  experiments,  we  can  always  find  E  and  B?  If 
so,  design  it.  If  not,  why  not?  ■ 

22  Use  the  Biot-Savart  law  to  derive  the  magnetic  field  of  a  long, 

straight  wire,  and  show  that  this  reproduces  the  result  of  example 
6  on  page  664.  ^ 

23  (a)  Modify  the  calculation  on  page  669  to  determine  the 

component  of  the  magnetic  field  of  a  sheet  of  charge  that  is  perpen¬ 
dicular  to  the  sheet.  V 

(b)  Show  that  your  answer  has  the  right  units. 

(c)  Show  that  your  answer  approaches  zero  as  z  approaches  infinity. 

(d)  What  happens  to  your  answer  in  the  case  of  a  =  bl  Explain 

why  this  makes  sense.  81 

24  Consider  two  solenoids,  one  of  which  is  smaller  so  that  it 

can  be  put  inside  the  other.  Assume  they  are  long  enough  so  that 
each  one  only  contributes  significantly  to  the  field  inside  itself,  and 
the  interior  fields  are  nearly  uniform.  Consider  the  configuration 
where  the  small  one  is  inside  the  big  one  with  their  currents  circu¬ 
lating  in  the  same  direction,  and  a  second  configuration  in  which 
the  currents  circulate  in  opposite  directions.  Compare  the  energies 
of  these  configurations  with  the  energy  when  the  solenoids  are  far 
apart.  Based  on  this  reasoning,  which  configuration  is  stable,  and 
in  which  configuration  will  the  little  solenoid  tend  to  get  twisted 
around  or  spit  out?  >  Hint,  p.  960  ■ 

25  (a)  A  solenoid  can  be  imagined  as  a  series  of  circular  current 
loops  that  are  spaced  along  their  common  axis.  Integrate  the  result 
of  example  12  on  page  677  to  show  that  the  field  on  the  axis  of  a 
solenoid  can  be  written  as  B  =  (2tt kr)/c2) (cos /?  +  cosy),  where  the 
angles  /3  and  7  are  defined  in  the  figure. 

(b)  Show  that  in  the  limit  where  the  solenoid  is  very  long,  this 
exact  result  agrees  with  the  approximate  one  derived  in  example  13 
on  page  680  using  Ampere’s  law. 

(c)  Note  that,  unlike  the  calculation  using  Ampere’s  law,  this  one 
is  valid  at  points  that  are  near  the  mouths  of  the  solenoid,  or  even 
outside  it  entirely.  If  the  solenoid  is  long,  at  what  point  on  the  axis 
is  the  field  equal  to  one  half  of  its  value  at  the  center  of  the  solenoid? 

(d)  What  happens  to  your  result  when  you  apply  it  to  points  that 
are  very  far  away  from  the  solenoid?  Does  this  make  sense? 
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A  nautilus  shell  is  approxi¬ 
mately  a  logarithmic  spiral,  of  the 
type  in  problem  28. 


26  The  first  step  in  the  proof  of  Ampere’s  law  on  page  681  is  to 

show  that  Ampere’s  law  holds  in  the  case  shown  in  figure  f/1,  where 
a  circular  Amperian  loop  is  centered  on  a  long,  straight  wire  that  is 
perpendicular  to  the  plane  of  the  loop.  Carry  out  this  calculation, 
using  the  result  for  the  field  of  a  wire  that  was  established  without 
using  Ampere’s  law.  ^ 

27  A  certain  region  of  space  has  a  magnetic  field  given  by  B  = 

bxy.  Find  the  electric  current  flowing  through  the  square  defined 
by  z  =  0,  0  <  x  <  a,  and  0  <  y  <  a.  V  M 

28  Perform  a  calculation  similar  to  the  one  in  problem  54,  but 

for  a  logarithmic  spiral,  defined  by  r  =  weu6,  and  show  that  the 
field  is  B  =  (kl/c2,u)(l/a  —  1/6).  Note  that  the  solution  to  problem 
54  is  given  in  the  back  of  the  book.  ^ 

29  (a)  For  the  geometry  described  in  example  8  on  page  667, 
find  the  field  at  a  point  the  lies  in  the  plane  of  the  wires,  but  not 
between  the  wires,  at  a  distance  6  from  the  center  line.  Use  the 
same  technique  as  in  that  example. 

(b)  Now  redo  the  calculation  using  the  technique  demonstrated  on 
page  672.  The  integrals  are  nearly  the  same,  but  now  the  reasoning 
is  reversed:  you  already  know  (5  =  1,  and  you  want  to  find  an 
unknown  field.  The  only  difference  in  the  integrals  is  that  you  are 
tiling  a  different  region  of  the  plane  in  order  to  mock  up  the  currents 
in  the  two  wires.  Note  that  you  can’t  tile  a  region  that  contains 
a  point  of  interest,  since  the  technique  uses  the  field  of  a  distant 
dipole.  V  ■ 

30  (a)  A  long,  skinny  solenoid  consists  of  N  turns  of  wire 

wrapped  uniformly  around  a  hollow  cylinder  of  length  i  and  cross- 
sectional  area  A.  Find  its  inductance.  V 

(b)  Show  that  your  answer  has  the  right  units  to  be  an  inductance. 


/2h 

Problem  32. 

31  Consider  two  solenoids,  one  of  which  is  smaller  so  that  it 
can  be  put  inside  the  other.  Assume  they  are  long  enough  to  act 
like  ideal  solenoids,  so  that  each  one  only  contributes  significantly 
to  the  field  inside  itself,  and  the  interior  fields  are  nearly  uniform. 
Consider  the  configuration  where  the  small  one  is  partly  inside  and 
partly  hanging  out  of  the  big  one,  with  their  currents  circulating  in 
the  same  direction.  Their  axes  are  constrained  to  coincide. 

(a)  Find  the  difference  in  the  magnetic  energy  between  the  configu¬ 

ration  where  the  solenoids  are  separate  and  the  configuration  where 
the  small  one  is  inserted  into  the  big  one.  Your  equation  will  in¬ 
clude  the  length  x  of  the  part  of  the  small  solenoid  that  is  inside 
the  big  one,  as  well  as  other  relevant  variables  describing  the  two 
solenoids.  V 

(b)  Based  on  your  answer  to  part  a,  find  the  force  acting  between 

the  solenoids.  v  ■ 
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32  Verify  Ampere’s  law  in  the  case  shown  in  the  figure,  as¬ 
suming  the  known  equation  for  the  field  of  a  wire.  A  wire  carrying 
current  I  passes  perpendicularly  through  the  center  of  the  rectan¬ 
gular  Amperian  surface.  The  length  of  the  rectangle  is  infinite,  so 
it’s  not  necessary  to  compute  the  contributions  of  the  ends. 

33  The  purpose  of  this  problem  is  to  find  how  the  gain  of  a 
transformer  depends  on  its  construction. 

(a)  The  number  of  loops  of  wire,  N,  in  a  solenoid  is  changed,  while 
keeping  the  length  constant.  How  does  the  impedance  depend  on  N ? 
State  your  answer  as  a  proportionality,  e.g.,  Z  oc  N 3  or  Z  oc  N~5. 

(b)  For  a  given  AC  voltage  applied  across  the  inductor,  how  does 
the  magnetic  field  depend  on  N ?  You  need  to  take  into  account 
both  the  dependence  of  a  solenoid’s  field  on  N  for  a  given  current 
and  your  answer  to  part  a,  which  affects  the  current. 

(c)  Now  consider  a  transformer  consisting  of  two  solenoids.  The 
input  side  has  N\  loops,  and  the  output  A^.  We  wish  to  find  how  the 
output  voltage  V2  depends  on  N\ ,  N2,  and  the  input  voltage  Vf.  The 
text  has  already  established  V2  oc  V\  N2 ,  so  it  only  remains  to  find 
the  dependence  on  N\.  Use  your  result  from  part  b  to  accomplish 
this.  The  ratio  V2/V1  is  called  the  voltage  gain. 

34  Problem  33  dealt  with  the  dependence  of  a  transformer’s  gain 
on  the  number  of  loops  of  wire  in  the  input  solenoid.  Carry  out  a 
similar  analysis  of  how  the  gain  depends  on  the  frequency  at  which 
the  circuit  is  operated. 

35  A  U-shaped  wire  makes  electrical  contact  with  a  second, 
straight  wire,  which  rolls  along  it  to  the  right,  as  shown  in  the 
figure.  The  whole  thing  is  immersed  in  a  uniform  magnetic  field, 
which  is  perpendicular  to  the  plane  of  the  circuit.  The  resistance  of 
the  rolling  wire  is  much  greater  than  that  of  the  U. 

(a)  Find  the  direction  of  the  force  on  the  wire  based  on  conservation 
of  energy. 

(b)  Verify  the  direction  of  the  force  using  right-hand  rules. 

(c)  Find  magnitude  of  the  force  acting  on  the  wire.  There  is  more 

than  one  way  to  do  this,  but  please  do  it  using  Faraday’s  law  (which 
works  even  though  it’s  the  Amperian  surface  itself  that  is  changing, 
rather  than  the  field).  V 

(d)  Consider  how  the  answer  to  part  a  would  have  changed  if  the 
direction  of  the  field  had  been  reversed,  and  also  do  the  case  where 
the  direction  of  the  rolling  wire’s  motion  is  reversed.  Verify  that 
this  is  in  agreement  with  your  answer  to  part  c. 

36  A  charged  particle  is  in  motion  at  speed  v,  in  a  region  of 
vacuum  through  which  an  electromagnetic  wave  is  passing.  In  what 
direction  should  the  particle  be  moving  in  order  to  minimize  the 
total  force  acting  on  it?  Consider  both  possibilities  for  the  sign  of 
the  charge.  (Based  on  a  problem  by  David  J.  Raymond.) 
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Problem  37. 


37  A  wire  loop  of  resistance  R  and  area  A,  lying  in  the  y  —  z 
plane,  falls  through  a  nonuniform  magnetic  field  B  =  kzic,  where  k 
is  a  constant.  The  z  axis  is  vertical. 

(a)  Find  the  direction  of  the  force  on  the  wire  based  on  conservation 
of  energy. 

(b)  Verify  the  direction  of  the  force  using  right-hand  rules. 

(c)  Find  the  magnetic  force  on  the  wire.  V  - 

38  A  capacitor  has  parallel  plates  of  area  A,  separated  by  a 
distance  h.  If  there  is  a  vacuum  between  the  plates,  then  Gauss’s 
law  gives  E  =  Anker  =  Ankq/A  for  the  field  between  the  plates,  and 
combining  this  with  E  =  V/h,  we  find  C  =  q/V  =  [1/An k)A/h. 

(a)  Generalize  this  derivation  to  the  case  where  there  is  a  dielectric 

between  the  plates,  (b)  Suppose  we  have  a  list  of  possible  materials 
we  could  choose  as  dielectrics,  and  we  wish  to  construct  a  capacitor 
that  will  have  the  highest  possible  energy  density,  Ue/v,  where  v  is 
the  volume.  For  each  dielectric,  we  know  its  permittivity  e,  and  also 
the  maximum  electric  field  E  it  can  sustain  without  breaking  down 
and  allowing  sparks  to  cross  between  the  plates.  Write  the  maximum 
energy  density  in  terms  of  these  two  variables,  and  determine  a  figure 
of  merit  that  could  be  used  to  decide  which  material  would  be  the 
best  choice.  ■ 

39  (a)  For  each  term  appearing  on  the  right  side  of  Maxwell’s 
equations,  give  an  example  of  an  everyday  situation  it  describes. 

(b)  Most  people  doing  calculations  in  the  SI  system  of  units  don’t 
use  k  and  k/c 2.  Instead,  they  express  everything  in  terms  of  the 
constants 


eD  =  —r  and 
47 rk 

An rk 


Rewrite  Maxwell’s  equations  in  terms  of  these  constants,  eliminating 
k  and  c  everywhere.  ^ 

40  (a)  Prove  that  in  an  electromagnetic  plane  wave,  half  the 

energy  is  in  the  electric  field  and  half  in  the  magnetic  field. 


(b)  Based  on  your  result  from  part  a,  find  the  proportionality 
constant  in  the  relation  dp  oc  E  xBdu,  where  dp  is  the  momentum 
of  the  part  of  a  plane  light  wave  contained  in  the  volume  du.  The 
vector  ExB,  multiplied  by  the  appropriate  constant,  is  known  as  the 
Poynting  vector,  and  even  outside  the  context  of  an  electromagnetic 
plane  wave,  it  can  be  interpreted  as  a  momentum  density  or  rate  of 
energy  flow.  (To  do  this  problem,  you  need  to  know  the  relativistic 
relationship  between  the  energy  and  momentum  of  a  beam  of  light 
from  problem  11  on  p.  446.)  V  - 
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41  (a)  A  beam  of  light  has  cross-sectional  area  A  and  power 
P,  i.e.,  P  is  the  number  of  joules  per  second  that  enter  a  window 
through  which  the  beam  passes.  Find  the  energy  density  U jv  in 
terms  of  P ,  A,  and  universal  constants. 

(b)  Find  E  and  B,  the  amplitudes  of  the  electric  and  magnetic  fields, 

in  terms  of  P,  A,  and  universal  constants  (i.e.,  your  answer  should 
not  include  U  or  v).  You  will  need  the  result  of  problem  40a.  A  real 
beam  of  light  usually  consists  of  many  short  wavetrains,  not  one  big 
sine  wave,  but  don’t  worry  about  that.  V  >  Hint,  p.  960 

(c)  A  beam  of  sunlight  has  an  intensity  of  P/A  =  1.35  x  103  W/m2, 
assuming  no  clouds  or  atmospheric  absorption.  This  is  known  as  the 
solar  constant.  Compute  E  and  B,  and  compare  with  the  strengths 
of  static  fields  you  experience  in  everyday  life:  E  106  V/m  in  a 
thunderstorm,  and  B  ~  10-3  T  for  the  Earth’s  magnetic  field. 

v  m 

42  The  circular  parallel-plate  capacitor  shown  in  the  figure  is 

being  charged  up  over  time,  with  the  voltage  difference  across  the 
plates  varying  as  V  =  -st,  where  s  is  a  constant.  The  plates  have 
radius  b ,  and  the  distance  between  them  is  d.  We  assume  d  <C  b, 
so  that  the  electric  field  between  the  plates  is  uniform,  and  parallel 
to  the  axis.  Find  the  induced  magnetic  field  at  a  point  between  the 
plates,  at  a  distance  R  from  the  axis.  >  Hint,  p.  960  C  ^ 

43  A  positively  charged  particle  is  released  from  rest  at  the  origin 
at  t  =  0,  in  a  region  of  vacuum  through  which  an  electromagnetic 
wave  is  passing.  The  particle  accelerates  in  response  to  the  wave. 
In  this  region  of  space,  the  wave  varies  as  E  =  xF  sin  ut,  B  = 
yBsinwt,  and  we  assume  that  the  particle  has  a  relatively  large 
value  of  m/q,  so  that  its  response  to  the  wave  is  sluggish,  and  it 
never  ends  up  moving  at  any  speed  comparable  to  the  speed  of  light. 
Therefore  we  don’t  have  to  worry  about  the  spatial  variation  of  the 
wave;  we  can  just  imagine  that  these  are  uniform  fields  imposed  by 
some  external  mechanism  on  this  region  of  space. 

(a)  Find  the  particle’s  coordinates  as  functions  of  time.  v 

(b)  Show  that  the  motion  is  confined  to  —zmax  <  z  <  zmax,  where 

Zmax  =  1-101  ^FB/m2w3j.  IS 

44  Electromagnetic  waves  are  supposed  to  have  their  electric 
and  magnetic  Helds  perpendicular  to  each  other.  (Throughout  this 
problem,  assume  we’re  talking  about  waves  traveling  through  a  vac¬ 
uum,  and  that  there  is  only  a  single  sine  wave  traveling  in  a  single 
direction,  not  a  superposition  of  sine  waves  passing  through  each 
other.)  Suppose  someone  claims  they  can  make  an  electromagnetic 
wave  in  which  the  electric  and  magnetic  fields  lie  in  the  same  plane. 
Prove  that  this  is  impossible  based  on  Maxwell’s  equations. 

45  Repeat  the  self-check  on  page  716,  but  with  one  change  in 
the  procedure:  after  we  charge  the  capacitor,  we  open  the  circuit, 
and  then  continue  with  the  observations. 
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46  On  page  719,  I  proved  that  Hy^  =  Hy)2  at  the  boundary 
between  two  substances  if  there  is  no  free  current  and  the  fields  are 
static.  In  fact,  each  of  Maxwell’s  four  equations  implies  a  constraint 
with  a  similar  structure.  Some  are  constraints  on  the  field  compo¬ 
nents  parallel  to  the  boundary,  while  others  are  constraints  on  the 
perpendicular  parts.  Since  some  of  the  fields  referred  to  in  Maxwell’s 
equations  are  the  electric  and  magnetic  fields  E  and  B,  while  others 
are  the  auxiliary  fields  D  and  H,  some  of  the  constraints  deal  with 
E  and  B,  others  with  D  and  H.  Find  the  other  three  constraints. 

a 

47  (a)  Figure  j  on  page  720  shows  a  hollow  sphere  with  n/ n0  = 
x ,  inner  radius  a,  and  outer  radius  b ,  which  has  been  subjected 
to  an  external  held  B0.  Finding  the  fields  on  the  exterior,  in  the 
shell,  and  on  the  interior  requires  finding  a  set  of  fields  that  satisfies 
five  boundary  conditions:  (1)  far  from  the  sphere,  the  held  must 
approach  the  constant  B0;  (2)  at  the  outer  surface  of  the  sphere, 
the  held  must  have  Hy ^  =  Hy  2,  as  discussed  on  page  719;  (3)  the 
same  constraint  applies  at  the  inner  surface  of  the  sphere;  (4)  and 
(5)  there  is  an  additional  constraint  on  the  helds  at  the  inner  and 
outer  surfaces,  as  found  in  problem  46.  The  goal  of  this  problem 
is  to  find  the  solution  for  the  helds,  and  from  it,  to  prove  that  the 
interior  held  is  uniform,  and  given  by 


B  = 


9x 

(2x  +  l)(.x  +  2)  —  2||(x  —  l)2 


This  is  a  very  difficult  problem  to  solve  from  first  principles,  because 
it’s  not  obvious  what  form  the  helds  should  have,  and  if  you  hadn’t 
been  told,  you  probably  wouldn’t  have  guessed  that  the  interior  held 
would  be  uniform.  We  could,  however,  guess  that  once  the  sphere 
becomes  polarized  by  the  external  held,  it  would  become  a  dipole, 
and  at  r  >  5,  the  held  would  be  a  uniform  held  superimposed  on 
the  held  of  a  dipole.  It  turns  out  that  even  close  to  the  sphere,  the 
solution  has  exactly  this  form.  In  order  to  complete  the  solution, 
we  need  to  find  the  held  in  the  shell  (a  <  r  <  b),  but  the  only  way 
this  held  could  match  up  with  the  detailed  angular  variation  of  the 
interior  and  exterior  helds  would  be  if  it  was  also  a  superposition  of 
a  uniform  held  with  a  dipole  held.  The  hnal  result  is  that  we  have 
four  unknowns:  the  strength  of  the  dipole  component  of  the  external 
held,  the  strength  of  the  uniform  and  dipole  components  of  the  held 
within  the  shell,  and  the  strength  of  the  uniform  interior  held.  These 
four  unknowns  are  to  be  determined  by  imposing  constraints  (2) 
through  (5)  above. 

(b)  Show  that  the  expression  from  part  a  has  physically  reasonable 
behavior  in  its  dependence  on  x  and  a/b.  ■ 
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48  Two  long,  parallel  strips  of  thin  metal  foil  form  a  configuration 
like  a  long,  narrow  sandwich.  The  air  gap  between  them  has  height 
h,  the  width  of  each  strip  is  w,  and  their  length  is  i.  Each  strip 
carries  current  /,  and  we  assume  for  concreteness  that  the  currents 
are  in  opposite  directions,  so  that  the  magnetic  force,  F,  between 
the  strips  is  repulsive. 

(a)  Find  the  force  in  the  limit  of  w  3>  h.  v 

(b)  Find  the  force  in  the  limit  of  w  <C  h ,  which  is  like  two  ordinary 
wires. 

(c)  Discuss  the  relationship  between  the  two  results. 

49  Suppose  we  are  given  a  permanent  magnet  with  a  compli¬ 
cated,  asymmetric  shape.  Describe  how  a  series  of  measurements 
with  a  magnetic  compass  could  be  used  to  determine  the  strength 
and  direction  of  its  magnetic  field  at  some  point  of  interest.  Assume 
that  you  are  only  able  to  see  the  direction  to  which  the  compass 
needle  settles;  you  cannot  measure  the  torque  acting  on  it.  £ 

50  On  page  686,  the  curl  of  xy  was  computed.  Now  consider 
the  fields  xx  and  yy. 

(a)  Sketch  these  fields. 

(b)  Using  the  same  technique  of  explicitly  constructing  a  small 

square,  prove  that  their  curls  are  both  zero.  Do  not  use  the  compo¬ 
nent  form  of  the  curl;  this  was  one  step  in  deriving  the  component 
form  of  the  curl.  ^ 

51  If  you  watch  a  movie  played  backwards,  some  vectors  reverse 
their  direction.  For  instance,  people  walk  backwards,  with  their 
velocity  vectors  flipped  around.  Other  vectors,  such  as  forces,  keep 
the  same  direction,  e.g.,  gravity  still  pulls  down.  An  electric  field 
is  another  example  of  a  vector  that  doesn’t  turn  around:  positive 
charges  are  still  positive  in  the  time-reversed  universe,  so  they  still 
make  diverging  electric  fields,  and  likewise  for  the  converging  fields 
around  negative  charges. 

(a)  How  does  the  momentum  of  a  material  object  behave  under 

time-reversal?  >  Solution,  p.  981 

(b)  The  laws  of  physics  are  still  valid  in  the  time-reversed  universe. 

For  example,  show  that  if  two  material  objects  are  interacting,  and 
momentum  is  conserved,  then  momentum  is  still  conserved  in  the 
time-reversed  universe.  >  Solution,  p.  981 

(c)  Discuss  how  currents  and  magnetic  fields  would  behave  under 

time  reversal.  >  Hint,  p.  960 

(d)  Similarly,  show  that  the  equation  dp  tx  Ex  B  is  still  valid  under 

time  reversal.  ■ 

52  This  problem  is  a  more  advanced  exploration  of  the  time- 
reversal  ideas  introduced  in  problem  51. 

(a)  In  that  problem,  we  assumed  that  charge  did  not  flip  its  sign  un¬ 
der  time  reversal.  Suppose  we  make  the  opposite  assumption,  that 
charge  does  change  its  sign.  This  is  an  idea  introduced  by  Richard 
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Feynman:  that  antimatter  is  really  matter  traveling  backward  in 
time!  Determine  the  time-reversal  properties  of  E  and  B  under  this 
new  assumption,  and  show  that  dp  oc  E  x  B  is  still  valid  under 
time-reversal. 

(b)  Show  that  Maxwell’s  equations  are  time-reversal  symmetric,  i.e., 
that  if  the  fields  E(x,  y,  z,  t )  and  B(x,  y,  z,  t )  satisfy  Maxwell’s  equa¬ 
tions,  then  so  do  E (x,  y,  z,  — t )  and  B(x,  y,  z,  — t ).  Demonstrate  this 
under  both  possible  assumptions  about  charge,  q  q  and  q  — >•  —q. 

■ 

53  The  purpose  of  this  problem  is  to  prove  that  the  constant 
of  proportionality  a  in  the  equation  d Um  =  aB2  dv,  for  the  energy 
density  of  the  magnetic  field,  is  given  by  a  =  c2/8nk  as  asserted  on 
page  671.  The  geometry  we’ll  use  consists  of  two  sheets  of  current, 
like  a  sandwich  with  nothing  in  between  but  some  vacuum  in  which 
there  is  a  magnetic  field.  The  currents  are  in  opposite  directions, 
and  we  can  imagine  them  as  being  joined  together  at  the  ends  to 
form  a  complete  circuit,  like  a  tube  made  of  paper  that  has  been 
squashed  almost  flat.  The  sheets  have  lengths  L  in  the  direction 
parallel  to  the  current,  and  widths  w.  They  are  separated  by  a  dis¬ 
tance  d,  which,  for  convenience,  we  assume  is  small  compared  to  L 
and  w.  Thus  each  sheet’s  contribution  to  the  field  is  uniform,  and 
can  be  approximated  by  the  expression  2irkri/c2. 

(a)  Make  a  drawing  similar  to  the  one  in  figure  11.2.1  on  page  670, 
and  show  that  in  this  opposite-current  configuration,  the  magnetic 
fields  of  the  two  sheets  reinforce  in  the  region  between  them,  pro¬ 
ducing  double  the  field,  but  cancel  on  the  outside. 

(b)  By  analogy  with  the  case  of  a  single  strand  of  wire,  one  sheet’s 
force  on  the  other  is  ILBi,  were  /  =  rjw  is  the  total  current  in  one 
sheet,  and  B\  =  B/2  is  the  field  contributed  by  only  one  of  the 
sheets,  since  the  sheet  can’t  make  any  net  force  on  itself.  Based  on 
your  drawing  and  the  right-hand  rule,  show  that  this  force  is  repul¬ 
sive. 

For  the  rest  of  the  problem,  consider  a  process  in  which  the  sheets 
start  out  touching,  and  are  then  separated  to  a  distance  d.  Since 
the  force  between  the  sheets  is  repulsive,  they  do  mechanical  work 
on  the  outside  world  as  they  are  separated,  in  much  the  same  way 
that  the  piston  in  an  engine  does  work  as  the  gases  inside  the  cylin¬ 
der  expand.  At  the  same  time,  however,  there  is  an  induced  emf 
which  would  tend  to  extinguish  the  current,  so  in  order  to  maintain 
a  constant  current,  energy  will  have  to  be  drained  from  a  battery. 
There  are  three  types  of  energy  involved:  the  increase  in  the  mag¬ 
netic  field  energy,  the  increase  in  the  energy  of  the  outside  world, 
and  the  decrease  in  energy  as  the  battery  is  drained.  (We  assume 
the  sheets  have  very  little  resistance,  so  there  is  no  ohmic  heating 
involved.)  v 

(c)  Find  the  mechanical  work  done  by  the  sheets,  which  equals  the 
increase  in  the  energy  of  the  outside  world.  Show  that  your  result 
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can  be  stated  in  terms  of  rj,  the  final  volume  v  =  wLd ,  and  nothing 
else  but  numerical  and  physical  constants.  V 

(d)  The  power  supplied  by  the  battery  is  P  =  IT e  (like  P  =  IAV, 

but  with  an  emf  instead  of  a  voltage  difference),  and  the  circulation 
is  given  by  T  =  —  d<h^/ d t.  The  negative  sign  indicates  that  the 
battery  is  being  drained.  Calculate  the  energy  supplied  by  the  bat¬ 
tery,  and,  as  in  part  c,  show  that  the  result  can  be  stated  in  terms 
of  77,  v,  and  universal  constants.  V 

(e)  Find  the  increase  in  the  magnetic-field  energy,  in  terms  of  rj,  v, 

and  the  unknown  constant  a.  V 

(f)  Use  conservation  of  energy  to  relate  your  answers  from  parts  c, 

d,  and  e,  and  solve  for  a.  V  B 

54  Magnet  coils  are  often  wrapped  in  multiple  layers.  The 
figure  shows  the  special  case  where  the  layers  are  all  confined  to  a 
single  plane,  forming  a  spiral.  Since  the  thickness  of  the  wires  (plus 
their  insulation)  is  fixed,  the  spiral  that  results  is  a  mathematical 
type  known  as  an  Archimedean  spiral,  in  which  the  turns  are  evenly 
spaced.  The  equation  of  the  spiral  is  r  =  wO ,  where  w  is  a  constant. 

For  a  spiral  that  starts  from  r  =  a  and  ends  at  r  =  b,  show  that  the 
field  at  the  center  is  given  by  (kl / c2w)\nb/ a. 

>  Solution,  p.  981 

Problem  54. 


Key  to  symbols: 

easy  -  typical  B  challenging  ■  difficult  B  very  difficult 
V  An  answer  check  is  available  at  www.lightandmatter.com. 
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Exercises 

Exercise  11B:  Polarization 

Apparatus: 

calcite  (Iceland  spar)  crystal 
Polaroid  film 

1.  Lay  the  crystal  on  a  piece  of  paper  that  has  print  on  it.  You  will  observe  a  double  image. 
See  what  happens  if  you  rotate  the  crystal. 

Evidently  the  crystal  does  something  to  the  light  that  passes  through  it  on  the  way  from  the 
page  to  your  eye.  One  beam  of  light  enters  the  crystal  from  underneath,  but  two  emerge  from 
the  top;  by  conservation  of  energy  the  energy  of  the  original  beam  must  be  shared  between 
them.  Consider  the  following  three  possible  interpretations  of  what  you  have  observed: 

(a)  The  two  new  beams  differ  from  each  other,  and  from  the  original  beam,  only  in  energy. 
Their  other  properties  are  the  same. 

(b)  The  crystal  adds  to  the  light  some  mysterious  new  property  (not  energy),  which  comes  in 
two  flavors,  X  and  Y.  Ordinary  light  doesn’t  have  any  of  either.  One  beam  that  emerges  from 
the  crystal  has  some  X  added  to  it,  and  the  other  beam  has  Y. 

(c)  There  is  some  mysterious  new  property  that  is  possessed  by  all  light.  It  comes  in  two  flavors, 
X  and  Y,  and  most  ordinary  light  sources  make  an  equal  mixture  of  type  X  and  type  Y  light. 
The  original  beam  is  an  even  mixture  of  both  types,  and  this  mixture  is  then  split  up  by  the 
crystal  into  the  two  purified  forms. 

In  parts  2  and  3  you’ll  make  observations  that  will  allow  you  to  figure  out  which  of  these  is 
correct. 

2.  Now  place  a  polaroid  film  over  the  crystal  and  see  what  you  observe.  What  happens  when 
you  rotate  the  Him  in  the  horizontal  plane?  Does  this  observation  allow  you  to  rule  out  any  of 
the  three  interpretations? 


3.  Now  put  the  polaroid  film  under  the  crystal  and  try  the  same  thing.  Putting  together  all 
your  observations,  which  interpretation  do  you  think  is  correct? 


4.  Look  at  an  overhead  light  fixture  through  the  polaroid,  and  try  rotating  it.  What  do  you 
observe?  What  does  this  tell  you  about  the  light  emitted  by  the  lightbulb? 
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5.  Now  position  yourself  with  your  head  under  a  light  fixture  and  directly  over  a  shiny  surface, 
such  as  a  glossy  tabletop.  You’ll  see  the  lamp’s  reflection,  and  the  light  coining  from  the  lamp 
to  your  eye  will  have  undergone  a  reflection  through  roughly  a  180-degree  angle  (i.e.,  it  very 
nearly  reversed  its  direction).  Observe  this  reflection  through  the  polaroid,  and  try  rotating  it. 
Finally,  position  yourself  so  that  you  are  seeing  glancing  reflections,  and  try  the  same  thing. 
Summarize  what  happens  to  light  with  properties  X  and  Y  when  it  is  reflected.  (This  is  the 
principle  behind  polarizing  sunglasses.) 
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Chapter  12 

Optics 

12.1  The  Ray  Model  of  Light 

Ads  for  one  Macintosh  computer  bragged  that  it  could  do  an  arith¬ 
metic  calculation  in  less  time  than  it  took  for  the  light  to  get  from  the 
screen  to  your  eye.  We  find  this  impressive  because  of  the  contrast 
between  the  speed  of  light  and  the  speeds  at  which  we  interact  with 
physical  objects  in  our  environment.  Perhaps  it  shouldn’t  surprise 
us,  then,  that  Newton  succeeded  so  well  in  explaining  the  motion  of 
objects,  but  was  far  less  successful  with  the  study  of  light. 

The  climax  of  our  study  of  electricity  and  magnetism  was  discov¬ 
ery  that  light  is  an  electromagnetic  wave.  Knowing  this,  however,  is 
not  the  same  as  knowing  everything  about  eyes  and  telescopes.  In 
fact,  the  full  description  of  light  as  a  wave  can  be  rather  cumber¬ 
some.  We  will  instead  spend  most  of  our  treatment  of  optics  making 
use  of  a  simpler  model  of  light,  the  ray  model,  which  does  a  fine  job 
in  most  practical  situations.  Not  only  that,  but  we  will  even  back¬ 
track  a  little  and  start  with  a  discussion  of  basic  ideas  about  light 
and  vision  that  predated  the  discovery  of  electromagnetic  waves. 
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12.1.1  The  nature  of  light 

The  cause  and  effect  relationship  in  vision 

Despite  its  title,  this  chapter  is  far  from  your  first  look  at  light. 
That  familiarity  might  seem  like  an  advantage,  but  most  people  have 
never  thought  carefully  about  light  and  vision.  Even  smart  people 
who  have  thought  hard  about  vision  have  come  up  with  incorrect 
ideas.  The  ancient  Greeks,  Arabs  and  Chinese  had  theories  of  light 
and  vision,  all  of  which  were  mostly  wrong,  and  all  of  which  were 
accepted  for  thousands  of  years. 

One  thing  the  ancients  did  get  right  is  that  there  is  a  distinction 
between  objects  that  emit  light  and  objects  that  don’t.  When  you 
see  a  leaf  in  the  forest,  it’s  because  three  different  objects  are  doing 
their  jobs:  the  leaf,  the  eye,  and  the  sun.  But  luminous  objects 
like  the  sun,  a  flame,  or  the  filament  of  a  light  bulb  can  be  seen  by 
the  eye  without  the  presence  of  a  third  object.  Emission  of  light 
is  often,  but  not  always,  associated  with  heat.  In  modern  times, 
we  are  familiar  with  a  variety  of  objects  that  glow  without  being 
heated,  including  fluorescent  lights  and  glow-in-the-dark  toys. 

How  do  we  see  luminous  objects?  The  Greek  philosophers  Pythago¬ 
ras  (b.  ca.  560  BC)  and  Empedocles  of  Acragas  (b.  ca.  492 
BC),  who  unfortunately  were  very  influential,  claimed  that  when 
you  looked  at  a  candle  flame,  the  flame  and  your  eye  were  both 
sending  out  some  kind  of  mysterious  stuff,  and  when  your  eye’s  stuff 
collided  with  the  candle’s  stuff,  the  candle  would  become  evident  to 
your  sense  of  sight. 

Bizarre  as  the  Greek  “collision  of  stuff  theory”  might  seem,  it 
had  a  couple  of  good  features.  It  explained  why  both  the  candle 
and  your  eye  had  to  be  present  for  your  sense  of  sight  to  function. 
The  theory  could  also  easily  be  expanded  to  explain  how  we  see 
nonluminous  objects.  If  a  leaf,  for  instance,  happened  to  be  present 
at  the  site  of  the  collision  between  your  eye’s  stuff  and  the  candle’s 
stuff,  then  the  leaf  would  be  stimulated  to  express  its  green  nature, 
allowing  you  to  perceive  it  as  green. 

Modern  people  might  feel  uneasy  about  this  theory,  since  it  sug¬ 
gests  that  greenness  exists  only  for  our  seeing  convenience,  implying 
a  human  precedence  over  natural  phenomena.  Nowadays,  people 
would  expect  the  cause  and  effect  relationship  in  vision  to  be  the 
other  way  around,  with  the  leaf  doing  something  to  our  eye  rather 
than  our  eye  doing  something  to  the  leaf.  But  how  can  you  tell? 
The  most  common  way  of  distinguishing  cause  from  effect  is  to  de¬ 
termine  which  happened  first,  but  the  process  of  seeing  seems  to 
occur  too  quickly  to  determine  the  order  in  which  things  happened. 
Certainly  there  is  no  obvious  time  lag  between  the  moment  when 
you  move  your  head  and  the  moment  when  your  reflection  in  the 
mirror  moves. 
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Today,  photography  provides  the  simplest  experimental  evidence 
that  nothing  has  to  be  emitted  from  your  eye  and  hit  the  leaf  in  order 
to  make  it  “greenify.”  A  camera  can  take  a  picture  of  a  leaf  even 
if  there  are  no  eyes  anywhere  nearby.  Since  the  leaf  appears  green 
regardless  of  whether  it  is  being  sensed  by  a  camera,  your  eye,  or 
an  insect’s  eye,  it  seems  to  make  more  sense  to  say  that  the  leaf’s 
greenness  is  the  cause,  and  something  happening  in  the  camera  or 
eye  is  the  effect. 

Light  is  a  thing,  and  it  travels  from  one  point  to  another. 

Another  issue  that  few  people  have  considered  is  whether  a  can¬ 
dle’s  flame  simply  affects  your  eye  directly,  or  whether  it  sends  out 
light  which  then  gets  into  your  eye.  Again,  the  rapidity  of  the  effect 
makes  it  difficult  to  tell  what’s  happening.  If  someone  throws  a  rock 
at  you,  you  can  see  the  rock  on  its  way  to  your  body,  and  you  can 
tell  that  the  person  affected  you  by  sending  a  material  substance 
your  way,  rather  than  just  harming  you  directly  with  an  arm  mo¬ 
tion,  which  would  be  known  as  “action  at  a  distance.”  It  is  not  easy 
to  do  a  similar  observation  to  see  whether  there  is  some  “stuff”  that 
travels  from  the  candle  to  your  eye,  or  whether  it  is  a  case  of  action 
at  a  distance. 

Newtonian  physics  includes  both  action  at  a  distance  (e.g.,  the 
earth’s  gravitational  force  on  a  falling  object)  and  contact  forces 
such  as  the  normal  force,  which  only  allow  distant  objects  to  exert 
forces  on  each  other  by  shooting  some  substance  across  the  space 
between  them  (e.g.,  a  garden  hose  spraying  out  water  that  exerts  a 
force  on  a  bush). 

One  piece  of  evidence  that  the  candle  sends  out  stuff  that  travels 
to  your  eye  is  that  as  in  figure  a,  intervening  transparent  substances 
can  make  the  candle  appear  to  be  in  the  wrong  location,  suggesting 
that  light  is  a  thing  that  can  be  bumped  off  course.  Many  peo¬ 
ple  would  dismiss  this  kind  of  observation  as  an  optical  illusion, 
however.  (Some  optical  illusions  are  purely  neurological  or  psycho¬ 
logical  effects,  although  some  others,  including  this  one,  turn  out  to 
be  caused  by  the  behavior  of  light  itself.) 

A  more  convincing  way  to  decide  in  which  category  light  belongs 
is  to  find  out  if  it  takes  time  to  get  from  the  candle  to  your  eye;  in 
Newtonian  physics,  action  at  a  distance  is  supposed  to  be  instan¬ 
taneous.  The  fact  that  we  speak  casually  today  of  “the  speed  of 
light”  implies  that  at  some  point  in  history,  somebody  succeeded  in 
showing  that  light  did  not  travel  infinitely  fast.  Galileo  tried,  and 
failed,  to  detect  a  finite  speed  for  light,  by  arranging  with  a  person 
in  a  distant  tower  to  signal  back  and  forth  with  lanterns.  Galileo 
uncovered  his  lantern,  and  when  the  other  person  saw  the  light,  he 
uncovered  his  lantern.  Galileo  was  unable  to  measure  any  time  lag 
that  was  significant  compared  to  the  limitations  of  human  reflexes. 


■4 


glass 


a  /  Light  from  a  candle  is  bumped 
off  course  by  a  piece  of  glass. 
Inserting  the  glass  causes  the 
apparent  location  of  the  candle 
to  shift.  The  same  effect  can 
be  produced  by  taking  off  your 
eyeglasses  and  looking  at  which 
you  see  near  the  edge  of  the 
lens,  but  a  flat  piece  of  glass 
works  just  as  well  as  a  lens  for 
this  purpose. 
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b/An  image  of  Jupiter  and 
its  moon  lo  (left)  from  the  Cassini 
probe. 


c  /  The  earth  is  moving  to¬ 
ward  Jupiter  and  lo.  Since  the 
distance  is  shrinking,  it  is  taking 
less  and  less  time  for  the  light  to 
get  to  us  from  lo,  and  lo  appears 
to  circle  Jupiter  more  quickly  than 
normal.  Six  months  later,  the 
earth  will  be  on  the  opposite  side 
of  the  sun,  and  receding  from 
Jupiter  and  lo,  so  lo  will  appear 
to  revolve  around  Jupiter  more 
slowly. 


The  first  person  to  prove  that  light’s  speed  was  finite,  and  to 
determine  it  numerically,  was  Ole  Roerner,  in  a  series  of  measure¬ 
ments  around  the  year  1675.  Roerner  observed  lo,  one  of  Jupiter’s 
moons,  over  a  period  of  several  years.  Since  lo  presumably  took  the 
same  amount  of  time  to  complete  each  orbit  of  Jupiter,  it  could  be 
thought  of  as  a  very  distant,  very  accurate  clock.  A  practical  and  ac¬ 
curate  pendulum  clock  had  recently  been  invented,  so  Roerner  could 
check  whether  the  ratio  of  the  two  clocks’  cycles,  about  42.5  hours 
to  1  orbit,  stayed  exactly  constant  or  changed  a  little.  If  the  process 
of  seeing  the  distant  moon  was  instantaneous,  there  would  be  no 
reason  for  the  two  to  get  out  of  step.  Even  if  the  speed  of  light  was 
finite,  you  might  expect  that  the  result  would  be  only  to  offset  one 
cycle  relative  to  the  other.  The  earth  does  not,  however,  stay  at  a 
constant  distance  from  Jupiter  and  its  moons.  Since  the  distance  is 
changing  gradually  due  to  the  two  planets’  orbital  motions,  a  finite 
speed  of  light  would  make  the  “lo  clock”  appear  to  run  faster  as  the 
planets  drew  near  each  other,  and  more  slowly  as  their  separation 
increased.  Roerner  did  find  a  variation  in  the  apparent  speed  of  Io’s 
orbits,  which  caused  Io’s  eclipses  by  Jupiter  (the  moments  when  lo 
passed  in  front  of  or  behind  Jupiter)  to  occur  about  7  minutes  early 
when  the  earth  was  closest  to  Jupiter,  and  7  minutes  late  when  it 
was  farthest.  Based  on  these  measurements,  Roerner  estimated  the 
speed  of  light  to  be  approximately  2  x  108  m/s,  which  is  in  the  right 
ballpark  compared  to  modern  measurements  of  3  X  108  m/s.  (I’m  not 
sure  whether  the  fairly  large  experimental  error  was  mainly  due  to 
imprecise  knowledge  of  the  radius  of  the  earth’s  orbit  or  limitations 
in  the  reliability  of  pendulum  clocks.) 

Light  can  travel  through  a  vacuum. 

Many  people  are  confused  by  the  relationship  between  sound 
and  light.  Although  we  use  different  organs  to  sense  them,  there  are 
some  similarities.  For  instance,  both  light  and  sound  are  typically 
emitted  in  all  directions  by  their  sources.  Musicians  even  use  visual 
metaphors  like  “tone  color,”  or  “a  bright  timbre”  to  describe  sound. 
One  way  to  see  that  they  are  clearly  different  phenomena  is  to  note 
their  very  different  velocities.  Sure,  both  are  pretty  fast  compared  to 
a  flying  arrow  or  a  galloping  horse,  but  as  we  have  seen,  the  speed  of 
light  is  so  great  as  to  appear  instantaneous  in  most  situations.  The 
speed  of  sound,  however,  can  easily  be  observed  just  by  watching  a 
group  of  schoolchildren  a  hundred  feet  away  as  they  clap  their  hands 
to  a  song.  There  is  an  obvious  delay  between  when  you  see  their 
palms  come  together  and  when  you  hear  the  clap. 

The  fundamental  distinction  between  sound  and  light  is  that 
sound  is  an  oscillation  in  air  pressure,  so  it  requires  air  (or  some 
other  medium  such  as  water)  in  which  to  travel.  Today,  we  know 
that  outer  space  is  a  vacuum,  so  the  fact  that  we  get  light  from  the 
sun,  moon  and  stars  clearly  shows  that  air  is  not  necessary  for  the 
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propagation  of  light. 

Discussion  Questions 

A  If  you  observe  thunder  and  lightning,  you  can  tell  how  far  away  the 
storm  is.  Do  you  need  to  know  the  speed  of  sound,  of  light,  or  of  both? 

B  When  phenomena  like  X-rays  and  cosmic  rays  were  first  discovered, 
suggest  a  way  one  could  have  tested  whether  they  were  forms  of  light. 

C  Why  did  Roemer  only  need  to  know  the  radius  of  the  earth’s  orbit, 
not  Jupiter’s,  in  order  to  find  the  speed  of  light? 

12.1.2  Interaction  of  light  with  matter 

Absorption  of  light 

The  reason  why  the  sun  feels  warm  on  your  skin  is  that  the 
sunlight  is  being  absorbed,  and  the  light  energy  is  being  transformed 
into  heat  energy.  The  same  happens  with  artificial  light,  so  the  net 
result  of  leaving  a  light  turned  on  is  to  heat  the  room.  It  doesn’t 
matter  whether  the  source  of  the  light  is  hot,  like  the  sun,  a  flame, 
or  an  incandescent  light  bulb,  or  cool,  like  a  fluorescent  bulb.  (If 
your  house  has  electric  heat,  then  there  is  absolutely  no  point  in 
fastidiously  turning  off  lights  in  the  winter;  the  lights  will  help  to 
heat  the  house  at  the  same  dollar  rate  as  the  electric  heater.) 

This  process  of  heating  by  absorption  is  entirely  different  from 
heating  by  thermal  conduction,  as  when  an  electric  stove  heats 
spaghetti  sauce  through  a  pan.  Heat  can  only  be  conducted  through 
matter,  but  there  is  vacuum  between  us  and  the  sun,  or  between  us 
and  the  filament  of  an  incandescent  bulb.  Also,  heat  conduction  can 
only  transfer  heat  energy  from  a  hotter  object  to  a  colder  one,  but  a 
cool  fluorescent  bulb  is  perfectly  capable  of  heating  something  that 
had  already  started  out  being  warmer  than  the  bulb  itself. 

How  we  see  nonluminous  objects 

Not  all  the  light  energy  that  hits  an  object  is  transformed  into 
heat.  Some  is  reflected,  and  this  leads  us  to  the  question  of  how 
we  see  nonluminous  objects.  If  you  ask  the  average  person  how  we 
see  a  light  bulb,  the  most  likely  answer  is  “The  light  bulb  makes 
light,  which  hits  our  eyes.”  But  if  you  ask  how  we  see  a  book,  they 
are  likely  to  say  “The  bulb  lights  up  the  room,  and  that  lets  me 
see  the  book.”  All  mention  of  light  actually  entering  our  eyes  has 
mysteriously  disappeared. 

Most  people  would  disagree  if  you  told  them  that  light  was  re¬ 
flected  from  the  book  to  the  eye,  because  they  think  of  reflection  as 
something  that  mirrors  do,  not  something  that  a  book  does.  They 
associate  reflection  with  the  formation  of  a  reflected  image,  which 
does  not  seem  to  appear  in  a  piece  of  paper. 

Imagine  that  you  are  looking  at  your  reflection  in  a  nice  smooth 
piece  of  aluminum  foil,  fresh  off  the  roll.  You  perceive  a  face,  not  a 
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d  /  Two  self-portraits  of  the 
author,  one  taken  in  a  mirror  and 
one  with  a  piece  of  aluminum  foil. 


specular  diffuse 

reflection  reflection 


e  /  Specular  and  diffuse  re¬ 
flection. 


piece  of  metal.  Perhaps  you  also  see  the  bright  reflection  of  a  lamp 
over  your  shoulder  behind  you.  Now  imagine  that  the  foil  is  just 
a  little  bit  less  smooth.  The  different  parts  of  the  image  are  now 
a  little  bit  out  of  alignment  with  each  other.  Your  brain  can  still 
recognize  a  face  and  a  lamp,  but  it’s  a  little  scrambled,  like  a  Picasso 
painting.  Now  suppose  you  use  a  piece  of  aluminum  foil  that  has 
been  crumpled  up  and  then  flattened  out  again.  The  parts  of  the 
image  are  so  scrambled  that  you  cannot  recognize  an  image.  Instead, 
your  brain  tells  you  you’re  looking  at  a  rough,  silvery  surface. 

Mirror-like  reflection  at  a  specific  angle  is  known  as  specular 
reflection,  and  random  reflection  in  many  directions  is  called  diffuse 
reflection.  Diffuse  reflection  is  how  we  see  nonluminous  objects. 
Specular  reflection  only  allows  us  to  see  images  of  objects  other 
than  the  one  doing  the  reflecting.  In  top  part  of  figure  d,  imagine 
that  the  rays  of  light  are  coming  from  the  sun.  If  you  are  looking 
down  at  the  reflecting  surface,  there  is  no  way  for  your  eye-brain 
system  to  tell  that  the  rays  are  not  really  coming  from  a  sun  down 
below  you. 

Figure  f  shows  another  example  of  how  we  can’t  avoid  the  con¬ 
clusion  that  light  bounces  off  of  things  other  than  mirrors.  The 
lamp  is  one  I  have  in  my  house.  It  has  a  bright  bulb,  housed  in  a 
completely  opaque  bowl-shaped  metal  shade.  The  only  way  light 
can  get  out  of  the  lamp  is  by  going  up  out  of  the  top  of  the  bowl. 
The  fact  that  I  can  read  a  book  in  the  position  shown  in  the  figure 
means  that  light  must  be  bouncing  off  of  the  ceiling,  then  bouncing 
off  of  the  book,  then  finally  getting  to  my  eye. 

This  is  where  the  shortcomings  of  the  Greek  theory  of  vision 
become  glaringly  obvious.  In  the  Greek  theory,  the  light  from  the 
bulb  and  my  mysterious  “eye  rays”  are  both  supposed  to  go  to  the 
book,  where  they  collide,  allowing  me  to  see  the  book.  But  we  now 
have  a  total  of  four  objects:  lamp,  eye,  book,  and  ceiling.  Where 
does  the  ceiling  come  in?  Does  it  also  send  out  its  own  mysterious 
“ceiling  rays,”  contributing  to  a  three-way  collision  at  the  book? 
That  would  just  be  too  bizarre  to  believe! 

The  differences  among  white,  black,  and  the  various  shades  of 
gray  in  between  is  a  matter  of  what  percentage  of  the  light  they 
absorb  and  what  percentage  they  reflect.  That’s  why  light-colored 
clothing  is  more  comfortable  in  the  summer,  and  light-colored  up¬ 
holstery  in  a  car  stays  cooler  that  dark  upholstery. 
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Numerical  measurement  of  the  brightness  of  light 

We  have  already  seen  that  the  physiological  sensation  of  loudness 
relates  to  the  sound’s  intensity  (power  per  unit  area),  but  is  not 
directly  proportional  to  it.  If  sound  A  has  an  intensity  of  1  nW/m2, 
sound  B  is  10  nW/m2,  and  sound  C  is  100  nW /m2,  then  the  increase 
in  loudness  from  B  to  C  is  perceived  to  be  the  same  as  the  increase 
from  A  to  B,  not  ten  times  greater.  That  is,  the  sensation  of  loudness 
is  logarithmic. 

The  same  is  true  for  the  brightness  of  light.  Brightness  is  re¬ 
lated  to  power  per  unit  area,  but  the  psychological  relationship  is 
a  logarithmic  one  rather  than  a  proportionality.  For  doing  physics, 
it’s  the  power  per  unit  area  that  we’re  interested  in.  The  relevant 
unit  is  W /nr2 .  One  way  to  determine  the  brightness  of  light  is  to 
measure  the  increase  in  temperature  of  a  black  object  exposed  to 
the  light.  The  light  energy  is  being  converted  to  heat  energy,  and 
the  amount  of  heat  energy  absorbed  in  a  given  amount  of  time  can 
be  related  to  the  power  absorbed,  using  the  known  heat  capacity 
of  the  object.  More  practical  devices  for  measuring  light  intensity, 
such  as  the  light  meters  built  into  some  cameras,  are  based  on  the 
conversion  of  light  into  electrical  energy,  but  these  meters  have  to 
be  calibrated  somehow  against  heat  measurements. 

Discussion  Questions 

A  The  curtains  in  a  room  are  drawn,  but  a  small  gap  lets  light  through, 
illuminating  a  spot  on  the  floor.  It  may  or  may  not  also  be  possible  to  see 
the  beam  of  sunshine  crossing  the  room,  depending  on  the  conditions. 
What’s  going  on? 

B  Laser  beams  are  made  of  light.  In  science  fiction  movies,  laser 
beams  are  often  shown  as  bright  lines  shooting  out  of  a  laser  gun  on  a 
spaceship.  Why  is  this  scientifically  incorrect? 

C  A  documentary  film-maker  went  to  Harvard’s  1987  graduation  cer¬ 
emony  and  asked  the  graduates,  on  camera,  to  explain  the  cause  of  the 
seasons.  Only  two  out  of  23  were  able  to  give  a  correct  explanation,  but 
you  now  have  all  the  information  needed  to  figure  it  out  for  yourself,  as¬ 
suming  you  didn’t  already  know.  The  figure  shows  the  earth  in  its  winter 
and  summer  positions  relative  to  the  sun.  Hint:  Consider  the  units  used 
to  measure  the  brightness  of  light,  and  recall  that  the  sun  is  lower  in  the 
sky  in  winter,  so  its  rays  are  coming  in  at  a  shallower  angle. 

12.1.3  The  ray  model  of  light 

Models  of  light 

Note  how  I’ve  been  casually  diagramming  the  motion  of  light 
with  pictures  showing  light  rays  as  lines  on  the  page.  More  formally, 
this  is  known  as  the  ray  model  of  light.  The  ray  model  of  light 
seems  natural  once  we  convince  ourselves  that  light  travels  through 
space,  and  observe  phenomena  like  sunbeams  coming  through  holes 
in  clouds.  Having  already  been  introduced  to  the  concept  of  light 
as  an  electromagnetic  wave,  you  know  that  the  ray  model  is  not  the 


f  /  Light  bounces  off  of  the 
ceiling,  then  off  of  the  book. 


winter  summer 


g  /  Discussion  question  C. 
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ultimate  truth  about  light,  but  the  ray  model  is  simpler,  and  in  any 
case  science  always  deals  with  models  of  reality,  not  the  ultimate 
nature  of  reality.  The  following  table  summarizes  three  models  of 
light. 


h  /  Three  models  of  light. 


ray  model 


Advantage:  Simplicity. 


wave  model 


Advantage:  Color  is  described 
naturally  in  terms  of  wavelength. 
Required  in  order  to  explain  the 
interaction  of  light  with  material 
objects  of  sizes  comparable  to  or 
smaller  than  a  wavelength  of  light. 


particle  model 


Required  in  order  to  explain  the 
#  interaction  of  light  with  individual 
atoms.  At  the  atomic  level,  it 
becomes  apparent  that  a  beam  of 
light  has  a  certain  graininess  to  it. 


The  ray  model  is  a  generic  one.  By  using  it  we  can  discuss  the 
path  taken  by  the  light,  without  committing  ourselves  to  any  specific 
description  of  what  it  is  that  is  moving  along  that  path.  We  will 
use  the  nice  simple  ray  model  for  most  of  our  treatment  of  optics, 
and  with  it  we  can  analyze  a  great  many  devices  and  phenomena. 
Not  until  section  12.5  will  we  concern  ourselves  specifically  with 
wave  optics,  although  in  the  intervening  chapters  I  will  sometimes 
analyze  the  same  phenomenon  using  both  the  ray  model  and  the 
wave  model. 

Note  that  the  statements  about  the  applicability  of  the  various 
models  are  only  rough  guides.  For  instance,  wave  interference  effects 
are  often  detectable,  if  small,  when  light  passes  around  an  obstacle 
that  is  quite  a  bit  bigger  than  a  wavelength.  Also,  the  criterion  for 
when  we  need  the  particle  model  really  has  more  to  do  with  energy 
scales  than  distance  scales,  although  the  two  turn  out  to  be  related. 

The  alert  reader  may  have  noticed  that  the  wave  model  is  re¬ 
quired  at  scales  smaller  than  a  wavelength  of  light  (on  the  order  of  a 
micrometer  for  visible  light),  and  the  particle  model  is  demanded  on 
the  atomic  scale  or  lower  (a  typical  atom  being  a  nanometer  or  so  in 
size).  This  implies  that  at  the  smallest  scales  we  need  both  the  wave 
model  and  the  particle  model.  They  appear  incompatible,  so  how 
can  we  simultaneously  use  both?  The  answer  is  that  they  are  not 
as  incompatible  as  they  seem.  Light  is  both  a  wave  and  a  particle, 
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but  a  full  understanding  of  this  apparently  nonsensical  statement  is 
a  topic  for  section  13.2. 


i  /  Examples  of  ray  diagrams. 


Ray  diagrams 

Without  even  knowing  how  to  use  the  ray  model  to  calculate 
anything  numerically,  we  can  learn  a  great  deal  by  drawing  ray 
diagrams.  For  instance,  if  you  want  to  understand  how  eyeglasses 
help  you  to  see  in  focus,  a  ray  diagram  is  the  right  place  to  start. 
Many  students  under-utilize  ray  diagrams  in  optics  and  instead  rely 
on  rote  memorization  or  plugging  into  formulas.  The  trouble  with 
memorization  and  plug-ins  is  that  they  can  obscure  what’s  really 
going  on,  and  it  is  easy  to  get  them  wrong.  Often  the  best  plan  is  to 
do  a  ray  diagram  first,  then  do  a  numerical  calculation,  then  check 
that  your  numerical  results  are  in  reasonable  agreement  with  what 
you  expected  from  the  ray  diagram. 


j/1.  Correct.  2.  Incorrect:  im¬ 
plies  that  diffuse  reflection  only 
gives  one  ray  from  each  reflecting 
point.  3.  Correct,  but  unneces¬ 
sarily  complicated 


Figure  j  shows  some  guidelines  for  using  ray  diagrams  effectively. 
The  light  rays  bend  when  they  pass  out  through  the  surface  of  the 
water  (a  phenomenon  that  we’ll  discuss  in  more  detail  later).  The 
rays  appear  to  have  come  from  a  point  above  the  goldfish’s  actual 
location,  an  effect  that  is  familiar  to  people  who  have  tried  spear¬ 
fishing. 

•  A  stream  of  light  is  not  really  confined  to  a  finite  number  of 
narrow  lines.  We  just  draw  it  that  way.  In  j/1,  it  has  been 
necessary  to  choose  a  finite  number  of  rays  to  draw  (five), 
rather  than  the  theoretically  infinite  number  of  rays  that  will 
diverge  from  that  point. 
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•  There  is  a  tendency  to  conceptualize  rays  incorrectly  as  ob¬ 
jects.  In  his  Optics,  Newton  goes  out  of  his  way  to  caution 
the  reader  against  this,  saying  that  some  people  “consider  ... 
the  refraction  of  . . .  rays  to  be  the  bending  or  breaking  of  them 
in  their  passing  out  of  one  medium  into  another.”  But  a  ray 
is  a  record  of  the  path  traveled  by  light,  not  a  physical  thing 
that  can  be  bent  or  broken. 

•  In  theory,  rays  may  continue  infinitely  far  into  the  past  and 
future,  but  we  need  to  draw  lines  of  finite  length.  In  j/1,  a 
judicious  choice  has  been  made  as  to  where  to  begin  and  end 
the  rays.  There  is  no  point  in  continuing  the  rays  any  farther 
than  shown,  because  nothing  new  and  exciting  is  going  to 
happen  to  them.  There  is  also  no  good  reason  to  start  them 
earlier,  before  being  reflected  by  the  fish,  because  the  direction 
of  the  diffusely  reflected  rays  is  random  anyway,  and  unrelated 
to  the  direction  of  the  original,  incoming  ray. 

•  When  representing  diffuse  reflection  in  a  ray  diagram,  many 
students  have  a  mental  block  against  drawing  many  rays  fan¬ 
ning  out  from  the  same  point.  Often,  as  in  example  j/2,  the 
problem  is  the  misconception  that  light  can  only  be  reflected 
in  one  direction  from  one  point. 

•  Another  difficulty  associated  with  diffuse  reflection,  example 
j/3,  is  the  tendency  to  think  that  in  addition  to  drawing  many 
rays  coming  out  of  one  point,  we  should  also  be  drawing  many 
rays  coming  from  many  points.  In  j/1,  drawing  many  rays 
coming  out  of  one  point  gives  useful  information,  telling  us, 
for  instance,  that  the  fish  can  be  seen  from  any  angle.  Drawing 
many  sets  of  rays,  as  in  j/3,  does  not  give  us  any  more  useful 
information,  and  just  clutters  up  the  picture  in  this  example. 
The  only  reason  to  draw  sets  of  rays  fanning  out  from  more 
than  one  point  would  be  if  different  things  were  happening  to 
the  different  sets. 

Discussion  Question 

A  Suppose  an  intelligent  tool-using  fish  is  spear-hunting  for  humans. 
Draw  a  ray  diagram  to  show  how  the  fish  has  to  correct  its  aim.  Note 
that  although  the  rays  are  now  passing  from  the  air  to  the  water,  the  same 
rules  apply:  the  rays  are  closer  to  being  perpendicular  to  the  surface  when 
they  are  in  the  water,  and  rays  that  hit  the  air-water  interface  at  a  shallow 
angle  are  bent  the  most. 

12.1.4  Geometry  of  specular  reflection 

To  change  the  motion  of  a  material  object,  we  use  a  force.  Is 
there  any  way  to  exert  a  force  on  a  beam  of  light?  Experiments 
show  that  electric  and  magnetic  fields  do  not  deflect  light  beams,  so 
apparently  light  has  no  electric  charge.  Light  also  has  no  mass,  so 
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until  the  twentieth  century  it  was  believed  to  be  immune  to  gravity 
as  well.  Einstein  predicted  that  light  beams  would  be  very  slightly 
deflected  by  strong  gravitational  fields,  and  he  was  proved  correct 
by  observations  of  rays  of  starlight  that  came  close  to  the  sun,  but 
obviously  that’s  not  what  makes  mirrors  and  lenses  work! 

If  we  investigate  how  light  is  reflected  by  a  mirror,  we  will  find 
that  the  process  is  horrifically  complex,  but  the  final  result  is  sur¬ 
prisingly  simple.  What  actually  happens  is  that  the  light  is  made 
of  electric  and  magnetic  fields,  and  these  fields  accelerate  the  elec¬ 
trons  in  the  mirror.  Energy  from  the  light  beam  is  momentarily 
transformed  into  extra  kinetic  energy  of  the  electrons,  but  because 
the  electrons  are  accelerating  they  re-radiate  more  light,  convert¬ 
ing  their  kinetic  energy  back  into  light  energy.  We  might  expect 
this  to  result  in  a  very  chaotic  situation,  but  amazingly  enough,  the 
electrons  move  together  to  produce  a  new,  reflected  beam  of  light, 
which  obeys  two  simple  rules: 


•  The  angle  of  the  reflected  ray  is  the  same  as  that  of  the  incident 
ray. 


•  The  reflected  ray  lies  in  the  plane  containing  the  incident  ray 
and  the  normal  (perpendicular)  line.  This  plane  is  known  as 
the  plane  of  incidence. 


The  two  angles  can  be  defined  either  with  respect  to  the  normal, 
like  angles  B  and  C  in  the  figure,  or  with  respect  to  the  reflecting 
surface,  like  angles  A  and  D.  There  is  a  convention  of  several  hundred 
years’  standing  that  one  measures  the  angles  with  respect  to  the 
normal,  but  the  rule  about  equal  angles  can  logically  be  stated  either 
as  B=C  or  as  A=D. 

The  phenomenon  of  reflection  occurs  only  at  the  boundary  be¬ 
tween  two  media,  just  like  the  change  in  the  speed  of  light  that 
passes  from  one  medium  to  another.  As  we  have  seen  in  section  6.2, 
this  is  the  way  all  waves  behave. 

Most  people  are  surprised  by  the  fact  that  light  can  be  reflected 
back  from  a  less  dense  medium.  For  instance,  if  you  are  diving  and 
you  look  up  at  the  surface  of  the  water,  you  will  see  a  reflection  of 
yourself. 


k  /  The  geometry  of  specular 
reflection. 
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self-check  A 

Each  of  these  diagrams  is  supposed  to  show  two  different  rays  being 
reflected  from  the  same  point  on  the  same  mirror.  Which  are  correct, 
and  which  are  incorrect? 

2  \  f  3 

>  Answer,  p.  967 

Reversibility  of  light  rays 

The  fact  that  specular  reflection  displays  equal  angles  of  inci¬ 
dence  and  reflection  means  that  there  is  a  symmetry:  if  the  ray  had 
come  in  from  the  right  instead  of  the  left  in  the  figure  above,  the  an¬ 
gles  would  have  looked  exactly  the  same.  This  is  not  just  a  pointless 
detail  about  specular  reflection.  It’s  a  manifestation  of  a  very  deep 
and  important  fact  about  nature,  which  is  that  the  laws  of  physics 
do  not  distinguish  between  past  and  future.  Cannonballs  and  plan¬ 
ets  have  trajectories  that  are  equally  natural  in  reverse,  and  so  do 
light  rays.  This  type  of  symmetry  is  called  time- reversal  symmetry. 

Typically,  time-reversal  symmetry  is  a  characteristic  of  any  pro¬ 
cess  that  does  not  involve  heat.  For  instance,  the  planets  do  not 
experience  any  friction  as  they  travel  through  empty  space,  so  there 
is  no  frictional  heating.  We  should  thus  expect  the  time-reversed 
versions  of  their  orbits  to  obey  the  laws  of  physics,  which  they  do. 
In  contrast,  a  book  sliding  across  a  table  does  generate  heat  from 
friction  as  it  slows  down,  and  it  is  therefore  not  surprising  that  this 
type  of  motion  does  not  appear  to  obey  time-reversal  symmetry.  A 
book  lying  still  on  a  flat  table  is  never  observed  to  spontaneously 
start  sliding,  sucking  up  heat  energy  and  transforming  it  into  kinetic 
energy. 

Similarly,  the  only  situation  we’ve  observed  so  far  where  light 
does  not  obey  time-reversal  symmetry  is  absorption,  which  involves 
heat.  Your  skin  absorbs  visible  light  from  the  sun  and  heats  up, 
but  we  never  observe  people’s  skin  to  glow,  converting  heat  energy 
into  visible  light.  People’s  skin  does  glow  in  infrared  light,  but 
that  doesn’t  mean  the  situation  is  symmetric.  Even  if  you  absorb 
infrared,  you  don’t  emit  visible  light,  because  your  skin  isn’t  hot 
enough  to  glow  in  the  visible  spectrum. 

These  apparent  heat-related  asymmetries  are  not  actual  asym¬ 
metries  in  the  laws  of  physics.  The  interested  reader  may  wish  to 
learn  more  about  this  from  optional  chapter  5  on  thermodynamics. 

Ray  tracing  on  a  computer  example  1 

A  number  of  techniques  can  be  used  for  creating  artificial  visual 
scenes  in  computer  graphics.  Figure  I  shows  such  a  scene,  which 
was  created  by  the  brute-force  technique  of  simply  constructing 
a  very  detailed  ray  diagram  on  a  computer.  This  technique  re¬ 
quires  a  great  deal  of  computation,  and  is  therefore  too  slow  to 
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be  used  for  video  games  and  computer-animated  movies.  One 
trick  for  speeding  up  the  computation  is  to  exploit  the  reversibility 
of  light  rays.  If  one  was  to  trace  every  ray  emitted  by  every  illu¬ 
minated  surface,  only  a  tiny  fraction  of  those  would  actually  end 
up  passing  into  the  virtual  “camera,”  and  therefore  almost  all  of 
the  computational  effort  would  be  wasted.  One  can  instead  start 
a  ray  at  the  camera,  trace  it  backward  in  time,  and  see  where  it 
would  have  come  from.  With  this  technique,  there  is  no  wasted 
effort. 


I /This  photorealistic  image  of  a  nonexistent  countertop  was  pro¬ 
duced  completely  on  a  computer,  by  computing  a  complicated  ray 
diagram. 
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The  Ray  Model  of  Light 
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Discussion  Questions 


m  /  Discussion  question  B. 


A  If  a  light  ray  has  a  velocity  vector  with  components  cx  and  cy,  what 
will  happen  when  it  is  reflected  from  a  surface  that  lies  along  the  y  axis? 
Make  sure  your  answer  does  not  imply  a  change  in  the  ray’s  speed. 

B  Generalizing  your  reasoning  from  discussion  question  A,  what  will 
happen  to  the  velocity  components  of  a  light  ray  that  hits  a  corner,  as 
shown  in  the  figure,  and  undergoes  two  reflections? 

C  Three  pieces  of  sheet  metal  arranged  perpendicularly  as  shown  in 
the  figure  form  what  is  known  as  a  radar  corner.  Let’s  assume  that  the 
radar  corner  is  large  compared  to  the  wavelength  of  the  radar  waves,  so 
that  the  ray  model  makes  sense.  If  the  radar  corner  is  bathed  in  radar 
rays,  at  least  some  of  them  will  undergo  three  reflections.  Making  a  fur¬ 
ther  generalization  of  your  reasoning  from  the  two  preceding  discussion 
questions,  what  will  happen  to  the  three  velocity  components  of  such  a 
ray?  What  would  the  radar  corner  be  useful  for? 


o/The  solid  lines  are  physi¬ 
cally  possible  paths  for  light  rays 
traveling  from  A  to  B  and  from 
A  to  C.  They  obey  the  principle 
of  least  time.  The  dashed  lines 
do  not  obey  the  principle  of 
least  time,  and  are  not  physically 
possible. 


12.1.5  *  The  principle  of  least  time  for  reflection 

We  had  to  choose  between  an  unwieldy  explanation  of  reflection 
at  the  atomic  level  and  a  simpler  geometric  description  that  was 
not  as  fundamental.  There  is  a  third  approach  to  describing  the 
interaction  of  light  and  matter  which  is  very  deep  and  beautiful. 
Emphasized  by  the  twentieth-century  physicist  Richard  Feynman, 
it  is  called  the  principle  of  least  time,  or  Fermat’s  principle. 

Let’s  start  with  the  motion  of  light  that  is  not  interacting  with 
matter  at  all.  In  a  vacuum,  a  light  ray  moves  in  a  straight  line.  This 
can  be  rephrased  as  follows:  of  all  the  conceivable  paths  light  could 
follow  from  P  to  Q,  the  only  one  that  is  physically  possible  is  the 
path  that  takes  the  least  time. 

What  about  reflection?  If  light  is  going  to  go  from  one  point  to 
another,  being  reflected  on  the  way,  the  quickest  path  is  indeed  the 
one  with  equal  angles  of  incidence  and  reflection.  If  the  starting  and 
ending  points  are  equally  far  from  the  reflecting  surface,  o,  it’s  not 
hard  to  convince  yourself  that  this  is  true,  just  based  on  symmetry. 
There  is  also  a  tricky  and  simple  proof,  shown  in  figure  p,  for  the 
more  general  case  where  the  points  are  at  different  distances  from 
the  surface. 

Not  only  does  the  principle  of  least  time  work  for  light  in  a 


754 


Chapter  12  Optics 


B 


vacuum  and  light  undergoing  reflection,  we  will  also  see  in  a  later 
chapter  that  it  works  for  the  bending  of  light  when  it  passes  from 
one  medium  into  another. 

Although  it  is  beautiful  that  the  entire  ray  model  of  light  can 
be  reduced  to  one  simple  rule,  the  principle  of  least  time,  it  may 
seem  a  little  spooky  to  speak  as  if  the  ray  of  light  is  intelligent, 
and  has  carefully  planned  ahead  to  find  the  shortest  route  to  its 
destination.  How  does  it  know  in  advance  where  it’s  going?  What 
if  we  moved  the  mirror  while  the  light  was  en  route,  so  conditions 
along  its  planned  path  were  not  what  it  “expected?”  The  answer 
is  that  the  principle  of  least  time  is  really  a  shortcut  for  finding 
certain  results  of  the  wave  model  of  light,  which  is  the  topic  of  the 
last  chapter  of  this  book. 

There  are  a  couple  of  subtle  points  about  the  principle  of  least 
time.  First,  the  path  does  not  have  to  be  the  quickest  of  all  pos¬ 
sible  paths;  it  only  needs  to  be  quicker  than  any  path  that  differs 
infinitesimally  from  it.  In  figure  p,  for  instance,  light  could  get  from 
A  to  B  either  by  the  reflected  path  AQB  or  simply  by  going  straight 
from  A  to  B.  Although  AQB  is  not  the  shortest  possible  path,  it 
cannot  be  shortened  by  changing  it  infinitesimally,  e.g.,  by  moving 
Q  a  little  to  the  right  or  left.  On  the  other  hand,  path  APB  is  phys¬ 
ically  impossible,  because  it  is  possible  to  improve  on  it  by  moving 
point  P  infinitesimally  to  the  right. 

It’s  not  quite  right  to  call  this  the  principle  of  least  time.  In  fig¬ 
ure  q,  for  example,  the  four  physically  possible  paths  by  which  a  ray 
can  return  to  the  center  consist  of  two  shortest-time  paths  and  two 
longest-tinre  paths.  Strictly  speaking,  we  should  refer  to  the  prin¬ 
ciple  of  least  or  greatest  time ,  but  most  physicists  omit  the  niceties, 
and  assume  that  other  physicists  understand  that  both  maxima  and 
minima  are  possible. 


c 


p  /  Paths  AQB  and  APB  are 
two  conceivable  paths  that  a  ray 
could  follow  to  get  from  A  to  B 
with  one  reflection,  but  only  AQB 
is  physically  possible.  We  wish 
to  prove  that  the  path  AQB,  with 
equal  angles  of  incidence  and 
reflection,  is  shorter  than  any 
other  path,  such  as  APB.  The 
trick  is  to  construct  a  third  point, 
C,  lying  as  far  below  the  surface 
as  B  lies  above  it.  Then  path 
AQC  is  a  straight  line  whose 
length  is  the  same  as  AQB’s,  and 
path  APC  has  the  same  length  as 
path  APB.  Since  AQC  is  straight, 
it  must  be  shorter  than  any  other 
path  such  as  APC  that  connects 
A  and  C,  and  therefore  AQB  must 
be  shorter  than  any  path  such  as 
APB. 


q  /  Light  is  emitted  at  the  center 
of  an  elliptical  mirror.  There  are 
four  physically  possible  paths  by 
which  a  ray  can  be  reflected  and 
return  to  the  center. 
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a  /  An  image  formed  by  a 
mirror. 


12.2  Images  by  Reflection 

Infants  are  always  fascinated  by  the  antics  of  the  Baby  in  the  Mirror. 
Now  if  you  want  to  know  something  about  mirror  images  that  most 
people  don’t  understand,  try  this.  First  bring  this  page  closer  and 
closer  to  your  eyes,  until  you  can  no  longer  focus  on  it  without 
straining.  Then  go  in  the  bathroom  and  see  how  close  you  can 
get  your  face  to  the  surface  of  the  mirror  before  you  can  no  longer 
easily  focus  on  the  image  of  your  own  eyes.  You  will  find  that 
the  shortest  comfortable  eye-mirror  distance  is  much  less  than  the 
shortest  comfortable  eye-paper  distance.  This  demonstrates  that 
the  image  of  your  face  in  the  mirror  acts  as  if  it  had  depth  and 
existed  in  the  space  behind  the  mirror.  If  the  image  was  like  a  flat 
picture  in  a  book,  then  you  wouldn’t  be  able  to  focus  on  it  from 
such  a  short  distance. 

In  this  chapter  we  will  study  the  images  formed  by  flat  and 
curved  mirrors  on  a  qualitative,  conceptual  basis.  Although  this 
type  of  image  is  not  as  commonly  encountered  in  everyday  life  as 
images  formed  by  lenses,  images  formed  by  reflection  are  simpler  to 
understand,  so  we  discuss  them  first.  In  section  12.3  we  will  turn 
to  a  more  mathematical  treatment  of  images  made  by  reflection. 
Surprisingly,  the  same  equations  can  also  be  applied  to  lenses,  which 
are  the  topic  of  section  12.4. 

12.2.1  A  virtual  image 

We  can  understand  a  mirror  image  using  a  ray  diagram.  Figure 
a  shows  several  light  rays,  1,  that  originated  by  diffuse  reflection  at 
the  person’s  nose.  They  bounce  off  the  mirror,  producing  new  rays, 
2.  To  anyone  whose  eye  is  in  the  right  position  to  get  one  of  these 
rays,  they  appear  to  have  come  from  a  behind  the  mirror,  3,  where 
they  would  have  originated  from  a  single  point.  This  point  is  where 
the  tip  of  the  image-person’s  nose  appears  to  be.  A  similar  analysis 
applies  to  every  other  point  on  the  person’s  face,  so  it  looks  as 
though  there  was  an  entire  face  behind  the  mirror.  The  customary 
way  of  describing  the  situation  requires  some  explanation: 

Customary  description  in  physics:  There  is  an  image  of  the  face 
behind  the  mirror. 


Translation:  The  pattern  of  rays  coming  from  the  mirror  is  exactly 
the  same  as  it  would  be  if  there  were  a  face  behind  the  mirror. 
Nothing  is  really  behind  the  mirror. 


This  is  referred  to  as  a  virtual  image,  because  the  rays  do  not 
actually  cross  at  the  point  behind  the  mirror.  They  only  appear  to 
have  originated  there. 

self-check  B 
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Imagine  that  the  person  in  figure  a  moves  his  face  down  quite  a  bit  — 
a  couple  of  feet  in  real  life,  or  a  few  inches  on  this  scale  drawing.  The 
mirror  stays  where  it  is.  Draw  a  new  ray  diagram.  Will  there  still  be  an 
image?  If  so,  where  is  it  visible  from?  >  Answer,  p.  968 

The  geometry  of  specular  reflection  tells  us  that  rays  1  and  2 
are  at  equal  angles  to  the  normal  (the  imaginary  perpendicular  line 
piercing  the  mirror  at  the  point  of  reflection).  This  means  that 
ray  2’s  imaginary  continuation,  3,  forms  the  same  angle  with  the 
mirror  as  ray  1.  Since  each  ray  of  type  3  forms  the  same  angles  with 
the  mirror  as  its  partner  of  type  1,  we  see  that  the  distance  of  the 
image  from  the  mirror  is  the  same  as  that  of  the  actual  face  from 
the  mirror,  and  it  lies  directly  across  from  it.  The  image  therefore 
appears  to  be  the  same  size  as  the  actual  face. 


image  of 
eye  chart 


mirror 


3.5  m 


3.5  m 


eye  chart 


b  /  Example  2. 


An  eye  exam  example  2 

Figure  b  shows  a  typical  setup  in  an  optometrist’s  examination 
room.  The  patient’s  vision  is  supposed  to  be  tested  at  a  distance 
of  6  meters  (20  feet  in  the  U.S.),  but  this  distance  is  larger  than 
the  amount  of  space  available  in  the  room.  Therefore  a  mirror  is 
used  to  create  an  image  of  the  eye  chart  behind  the  wall. 

The  Praxinoscope  example  3 

Figure  c  shows  an  old-fashioned  device  called  a  praxinoscope, 
which  displays  an  animated  picture  when  spun.  The  removable 
strip  of  paper  with  the  pictures  printed  on  it  has  twice  the  radius 
of  the  inner  circle  made  of  flat  mirrors,  so  each  picture’s  virtual 
image  is  at  the  center.  As  the  wheel  spins,  each  picture’s  image 
is  replaced  by  the  next. 


c  /  The  praxinoscope. 
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Discussion  Question 


A  The  figure  shows  an  object  that  is  off  to  one  side  of  a  mirror.  Draw 
a  ray  diagram.  Is  an  image  formed?  If  so,  where  is  it,  and  from  which 
directions  would  it  be  visible? 


O 
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12.2.2  Curved  mirrors 

An  image  in  a  flat  mirror  is  a  pretechnological  example:  even 
animals  can  look  at  their  reflections  in  a  calm  pond.  We  now  pass 
to  our  first  nontrivial  example  of  the  manipulation  of  an  image  by 
technology:  an  image  in  a  curved  mirror.  Before  we  dive  in,  let’s 
consider  why  this  is  an  important  example.  If  it  was  just  a  ques¬ 
tion  of  memorizing  a  bunch  of  facts  about  curved  mirrors,  then  you 
would  rightly  rebel  against  an  effort  to  spoil  the  beauty  of  your  lib¬ 
erally  educated  brain  by  force-feeding  you  technological  trivia.  The 
reason  this  is  an  important  example  is  not  that  curved  mirrors  are 
so  important  in  and  of  themselves,  but  that  the  results  we  derive  for 
curved  bowl-shaped  mirrors  turn  out  to  be  true  for  a  large  class  of 
other  optical  devices,  including  mirrors  that  bulge  outward  rather 
than  inward,  and  lenses  as  well.  A  microscope  or  a  telescope  is  sim¬ 
ply  a  combination  of  lenses  or  mirrors  or  both.  What  you’re  really 
learning  about  here  is  the  basic  building  block  of  all  optical  devices 
from  movie  projectors  to  octopus  eyes. 

Because  the  mirror  in  figure  d  is  curved,  it  bends  the  rays  back 
closer  together  than  a  flat  mirror  would:  we  describe  it  as  converging. 
Note  that  the  term  refers  to  what  it  does  to  the  light  rays,  not  to  the 
physical  shape  of  the  mirror’s  surface  .  (The  surface  itself  would  be 
described  as  concave.  The  term  is  not  all  that  hard  to  remember, 
because  the  hollowed-out  interior  of  the  mirror  is  like  a  cave.)  It 
is  surprising  but  true  that  all  the  rays  like  3  really  do  converge  on 
a  point,  forming  a  good  image.  We  will  not  prove  this  fact,  but  it 
is  true  for  any  mirror  whose  curvature  is  gentle  enough  and  that 
is  symmetric  with  respect  to  rotation  about  the  perpendicular  line 
passing  through  its  center  (not  asymmetric  like  a  potato  chip).  The 
old-fashioned  method  of  making  mirrors  and  lenses  is  by  grinding 
them  in  grit  by  hand,  and  this  automatically  tends  to  produce  an 
almost  perfect  spherical  surface. 

Bending  a  ray  like  2  inward  implies  bending  its  imaginary  contin¬ 
uation  3  outward,  in  the  same  way  that  raising  one  end  of  a  seesaw 
causes  the  other  end  to  go  down.  The  image  therefore  forms  deeper 
behind  the  mirror.  This  doesn’t  just  show  that  there  is  extra  dis¬ 
tance  between  the  image-nose  and  the  mirror;  it  also  implies  that 
the  image  itself  is  bigger  from  front  to  back.  It  has  been  magnified 
in  the  front-to-back  direction. 

It  is  easy  to  prove  that  the  same  magnification  also  applies  to  the 
image’s  other  dimensions.  Consider  a  point  like  E  in  figure  e.  The 
trick  is  that  out  of  all  the  rays  diffusely  reflected  by  E,  we  pick  the 
one  that  happens  to  head  for  the  mirror’s  center,  C.  The  equal-angle 
property  of  specular  reflection  plus  a  little  straightforward  geometry 
easily  leads  us  to  the  conclusion  that  triangles  ABC  and  CDE  are 
the  same  shape,  with  ABC  being  simply  a  scaled-up  version  of  CDE. 
The  magnification  of  depth  equals  the  ratio  BC/CD,  and  the  up- 


d  /  An  image  formed  by  a 
curved  mirror. 


e/The  image  is  magnified 
by  the  same  factor  in  depth  and 
in  its  other  dimensions. 


f/ Increased  magnification 

always  comes  at  the  expense  of 
decreased  field  of  view. 
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down  magnification  is  AB/DE.  A  repetition  of  the  same  proof  shows 
that  the  magnification  in  the  third  dimension  (out  of  the  page)  is 
also  the  same.  This  means  that  the  image-head  is  simply  a  larger 
version  of  the  real  one,  without  any  distortion.  The  scaling  factor 
is  called  the  magnification,  M.  The  image  in  the  figure  is  magnified 
by  a  factor  M  =  1.9. 

Note  that  we  did  not  explicitly  specify  whether  the  mirror  was 
a  sphere,  a  paraboloid,  or  some  other  shape.  However,  we  assumed 
that  a  focused  image  would  be  formed,  which  would  not  necessarily 
be  true,  for  instance,  for  a  mirror  that  was  asymmetric  or  very  deeply 
curved. 

12.2.3  A  real  image 

If  we  start  by  placing  an  object  very  close  to  the  mirror,  g/1, 
and  then  move  it  farther  and  farther  away,  the  image  at  first  behaves 
as  we  would  expect  from  our  everyday  experience  with  flat  mirrors, 
receding  deeper  and  deeper  behind  the  mirror.  At  a  certain  point, 
however,  a  dramatic  change  occurs.  When  the  object  is  more  than 
a  certain  distance  from  the  mirror,  g/2,  the  image  appears  upside- 
down  and  in  fro?it.  of  the  mirror. 


g/1.  A  virtual  image.  2.  Areal 
image.  As  you’ll  verify  in  home¬ 
work  problem  12,  the  image  is 
upside-down 


Here’s  what’s  happened.  The  mirror  bends  light  rays  inward,  but 
when  the  object  is  very  close  to  it,  as  in  g/1,  the  rays  coming  from  a 
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given  point  on  the  object  are  too  strongly  diverging  (spreading)  for 
the  mirror  to  bring  them  back  together.  On  reflection,  the  rays  are 
still  diverging,  just  not  as  strongly  diverging.  But  when  the  object 
is  sufficiently  far  away,  g/2,  the  mirror  is  only  intercepting  the  rays 
that  came  out  in  a  narrow  cone,  and  it  is  able  to  bend  these  enough 
so  that  they  will  reconverge. 

Note  that  the  rays  shown  in  the  figure,  which  both  originated  at 
the  same  point  on  the  object,  reunite  when  they  cross.  The  point 
where  they  cross  is  the  image  of  the  point  on  the  original  object. 
This  type  of  image  is  called  a  real  image,  in  contradistinction  to  the 
virtual  images  we’ve  studied  before. 


Definition:  A  real  image  is  one  where  rays  actually  cross.  A  virtual 
image  is  a  point  from  which  rays  only  appear  to  have  come. 


The  use  of  the  word  “real”  is  perhaps  unfortunate.  It  sounds 
as  though  we  are  saying  the  image  was  an  actual  material  object, 
which  of  course  it  is  not. 

The  distinction  between  a  real  image  and  a  virtual  image  is  an 
important  one,  because  a  real  image  can  be  projected  onto  a  screen 
or  photographic  film.  If  a  piece  of  paper  is  inserted  in  figure  g/2 
at  the  location  of  the  image,  the  image  will  be  visible  on  the  paper 
(provided  the  object  is  bright  and  the  room  is  dark).  Your  eye  uses 
a  lens  to  make  a  real  image  on  the  retina. 

self-check  C 

Sketch  another  copy  of  the  face  in  figure  g/1,  even  farther  from  the 
mirror,  and  draw  a  ray  diagram.  What  has  happened  to  the  location  of 
the  image?  o  Answer,  p.  968 

12.2.4  Images  of  images 

If  you  are  wearing  glasses  right  now,  then  the  light  rays  from  the 
page  are  being  manipulated  first  by  your  glasses  and  then  by  the  lens 
of  your  eye.  You  might  think  that  it  would  be  extremely  difficult 
to  analyze  this,  but  in  fact  it  is  quite  easy.  In  any  series  of  optical 
elements  (mirrors  or  lenses  or  both) ,  each  element  works  on  the  rays 
furnished  by  the  previous  element  in  exactly  the  same  manner  as  if 
the  image  formed  by  the  previous  element  was  an  actual  object. 

Figure  h  shows  an  example  involving  only  mirrors.  The  Newto¬ 
nian  telescope,  invented  by  Isaac  Newton,  consists  of  a  large  curved 
mirror,  plus  a  second,  flat  mirror  that  brings  the  light  out  of  the 
tube.  (In  very  large  telescopes,  there  may  be  enough  room  to  put 
a  camera  or  even  a  person  inside  the  tube,  in  which  case  the  sec¬ 
ond  mirror  is  not  needed.)  The  tube  of  the  telescope  is  not  vital;  it 
is  mainly  a  structural  element,  although  it  can  also  be  helpful  for 
blocking  out  stray  light.  The  lens  has  been  removed  from  the  front 
of  the  camera  body,  and  is  not  needed  for  this  setup.  Note  that  the 


h  /  A  Newtonian  telescope 
being  used  with  a  camera. 
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i/A  Newtonian  telescope 
being  used  for  visual  rather  than 
photographic  observing.  In  real 
life,  an  eyepiece  lens  is  normally 
used  for  additional  magnification, 
but  this  simpler  setup  will  also 
work. 


two  sample  rays  have  been  drawn  parallel,  because  an  astronomical 
telescope  is  used  for  viewing  objects  that  are  extremely  far  away. 
These  two  “parallel”  lines  actually  meet  at  a  certain  point,  say  a 
crater  on  the  moon,  so  they  can’t  actually  be  perfectly  parallel,  but 
they  are  parallel  for  all  practical  purposes  since  we  would  have  to 
follow  them  upward  for  a  quarter  of  a  million  miles  to  get  to  the 
point  where  they  intersect. 

The  large  curved  mirror  by  itself  would  form  an  image  I,  but  the 
small  flat  mirror  creates  an  image  of  the  image,  V .  The  relationship 
between  I  and  l'  is  exactly  the  same  as  it  would  be  if  I  was  an  actual 
object  rather  than  an  image:  I  and  I7  are  at  equal  distances  from 
the  plane  of  the  mirror,  and  the  line  between  them  is  perpendicular 
to  the  plane  of  the  mirror. 

One  surprising  wrinkle  is  that  whereas  a  flat  mirror  used  by  itself 
forms  a  virtual  image  of  an  object  that  is  real,  here  the  mirror  is 
forming  a  real  image  of  virtual  image  I.  This  shows  how  pointless  it 
would  be  to  try  to  memorize  lists  of  facts  about  what  kinds  of  images 
are  formed  by  various  optical  elements  under  various  circumstances. 
You  are  better  off  simply  drawing  a  ray  diagram. 


Although  the  main  point  here  was  to  give  an  example  of  an  image 
of  an  image,  figure  i  also  shows  an  interesting  case  where  we  need 
to  make  the  distinction  between  magnification  and  angular  mag¬ 
nification.  If  you  are  looking  at  the  moon  through  this  telescope, 
then  the  images  I  and  V  are  much  smaller  than  the  actual  moon. 
Otherwise,  for  example,  image  I  would  not  fit  inside  the  telescope! 
However,  these  images  are  very  close  to  your  eye  compared  to  the 
actual  moon.  The  small  size  of  the  image  has  been  more  than  com¬ 
pensated  for  by  the  shorter  distance.  The  important  thing  here  is 
the  amount  of  angle  within  your  field  of  view  that  the  image  covers, 
and  it  is  this  angle  that  has  been  increased.  The  factor  by  which  it 
is  increased  is  called  the  angular  magnification ,  Ma. 
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k  /  The  person  uses  a  mirror  to 
get  a  view  of  both  sides  of  the 
ladybug.  Although  the  flat  mirror 
has  M  =  1,  it  doesn’t  give  an  an¬ 
gular  magnification  of  1.  The  im¬ 
age  is  farther  from  the  eye  than 
the  object,  so  the  angular  magni¬ 
fication  Ma  =  ai/oc0  is  less  than 
one. 


Discussion  Questions 

A  Locate  the  images  of  you  that  will  be  formed  if  you  stand  between 
two  parallel  mirrors. 


O 
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B  Locate  the  images  formed  by  two  perpendicular  mirrors,  as  in  the 
figure.  What  happens  if  the  mirrors  are  not  perfectly  perpendicular? 
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C  Locate  the  images  formed  by  the  periscope. 
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a /The  relationship  between 
the  object’s  position  and  the 
image’s  can  be  expressed  in 
terms  of  the  angles  0O  and  0,-. 


12.3  Images,  Quantitatively 

It  sounds  a  bit  odd  when  a  scientist  refers  to  a  theory  as  “beauti¬ 
ful,”  but  to  those  in  the  know  it  makes  perfect  sense.  One  mark 
of  a  beautiful  theory  is  that  it  surprises  us  by  being  simple.  The 
mathematical  theory  of  lenses  and  curved  mirrors  gives  us  just  such 
a  surprise.  We  expect  the  subject  to  be  complex  because  there  are 
so  many  cases:  a  converging  mirror  forming  a  real  image,  a  diverg¬ 
ing  lens  that  makes  a  virtual  image,  and  so  on  for  a  total  of  six 
possibilities.  If  we  want  to  predict  the  location  of  the  images  in  all 
these  situations,  we  might  expect  to  need  six  different  equations, 
and  six  more  for  predicting  magnifications.  Instead,  it  turns  out 
that  we  can  use  just  one  equation  for  the  location  of  the  image  and 
one  equation  for  its  magnification,  and  these  two  equations  work 
in  all  the  different  cases  with  no  changes  except  for  plus  and  minus 
signs.  This  is  the  kind  of  thing  the  physicist  Eugene  Wigner  referred 
to  as  “the  unreasonable  effectiveness  of  mathematics.”  Sometimes 
we  can  find  a  deeper  reason  for  this  kind  of  unexpected  simplicity, 
but  sometimes  it  almost  seems  as  if  God  went  out  of  Her  way  to 
make  the  secrets  of  universe  susceptible  to  attack  by  the  human 
thought-tool  called  math. 

12.3.1  A  real  image  formed  by  a  converging  mirror 

Location  of  the  image 

We  will  now  derive  the  equation  for  the  location  of  a  real  image 
formed  by  a  converging  mirror.  We  assume  for  simplicity  that  the 
mirror  is  spherical,  but  actually  this  isn’t  a  restrictive  assumption, 
because  any  shallow,  symmetric  curve  can  be  approximated  by  a 
sphere.  The  shape  of  the  mirror  can  be  specified  by  giving  the 
location  of  its  center,  C.  A  deeply  curved  mirror  is  a  sphere  with  a 
small  radius,  so  C  is  close  to  it,  while  a  weakly  curved  mirror  has 
C  farther  away.  Given  the  point  O  where  the  object  is,  we  wish  to 
find  the  point  I  where  the  image  will  be  formed. 

To  locate  an  image,  we  need  to  track  a  minimum  of  two  rays 
coming  from  the  same  point.  Since  we  have  proved  in  the  previous 
chapter  that  this  type  of  image  is  not  distorted,  we  can  use  an  on-axis 
point,  O,  on  the  object,  as  in  figure  a/1.  The  results  we  derive  will 
also  hold  for  off-axis  points,  since  otherwise  the  image  would  have 
to  be  distorted,  which  we  know  is  not  true.  We  let  one  of  the  rays  be 
the  one  that  is  emitted  along  the  axis;  this  ray  is  especially  easy  to 
trace,  because  it  bounces  straight  back  along  the  axis  again.  As  our 
second  ray,  we  choose  one  that  strikes  the  mirror  at  a  distance  of  1 
from  the  axis.  “One  what?”  asks  the  astute  reader.  The  answer  is 
that  it  doesn’t  really  matter.  When  a  mirror  has  shallow  curvature, 
all  the  reflected  rays  hit  the  same  point,  so  1  could  be  expressed 
in  any  units  you  like.  It  could,  for  instance,  be  1  cm,  unless  your 
mirror  is  smaller  than  1  cm! 
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The  only  way  to  find  out  anything  mathematical  about  the  rays 
is  to  use  the  sole  mathematical  fact  we  possess  concerning  specular 
reflection:  the  incident  and  reflected  rays  form  equal  angles  with 
respect  to  the  normal,  which  is  shown  as  a  dashed  line.  Therefore 
the  two  angles  shown  in  figure  a/2  are  the  same,  and  skipping  some 
straightforward  geometry,  this  leads  to  the  visually  reasonable  result 
that  the  two  angles  in  figure  a/3  are  related  as  follows: 

6i  +  0o  =  constant 


(Note  that  9{  and  0o ,  which  are  measured  from  the  image  and  the 
object,  not  from  the  eye  like  the  angles  we  referred  to  in  discussing 
angular  magnification  on  page  762.)  For  example,  move  O  farther 
from  the  mirror.  The  top  angle  in  figure  a/2  is  increased,  so  the 
bottom  angle  must  increase  by  the  same  amount,  causing  the  image 
point,  I,  to  move  closer  to  the  mirror.  In  terms  of  the  angles  shown  in 
figure  a/3,  the  more  distant  object  has  resulted  in  a  smaller  angle  0o, 
while  the  closer  image  corresponds  to  a  larger  0*;  One  angle  increases 
by  the  same  amount  that  the  other  decreases,  so  their  sum  remains 
constant.  These  changes  are  summarized  in  figure  a/4. 

The  sum  9t  +  0o  is  a  constant.  What  does  this  constant  repre¬ 
sent?  Geometrically,  we  interpret  it  as  double  the  angle  made  by 
the  dashed  radius  line.  Optically,  it  is  a  measure  of  the  strength  of 
the  mirror,  i.e.,  how  strongly  the  mirror  focuses  light,  and  so  we  call 
it  the  focal  angle,  Of, 

0i  +  0o  =  Of. 

Suppose,  for  example,  that  we  wish  to  use  a  quick  and  dirty  optical 
test  to  determine  how  strong  a  particular  mirror  is.  We  can  lay 
it  on  the  floor  as  shown  in  figure  c,  and  use  it  to  make  an  image 
of  a  lamp  mounted  on  the  ceiling  overhead,  which  we  assume  is 
very  far  away  compared  to  the  radius  of  curvature  of  the  mirror, 
so  that  the  mirror  intercepts  only  a  very  narrow  cone  of  rays  from 
the  lamp.  This  cone  is  so  narrow  that  its  rays  are  nearly  parallel, 
and  0o  is  nearly  zero.  The  real  image  can  be  observed  on  a  piece  of 
paper.  By  moving  the  paper  nearer  and  farther,  we  can  bring  the 
image  into  focus,  at  which  point  we  know  the  paper  is  located  at 
the  image  point.  Since  0o  ps  0,  we  have  0i  «  Of,  and  we  can  then 
determine  this  mirror’s  focal  angle  either  by  measuring  0t  directly 
with  a  protractor,  or  indirectly  via  trigonometry.  A  strong  mirror 
will  bring  the  rays  together  to  form  an  image  close  to  the  mirror, 
and  these  rays  will  form  a  blunt-angled  cone  with  a  large  0{  and  Of. 

An  alternative  optical  test  example  4 

>  Figure  c  shows  an  alternative  optical  test.  Rather  than  placing 
the  object  at  infinity  as  in  figure  b,  we  adjust  it  so  that  the  image 
is  right  on  top  of  the  object.  Points  O  and  I  coincide,  and  the  rays 
are  reflected  right  back  on  top  of  themselves.  If  we  measure  the 
angle  0  shown  in  figure  c,  how  can  we  find  the  focal  angle? 


c  /  Example  4,  an  alternative 
test  for  finding  the  focal  angle. 
The  mirror  is  the  same  as  in 
figure  b. 
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d/The  object  and  image  dis¬ 
tances 


e  /  Mirror  1  is  weaker  than 
mirror  2.  It  has  a  shallower 
curvature,  a  longer  focal  length, 
and  a  smaller  focal  angle.  It 
reflects  rays  at  angles  not  much 
different  than  those  that  would  be 
produced  with  a  flat  mirror. 


>  The  object  and  image  angles  are  the  same;  the  angle  labeled 
6  in  the  figure  equals  both  of  them.  We  therefore  have  0,  +  0O  = 
0  =  Qf.  Comparing  figures  b  and  c,  it  is  indeed  plausible  that  the 
angles  are  related  by  a  factor  of  two. 

At  this  point,  we  could  consider  our  work  to  be  done.  Typically, 
we  know  the  strength  of  the  mirror,  and  we  want  to  find  the  image 
location  for  a  given  object  location.  Given  the  mirror’s  focal  angle 
and  the  object  location,  we  can  determine  0O  by  trigonometry,  sub¬ 
tract  to  find  6i  =  6 f  —  0o,  and  then  do  more  trig  to  find  the  image 
location. 

There  is,  however,  a  shortcut  that  can  save  us  from  doing  so 
much  work.  Figure  a/3  shows  two  right  triangles  whose  legs  of 
length  1  coincide  and  whose  acute  angles  are  0o  and  0,  .  These  can 
be  related  by  trigonometry  to  the  object  and  image  distances  shown 
in  figure  d: 

tan  0O  =  1  /d0  tan  0,  =  1/di 

Ever  since  chapter  2,  we’ve  been  assuming  small  angles.  For  small 
angles,  we  can  use  the  small-angle  approximation  tanx  ~  x  (for  x 
in  radians),  giving  simply 

0O  =  l /d0  Oi  =  1/di. 

We  likewise  define  a  distance  called  the  focal  length,  /  according  to 
Of  =  1/  f.  In  figure  b,  /  is  the  distance  from  the  mirror  to  the  place 
where  the  rays  cross.  We  can  now  reexpress  the  equation  relating 
the  object  and  image  positions  as 

1  _  t  1 

7  di  +  d0 ' 

Figure  e  summarizes  the  interpretation  of  the  focal  length  and  focal 
angle.1 

Which  form  is  better,  Of  =  6i  +  0o  or  1//  =  1/di  +  1  /d0l  The 
angular  form  has  in  its  favor  its  simplicity  and  its  straightforward 
visual  interpretation,  but  there  are  two  reasons  why  we  might  prefer 
the  second  version.  First,  the  numerical  values  of  the  angles  depend 
on  what  we  mean  by  “one  unit”  for  the  distance  shown  as  1  in 

1There  is  a  standard  piece  of  terminology  which  is  that  the  “focal  point”  is 
the  point  lying  on  the  optical  axis  at  a  distance  from  the  mirror  equal  to  the  focal 
length.  This  term  isn’t  particularly  helpful,  because  it  names  a  location  where 
nothing  normally  happens.  In  particular,  it  is  not  normally  the  place  where  the 
rays  come  to  a  focus!  —  that  would  be  the  image  point.  In  other  words,  we 
don’t  normally  have  di  =  /,  unless  perhaps  d0  =  oo.  A  recent  online  discussion 
among  some  physics  teachers  (https://carnot.physics.buffalo.edu/archives,  Feb. 
2006)  showed  that  many  disliked  the  terminology,  felt  it  was  misleading,  or  didn’t 
know  it  and  would  have  misinterpreted  it  if  they  had  come  across  it.  That  is,  it 
appears  to  be  what  grammarians  call  a  “skunked  term”  —  a  word  that  bothers 
half  the  population  when  it’s  used  incorrectly,  and  the  other  half  when  it’s  used 
correctly. 
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figure  a/1.  Second,  it  is  usually  easier  to  measure  distances  rather 
than  angles,  so  the  distance  form  is  more  convenient  for  number 
crunching.  Neither  form  is  superior  overall,  and  we  will  often  need 
to  use  both  to  solve  any  given  problem.2 

A  searchlight  example  5 

Suppose  we  need  to  create  a  parallel  beam  of  light,  as  in  a  search¬ 
light.  Where  should  we  place  the  lightbulb?  A  parallel  beam  has 
zero  angle  between  its  rays,  so  0,  =  0.  To  place  the  lightbulb 
correctly,  however,  we  need  to  know  a  distance,  not  an  angle: 
the  distance  d0  between  the  bulb  and  the  mirror.  The  problem 
involves  a  mixture  of  distances  and  angles,  so  we  need  to  get 
everything  in  terms  of  one  or  the  other  in  order  to  solve  it.  Since 
the  goal  is  to  find  a  distance,  let’s  figure  out  the  image  distance 
corresponding  to  the  given  angle  0,  =  0.  These  are  related  by 
dj  =  1  /0/,  so  we  have  dj  =  oo.  (Yes,  dividing  by  zero  gives  infin¬ 
ity.  Don’t  be  afraid  of  infinity.  Infinity  is  a  useful  problem-solving 
device.)  Solving  the  distance  equation  for  d0,  we  have 

do  =  (^/f-^/di^l 

=  (i/f-or1 

=  t 

The  bulb  has  to  be  placed  at  a  distance  from  the  mirror  equal  to 
its  focal  point. 

Diopters  example  6 

An  equation  like  d,  =  1  /0(  really  doesn’t  make  sense  in  terms  of 
units.  Angles  are  unitless,  since  radians  aren’t  really  units,  so 
the  right-hand  side  is  unitless.  We  can’t  have  a  left-hand  side 
with  units  of  distance  if  the  right-hand  side  of  the  same  equation 
is  unitless.  This  is  an  artifact  of  my  cavalier  statement  that  the 
conical  bundles  of  rays  spread  out  to  a  distance  of  1  from  the  axis 
where  they  strike  the  mirror,  without  specifying  the  units  used  to 
measure  this  1.  In  real  life,  optometrists  define  the  thing  we’re 
calling  0/  =  1  /dj  as  the  “dioptric  strength”  of  a  lens  or  mirror, 
and  measure  it  in  units  of  inverse  meters  (mr1),  also  known  as 
diopters  (1  D=1  m^1). 

Magnification 

We  have  already  discussed  in  the  previous  chapter  how  to  find 
the  magnification  of  a  virtual  image  made  by  a  curved  mirror.  The 
result  is  the  same  for  a  real  image,  and  we  omit  the  proof,  which 
is  very  similar.  In  our  new  notation,  the  result  is  M  =  di/dQ.  A 
numerical  example  is  given  in  subsection  12.3.2. 


2I  would  like  to  thank  Fouad  Ajami  for  pointing  out  the  pedagogical  advan¬ 
tages  of  using  both  equations  side  by  side. 
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12.3.2  Other  cases  with  curved  mirrors 


The  equation  d{  =  (1  //  —  1  /d0)-1  can  easily  produce  a  negative 
result,  but  we  have  been  thinking  of  dt  as  a  distance,  and  distances 
can’t  be  negative.  A  similar  problem  occurs  with  6i  =  Of  —  0O  for 
0o  >  Of.  What’s  going  on  here? 

The  interpretation  of  the  angular  equation  is  straightforward. 
As  we  bring  the  object  closer  and  closer  to  the  image,  0o  gets  bigger 
and  bigger,  and  eventually  we  reach  a  point  where  0o  =  Of  and 
Oi  =  0.  This  large  object  angle  represents  a  bundle  of  rays  forming 
a  cone  that  is  very  broad,  so  broad  that  the  mirror  can  no  longer 
bend  them  back  so  that  they  reconverge  on  the  axis.  The  image 
angle  Oi  =  0  represents  an  outgoing  bundle  of  rays  that  are  parallel. 
The  outgoing  rays  never  cross,  so  this  is  not  a  real  image,  unless  we 
want  to  be  charitable  and  say  that  the  rays  cross  at  infinity.  If  we 
go  on  bringing  the  object  even  closer,  we  get  a  virtual  image. 


f  /  A  graph  of  the  image  distance 
dj  as  a  function  of  the  object  dis¬ 
tance  d0. 


To  analyze  the  distance  equation,  let’s  look  at  a  graph  of  dj  as 
a  function  of  d0 ■  The  branch  on  the  upper  right  corresponds  to  the 
case  of  a  real  image.  Strictly  speaking,  this  is  the  only  part  of  the 
graph  that  we’ve  proven  corresponds  to  reality,  since  we  never  did 
any  geometry  for  other  cases,  such  as  virtual  images.  As  discussed  in 
the  previous  section,  making  dQ  bigger  causes  di  to  become  smaller, 
and  vice-versa. 
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Letting  dQ  be  less  than  /  is  equivalent  to  0o  >  Of  :  a  virtual  image 
is  produced  on  the  far  side  of  the  mirror.  This  is  the  first  example 
of  Wigner’s  “unreasonable  effectiveness  of  mathematics”  that  we 
have  encountered  in  optics.  Even  though  our  proof  depended  on 
the  assumption  that  the  image  was  real,  the  equation  we  derived 
turns  out  to  be  applicable  to  virtual  images,  provided  that  we  either 
interpret  the  positive  and  negative  signs  in  a  certain  way,  or  else 
modify  the  equation  to  have  different  positive  and  negative  signs. 

self-check  D 

Interpret  the  three  places  where,  in  physically  realistic  parts  of  the  graph, 
the  graph  approaches  one  of  the  dashed  lines.  [This  will  come  more 
naturally  if  you  have  learned  the  concept  of  limits  in  a  math  class.]  > 
Answer,  p.  968 

A  flat  mirror  example  7 

We  can  even  apply  the  equation  to  a  flat  mirror.  As  a  sphere  gets 
bigger  and  bigger,  its  surface  is  more  and  more  gently  curved. 
The  planet  Earth  is  so  large,  for  example,  that  we  cannot  even 
perceive  the  curvature  of  its  surface.  To  represent  a  flat  mirror,  we 
let  the  mirror’s  radius  of  curvature,  and  its  focal  length,  become 
infinite.  Dividing  by  infinity  gives  zero,  so  we  have 


1  /d0  =  -1 M, 


or 


d0  =  -dj. 

If  we  interpret  the  minus  sign  as  indicating  a  virtual  image  on  the 
far  side  of  the  mirror  from  the  object,  this  makes  sense. 

It  turns  out  that  for  any  of  the  six  possible  combinations  of 
real  or  virtual  images  formed  by  converging  or  diverging  lenses  or 
mirrors,  we  can  apply  equations  of  the  form 

Of  =  6i  +  0o 


and 


1  _  1  1 
7  “  di  +  J0' 

with  only  a  modification  of  plus  or  minus  signs.  There  are  two  pos¬ 
sible  approaches  here.  The  approach  we  have  been  using  so  far  is 
the  more  popular  approach  in  American  textbooks:  leave  the  equa¬ 
tion  the  same,  but  attach  interpretations  to  the  resulting  negative 
or  positive  values  of  the  variables.  The  trouble  with  this  approach 
is  that  one  is  then  forced  to  memorize  tables  of  sign  conventions, 
e.g.,  that  the  value  of  di  should  be  negative  when  the  image  is  a 
virtual  image  formed  by  a  converging  mirror.  Positive  and  negative 
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signs  also  have  to  be  memorized  for  focal  lengths.  Ugh!  It’s  highly 
unlikely  that  any  student  has  ever  retained  these  lengthy  tables  in 
his  or  her  mind  for  more  than  five  minutes  after  handing  in  the  final 
exam  in  a  physics  course.  Of  course  one  can  always  look  such  things 
up  when  they  are  needed,  but  the  effect  is  to  turn  the  whole  thing 
into  an  exercise  in  blindly  plugging  numbers  into  formulas. 

As  you  have  gathered  by  now,  there  is  another  method  which  I 
think  is  better,  and  which  I’ll  use  throughout  the  rest  of  this  book. 
In  this  method,  all  distances  and  angles  are  positive  by  definition, 
and  we  put  in  positive  and  negative  signs  in  the  equations  depending 
on  the  situation.  (I  thought  I  was  the  first  to  invent  this  method,  but 
I've  been  told  that  this  is  known  as  the  European  sign  convention, 
and  that  it’s  fairly  common  in  Europe.)  Rather  than  memorizing 
these  signs,  we  start  with  the  generic  equations 

Of  =  ±0i  ±  0o 
1  _  1  1 
/  ±di±  dfi 

and  then  determine  the  signs  by  a  two-step  method  that  depends  on 
ray  diagrams.  There  are  really  only  two  signs  to  determine,  not  four; 
the  signs  in  the  two  equations  match  up  in  the  way  you’d  expect. 
The  method  is  as  follows: 

1.  Use  ray  diagrams  to  decide  whether  0o  and  Qi  vary  in  the  same 
way  or  in  opposite  ways.  (In  other  words,  decide  whether  making  0o 
greater  results  in  a  greater  value  of  6t  or  a  smaller  one.)  Based  on 
this,  decide  whether  the  two  signs  in  the  angle  equation  are  the  same 
or  opposite.  If  the  signs  are  opposite,  go  on  to  step  2  to  determine 
which  is  positive  and  which  is  negative. 

2.  If  the  signs  are  opposite,  we  need  to  decide  which  is  the 
positive  one  and  which  is  the  negative.  Since  the  focal  angle  is  never 
negative,  the  smaller  angle  must  be  the  one  with  a  minus  sign. 

In  step  1,  many  students  have  trouble  drawing  the  ray  diagram 
correctly.  For  simplicity,  you  should  always  do  your  diagram  for  a 
point  on  the  object  that  is  on  the  axis  of  the  mirror,  and  let  one 
of  your  rays  be  the  one  that  is  emitted  along  the  axis  and  reflected 
straight  back  on  itself,  as  in  the  figures  in  subsection  12.3.1.  As 
shown  in  figure  a/4  in  subsection  12.3.1,  there  are  four  angles  in¬ 
volved:  two  at  the  mirror,  one  at  the  object  ( 0o ),  and  one  at  the 
image  (#,;).  Make  sure  to  draw  in  the  normal  to  the  mirror  so  that 
you  can  see  the  two  angles  at  the  mirror.  These  two  angles  are 
equal,  so  as  you  change  the  object  position,  they  fan  out  or  fan  in, 
like  opening  or  closing  a  book.  Once  you’ve  drawn  this  effect,  you 
should  easily  be  able  to  tell  whether  0o  and  6i  change  in  the  same 
way  or  in  opposite  ways. 

Although  focal  lengths  are  always  positive  in  the  method  used 
in  this  book,  you  should  be  aware  that  diverging  mirrors  and  lenses 
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are  assigned  negative  focal  lengths  in  the  other  method,  so  if  you 
see  a  lens  labeled  /  =  —30  cm,  you’ll  know  what  it  means. 

An  anti-shoplifting  mirror  example  8 

>  Convenience  stores  often  install  a  diverging  mirror  so  that  the 
clerk  has  a  view  of  the  whole  store  and  can  catch  shoplifters.  Use 
a  ray  diagram  to  show  that  the  image  is  reduced,  bringing  more 
into  the  clerk’s  field  of  view.  If  the  focal  length  of  the  mirror  is  3.0 
m,  and  the  mirror  is  7.0  m  from  the  farthest  wall,  how  deep  is  the 
image  of  the  store? 


o  As  shown  in  ray  diagram  g/1 ,  dj  is  less  than  d0.  The  magnifica¬ 
tion,  M  =  dj/do,  will  be  less  than  one,  i.e.,  the  image  is  actually 
reduced  rather  than  magnified. 

Apply  the  method  outlined  above  for  determining  the  plus  and 
minus  signs.  Step  1:  The  object  is  the  point  on  the  opposite 
wall.  As  an  experiment,  g/2,  move  the  object  closer.  I  did  these 
drawings  using  illustration  software,  but  if  you  were  doing  them 
by  hand,  you’d  want  to  make  the  scale  much  larger  for  greater 
accuracy.  Also,  although  I  split  figure  g  into  two  separate  drawings 
in  order  to  make  them  easier  to  understand,  you’re  less  likely  to 
make  a  mistake  if  you  do  them  on  top  of  each  other. 

The  two  angles  at  the  mirror  fan  out  from  the  normal.  Increasing 
0O  has  clearly  made  0,  larger  as  well.  (All  four  angles  got  big¬ 
ger.)  There  must  be  a  cancellation  of  the  effects  of  changing  the 
two  terms  on  the  right  in  the  same  way,  and  the  only  way  to  get 
such  a  cancellation  is  if  the  two  terms  in  the  angle  equation  have 
opposite  signs: 

df  =  +0/  —  0O 


Step  2:  Now  which  is  the  positive  term  and  which  is  negative? 
Since  the  image  angle  is  bigger  than  the  object  angle,  the  angle 
equation  must  be 

8f  =  0/  -  0O, 


in  order  to  give  a  positive  result  for  the  focal  angle.  The  signs  of 
the  distance  equation  behave  the  same  way: 

1  _  1  1 

1  ~  d,  d0  ‘ 

Solving  for  dj,  we  find 


=  2.1  m. 


The  image  of  the  store  is  reduced  by  a  factor  of  2. 1/7.0  =  0.3, 
i.e.,  it  is  smaller  by  70%. 


g  /  Example  8. 
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h  /  A  diverging  mirror  in  the  shape 
of  a  sphere.  The  image  is  re¬ 
duced  (M  <  1).  This  is  similar 
to  example  8,  but  here  the  image 
is  distorted  because  the  mirror’s 
curve  is  not  shallow. 


A  shortcut  for  real  images  example  9 

In  the  case  of  a  real  image,  there  is  a  shortcut  for  step  1,  the 
determination  of  the  signs.  In  a  real  image,  the  rays  cross  at 
both  the  object  and  the  image.  We  can  therefore  time-reverse  the 
ray  diagram,  so  that  all  the  rays  are  coming  from  the  image  and 
reconverging  at  the  object.  Object  and  image  swap  roles.  Due 
to  this  time-reversal  symmetry,  the  object  and  image  cannot  be 
treated  differently  in  any  of  the  equations,  and  they  must  therefore 
have  the  same  signs.  They  are  both  positive,  since  they  must  add 
up  to  a  positive  result. 

12.3.3  x  Aberrations 

An  imperfection  or  distortion  in  an  image  is  called  an  aberra¬ 
tion.  An  aberration  can  be  produced  by  a  flaw  in  a  lens  or  mirror, 
but  even  with  a  perfect  optical  surface  some  degree  of  aberration  is 
unavoidable.  To  see  why,  consider  the  mathematical  approximation 
we’ve  been  making,  which  is  that  the  depth  of  the  mirror’s  curve 
is  small  compared  to  dQ  and  dj.  Since  only  a  flat  mirror  can  sat¬ 
isfy  this  shallow-mirror  condition  perfectly,  any  curved  mirror  will 
deviate  somewhat  from  the  mathematical  behavior  we  derived  by 
assuming  that  condition.  There  are  two  main  types  of  aberration  in 
curved  mirrors,  and  these  also  occur  with  lenses. 

(1)  An  object  on  the  axis  of  the  lens  or  mirror  may  be  imaged 
correctly,  but  off-axis  objects  may  be  out  of  focus  or  distorted.  In 
a  camera,  this  type  of  aberration  would  show  up  as  a  fuzziness  or 
warping  near  the  sides  of  the  picture  when  the  center  was  perfectly 
focused.  An  example  of  this  is  shown  in  figure  i,  and  in  that  partic¬ 
ular  example,  the  aberration  is  not  a  sign  that  the  equipment  was 
of  low  quality  or  wasn’t  right  for  the  job  but  rather  an  inevitable 
result  of  trying  to  flatten  a  panoramic  view;  in  the  limit  of  a  360- 
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degree  panorama,  the  problem  would  be  similar  to  the  problem  of 
representing  the  Earth’s  surface  on  a  flat  map,  which  can’t  be  ac¬ 
complished  without  distortion. 


i  /  This  photo  was  taken  using  a 
“fish-eye  lens,”  which  gives  an  ex¬ 
tremely  large  field  of  view. 


(2)  The  image  may  be  sharp  when  the  object  is  at  certain  dis¬ 
tances  and  blurry  when  it  is  at  other  distances.  The  blurriness 
occurs  because  the  rays  do  not  all  cross  at  exactly  the  same  point. 
If  we  know  in  advance  the  distance  of  the  objects  with  which  the 
mirror  or  lens  will  be  used,  then  we  can  optimize  the  shape  of  the 
optical  surface  to  make  in-focus  images  in  that  situation.  For  in¬ 
stance,  a  spherical  mirror  will  produce  a  perfect  image  of  an  object 
that  is  at  the  center  of  the  sphere,  because  each  ray  is  reflected  di¬ 
rectly  onto  the  radius  along  which  it  was  emitted.  For  objects  at 
greater  distances,  however,  the  focus  will  be  somewhat  blurry.  In 
astronomy  the  objects  being  used  are  always  at  infinity,  so  a  spher¬ 
ical  mirror  is  a  poor  choice  for  a  telescope.  A  different  shape  (a 
parabola)  is  better  specialized  for  astronomy. 


One  way  of  decreasing  aberration  is  to  use  a  small-diameter  mir¬ 
ror  or  lens,  or  block  most  of  the  light  with  an  opaque  screen  with  a 
hole  in  it,  so  that  only  light  that  comes  in  close  to  the  axis  can  get 
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j  /  Spherical  mirrors  are  the 
cheapest  to  make,  but  parabolic 
mirrors  are  better  for  making 
images  of  objects  at  infinity. 
A  sphere  has  equal  curvature 
everywhere,  but  a  parabola  has 
tighter  curvature  at  its  center  and 
gentler  curvature  at  the  sides. 


through.  Either  way,  we  are  using  a  smaller  portion  of  the  lens  or 
mirror  whose  curvature  will  be  more  shallow,  thereby  making  the 
shallow- mirror  (or  thin- lens)  approximation  more  accurate.  Your 
eye  does  this  by  narrowing  down  the  pupil  to  a  smaller  hole.  In 
a  camera,  there  is  either  an  automatic  or  manual  adjustment,  and 
narrowing  the  opening  is  called  “stopping  down.”  The  disadvantage 
of  stopping  down  is  that  light  is  wasted,  so  the  image  will  be  dimmer 
or  a  longer  exposure  must  be  used. 


k  /  Even  though  the  spherical  mir¬ 
ror  (solid  line)  is  not  well  adapted 
for  viewing  an  object  at  infinity, 
we  can  improve  its  performance 
greatly  by  stopping  it  down.  Now 
the  only  part  of  the  mirror  be¬ 
ing  used  is  the  central  portion, 
where  its  shape  is  virtually  in¬ 
distinguishable  from  a  parabola 
(dashed  line). 


What  I  would  suggest  you  take  away  from  this  discussion  for  the 
sake  of  your  general  scientific  education  is  simply  an  understanding 
of  what  an  aberration  is,  why  it  occurs,  and  how  it  can  be  reduced, 
not  detailed  facts  about  specific  types  of  aberrations. 
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I /The  Hubble  Space  Telescope 
was  placed  into  orbit  with  faulty 
optics  in  1990.  Its  main  mir¬ 
ror  was  supposed  to  have  been 
nearly  parabolic,  since  it  is  an  as¬ 
tronomical  telescope,  meant  for 
producing  images  of  objects  at  in¬ 
finity.  However,  contractor  Per¬ 
kin  Elmer  had  delivered  a  faulty 
mirror,  which  produced  aberra¬ 
tions.  The  large  photo  shows  as¬ 
tronauts  putting  correcting  mirrors 
in  place  in  1993.  The  two  small 
photos  show  images  produced  by 
the  telescope  before  and  after  the 
fix. 


12.4  Refraction 

Economists  normally  consider  free  markets  to  be  the  natural  way  of 
judging  the  monetary  value  of  something,  but  social  scientists  also 
use  questionnaires  to  gauge  the  relative  value  of  privileges,  disad¬ 
vantages,  or  possessions  that  cannot  be  bought  or  sold.  They  ask 
people  to  imagine  that  they  could  trade  one  thing  for  another  and 
ask  which  they  would  choose.  One  interesting  result  is  that  the  av¬ 
erage  light-skinned  person  in  the  U.S.  would  rather  lose  an  arm  than 
suffer  the  racist  treatment  routinely  endured  by  African-Americans. 
Even  more  impressive  is  the  value  of  sight.  Many  prospective  par¬ 
ents  can  imagine  without  too  much  fear  having  a  deaf  child,  but 
would  have  a  far  more  difficult  time  coping  with  raising  a  blind  one. 

So  great  is  the  value  attached  to  sight  that  some  have  imbued 
it  with  mystical  aspects.  Joan  of  Arc  saw  visions,  and  my  college 
has  a  “vision  statement.”  Christian  fundamentalists  who  perceive  a 
conflict  between  evolution  and  their  religion  have  claimed  that  the 
eye  is  such  a  perfect  device  that  it  could  never  have  arisen  through 
a  process  as  helter-skelter  as  evolution,  or  that  it  could  not  have 
evolved  because  half  of  an  eye  would  be  useless.  In  fact,  the  struc¬ 
ture  of  an  eye  is  fundamentally  dictated  by  physics,  and  it  has  arisen 
separately  by  evolution  somewhere  between  eight  and  40  times,  de¬ 
pending  on  which  biologist  you  ask.  We  humans  have  a  version  of 
the  eye  that  can  be  traced  back  to  the  evolution  of  a  light-sensitive 
“eye  spot”  on  the  head  of  an  ancient  invertebrate.  A  sunken  pit 
then  developed  so  that  the  eye  would  only  receive  light  from  one 
direction,  allowing  the  organism  to  tell  where  the  light  was  coming 
from.  (Modern  flatworms  have  this  type  of  eye.)  The  top  of  the 
pit  then  became  partially  covered,  leaving  a  hole,  for  even  greater 
directionality  (as  in  the  nautilus).  At  some  point  the  cavity  became 
filled  with  jelly,  and  this  jelly  finally  became  a  lens,  resulting  in  the 
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a  /  A  human  eye. 


b  /  The  anatomy  of  the  eye. 


c/A  simplified  optical  dia¬ 
gram  of  the  eye.  Light  rays  are 
bent  when  they  cross  from  the 
air  into  the  eye.  (A  little  of  the 
incident  rays’  energy  goes  into 
the  reflected  rays  rather  than  the 
ones  transmitted  into  the  eye.) 


general  type  of  eye  that  we  share  with  the  bony  fishes  and  other 
vertebrates.  Far  from  being  a  perfect  device,  the  vertebrate  eye  is 
marred  by  a  serious  design  flaw  due  to  the  lack  of  planning  or  intelli¬ 
gent  design  in  evolution:  the  nerve  cells  of  the  retina  and  the  blood 
vessels  that  serve  them  are  all  in  front  of  the  light-sensitive  cells, 
blocking  part  of  the  light.  Squids  and  other  molluscs,  whose  eyes 
evolved  on  a  separate  branch  of  the  evolutionary  tree,  have  a  more 
sensible  arrangement,  with  the  light-sensitive  cells  out  in  front. 

12.4.1  Refraction 

Refraction 

The  fundamental  physical  phenomenon  at  work  in  the  eye  is 
that  when  light  crosses  a  boundary  between  two  media  (such  as  air 
and  the  eye’s  jelly),  part  of  its  energy  is  reflected,  but  part  passes 
into  the  new  medium.  In  the  ray  model  of  light,  we  describe  the 
original  ray  as  splitting  into  a  reflected  ray  and  a  transmitted  one 
(the  one  that  gets  through  the  boundary).  Of  course  the  reflected 
ray  goes  in  a  direction  that  is  different  from  that  of  the  original  one, 
according  to  the  rules  of  reflection  we  have  already  studied.  More 
surprisingly  —  and  this  is  the  crucial  point  for  making  your  eye 
focus  light  —  the  transmitted  ray  is  bent  somewhat  as  well.  This 
bending  phenomenon  is  called  refraction.  The  origin  of  the  word 
is  the  same  as  that  of  the  word  “fracture,”  i.e. ,  the  ray  is  bent  or 
“broken.”  (Keep  in  mind,  however,  that  light  rays  are  not  physical 
objects  that  can  really  be  “broken.”)  Refraction  occurs  with  all 
waves,  not  just  light  waves. 

The  actual  anatomy  of  the  eye,  b,  is  quite  complex,  but  in  essence 
it  is  very  much  like  every  other  optical  device  based  on  refraction. 
The  rays  are  bent  when  they  pass  through  the  front  surface  of  the 
eye,  c.  Rays  that  enter  farther  from  the  central  axis  are  bent  more, 
with  the  result  that  an  image  is  formed  on  the  retina.  There  is 
only  one  slightly  novel  aspect  of  the  situation.  In  most  human-built 
optical  devices,  such  as  a  movie  projector,  the  light  is  bent  as  it 
passes  into  a  lens,  bent  again  as  it  reemerges,  and  then  reaches  a 
focus  beyond  the  lens.  In  the  eye,  however,  the  “screen”  is  inside 
the  eye,  so  the  rays  are  only  refracted  once,  on  entering  the  jelly, 
and  never  emerge  again. 

A  common  misconception  is  that  the  “lens”  of  the  eye  is  what 
does  the  focusing.  All  the  transparent  parts  of  the  eye  are  made 
of  fairly  similar  stuff,  so  the  dramatic  change  in  medium  is  when  a 
ray  crosses  from  the  air  into  the  eye  (at  the  outside  surface  of  the 
cornea).  This  is  where  nearly  all  the  refraction  takes  place.  The  lens 
medium  differs  only  slightly  in  its  optical  properties  from  the  rest 
of  the  eye,  so  very  little  refraction  occurs  as  light  enters  and  exits 
the  lens.  The  lens,  whose  shape  is  adjusted  by  muscles  attached  to 
it,  is  only  meant  for  fine-tuning  the  focus  to  form  images  of  near  or 
far  objects. 
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Refractive  properties  of  media 

What  are  the  rules  governing  refraction?  The  first  thing  to  ob¬ 
serve  is  that  just  as  with  reflection,  the  new,  bent  part  of  the  ray  lies 
in  the  same  plane  as  the  normal  (perpendicular)  and  the  incident 
ray,  d. 

If  you  try  shooting  a  beam  of  light  at  the  boundary  between 
two  substances,  say  water  and  air,  you’ll  find  that  regardless  of  the 
angle  at  which  you  send  in  the  beam,  the  part  of  the  beam  in  the 
water  is  always  closer  to  the  normal  line,  e.  It  doesn’t  matter  if  the 
ray  is  entering  the  water  or  leaving,  so  refraction  is  symmetric  with 
respect  to  time-reversal,  f. 

If,  instead  of  water  and  air,  you  try  another  combination  of  sub¬ 
stances,  say  plastic  and  gasoline,  again  you’ll  find  that  the  ray’s 
angle  with  respect  to  the  normal  is  consistently  smaller  in  one  and 
larger  in  the  other.  Also,  we  find  that  if  substance  A  has  rays  closer 
to  normal  than  in  B,  and  B  has  rays  closer  to  normal  than  in  C,  then 
A  has  rays  closer  to  normal  than  C.  This  means  that  we  can  rank- 
order  all  materials  according  to  their  refractive  properties.  Isaac 
Newton  did  so,  including  in  his  list  many  amusing  substances,  such 
as  “Danzig  vitriol”  and  “a  pseudo-topazius,  being  a  natural,  pellu¬ 
cid,  brittle,  hairy  stone,  of  a  yellow  color.”  Several  general  rules  can 
be  inferred  from  such  a  list: 


•  Vacuum  lies  at  one  end  of  the  list.  In  refraction  across  the 
interface  between  vacuum  and  any  other  medium,  the  other 
medium  has  rays  closer  to  the  normal. 

•  Among  gases,  the  ray  gets  closer  to  the  normal  if  you  increase 
the  density  of  the  gas  by  pressurizing  it  more. 

•  The  refractive  properties  of  liquid  mixtures  and  solutions  vary 
in  a  smooth  and  systematic  manner  as  the  proportions  of  the 
mixture  are  changed. 

•  Denser  substances  usually,  but  not  always,  have  rays  closer  to 
the  normal. 


The  second  and  third  rules  provide  us  with  a  method  for  measur¬ 
ing  the  density  of  an  unknown  sample  of  gas,  or  the  concentration 
of  a  solution.  The  latter  technique  is  very  commonly  used,  and  the 
CRC  Handbook  of  Physics  and  Chemistry,  for  instance,  contains 
extensive  tables  of  the  refractive  properties  of  sugar  solutions,  cat 
urine,  and  so  on. 

Snell’s  law 

The  numerical  rule  governing  refraction  was  discovered  by  Snell, 
who  must  have  collected  experimental  data  something  like  what  is 


d  /  The  incident,  reflected, 
and  transmitted  (refracted)  rays 
all  lie  in  a  plane  that  includes  the 
normal  (dashed  line). 


e/The  angles  0i  and  02  are 
related  to  each  other,  and  also 
depend  on  the  properties  of  the 
two  media.  Because  refraction 
is  time-reversal  symmetric,  there 
is  no  need  to  label  the  rays  with 
arrowheads. 


f  /  Refraction  has  time-reversal 
symmetry.  Regardless  of  whether 
the  light  is  going  into  or  out  of  the 
water,  the  relationship  between 
the  two  angles  is  the  same,  and 
the  ray  is  closer  to  the  normal 
while  in  the  water. 
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g/The  relationship  between 
the  angles  in  refraction. 


shown  on  this  graph  and  then  attempted  by  trial  and  error  to  find 
the  right  equation.  The  equation  he  came  up  with  was 

sin  0\ 

- —  =  constant. 

sin  U2 

The  value  of  the  constant  would  depend  on  the  combination  of  media 
used.  For  instance,  any  one  of  the  data  points  in  the  graph  would 
have  sufficed  to  show  that  the  constant  was  1.3  for  an  air-water 
interface  (taking  air  to  be  substance  1  and  water  to  be  substance 
2). 

Snell  further  found  that  if  media  A  and  B  gave  a  constant  Kab 
and  media  B  and  C  gave  a  constant  Kbc >  then  refraction  at  an  inter¬ 
face  between  A  and  C  would  be  described  by  a  constant  equal  to  the 
product,  Kac  =  KabKbc •  This  is  exactly  what  one  would  expect 
if  the  constant  depended  on  the  ratio  of  some  number  characteriz¬ 
ing  one  medium  to  the  number  characteristic  of  the  second  medium. 
This  number  is  called  the  index  of  refraction  of  the  medium,  written 
as  n  in  equations.  Since  measuring  the  angles  would  only  allow  him 
to  determine  the  ratio  of  the  indices  of  refraction  of  two  media,  Snell 
had  to  pick  some  medium  and  define  it  as  having  n  =  1.  He  chose 
to  define  vacuum  as  having  n  =  1.  (The  index  of  refraction  of  air 
at  normal  atmospheric  pressure  is  1.0003,  so  for  most  purposes  it  is 
a  good  approximation  to  assume  that  air  has  n  =  1.)  He  also  had 
to  decide  which  way  to  define  the  ratio,  and  he  chose  to  define  it  so 
that  media  with  their  rays  closer  to  the  normal  would  have  larger  in¬ 
dices  of  refraction.  This  had  the  advantage  that  denser  media  would 
typically  have  higher  indices  of  refraction,  and  for  this  reason  the 
index  of  refraction  is  also  referred  to  as  the  optical  density.  Written 
in  terms  of  indices  of  refraction,  Snell’s  equation  becomes 

sin  6\  n2 
sin  02  n\  ’ 

but  rewriting  it  in  the  form 


m  sin  6\  =  7i2  sin  02 

[relationship  between  angles  of  rays  at  the  interface  be¬ 
tween  media  with  indices  of  refraction  ni  and  n2]  angles 
are  defined  with  respect  to  the  normal] 

makes  us  less  likely  to  get  the  l’s  and  2’s  mixed  up,  so  this  the  way 
most  people  remember  Snell’s  law.  A  few  indices  of  refraction  are 
given  in  the  back  of  the  book. 

self-check  E 

(1)  What  would  the  graph  look  like  for  two  substances  with  the  same 
index  of  refraction? 

(2)  Based  on  the  graph,  when  does  refraction  at  an  air-water  interface 

change  the  direction  of  a  ray  most  strongly?  >  Answer,  p.  968 
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Finding  an  angle  using  Snell’s  law  example  10 

>  A  submarine  shines  its  searchlight  up  toward  the  surface  of  the 
water.  What  is  the  angle  ot  shown  in  the  figure? 

o  The  tricky  part  is  that  Snell’s  law  refers  to  the  angles  with  re¬ 
spect  to  the  normal.  Forgetting  this  is  a  very  common  mistake. 
The  beam  is  at  an  angle  of  30°  with  respect  to  the  normal  in  the 
water.  Let’s  refer  to  the  air  as  medium  1  and  the  water  as  2. 
Solving  Snell’s  law  for  0! ,  we  find 

0i  =  sin-1  (  —  sin  02 

Vni 

As  mentioned  above,  air  has  an  index  of  refraction  very  close  to 
1,  and  water’s  is  about  1.3,  so  we  find  0!  =  40°.  The  angle  a  is 
therefore  50°. 


h  /  Example  10. 


The  index  of  refraction  is  related  to  the  speed  of  light. 

What  neither  Snell  nor  Newton  knew  was  that  there  is  a  very 
simple  interpretation  of  the  index  of  refraction.  This  may  come  as 
a  relief  to  the  reader  who  is  taken  aback  by  the  complex  reasoning 
involving  proportionalities  that  led  to  its  definition.  Later  experi¬ 
ments  showed  that  the  index  of  refraction  of  a  medium  was  inversely 
proportional  to  the  speed  of  light  in  that  medium.  Since  c  is  defined 
as  the  speed  of  light  in  vacuum,  and  n  =  1  is  defined  as  the  index 
of  refraction  of  vacuum,  we  have 

c 

n  =  -. 
v 

[re  =  medium’s  index  of  refraction,  v  =  speed  of  light 
in  that  medium,  c  =  speed  of  light  in  a  vacuum] 

Many  textbooks  start  with  this  as  the  definition  of  the  index 
of  refraction,  although  that  approach  makes  the  quantity’s  name 
somewhat  of  a  mystery,  and  leaves  students  wondering  why  c/v  was 
used  rather  than  v/c.  It  should  also  be  noted  that  measuring  angles 
of  refraction  is  a  far  more  practical  method  for  determining  re  than 
direct  measurement  of  the  speed  of  light  in  the  substance  of  interest. 

A  mechanical  model  of  Snell’s  law 

Why  should  refraction  be  related  to  the  speed  of  light?  The 
mechanical  model  shown  in  the  figure  may  help  to  make  this  more 
plausible.  Suppose  medium  2  is  thick,  sticky  mud,  which  slows  down 
the  car.  The  car’s  right  wheel  hits  the  mud  first,  causing  the  right 
side  of  the  car  to  slow  down.  This  will  cause  the  car  to  turn  to  the 
right  until  is  moves  far  enough  forward  for  the  left  wheel  to  cross 
into  the  mud.  After  that,  the  two  sides  of  the  car  will  once  again  be 
moving  at  the  same  speed,  and  the  car  will  go  straight. 

Of  course,  light  isn’t  a  car.  Why  should  a  beam  of  light  have 
anything  resembling  a  “left  wheel”  and  “right  wheel?”  After  all, 


i/A  mechanical  model  of  re¬ 
fraction. 
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the  mechanical  model  would  predict  that  a  motorcycle  would  go 
straight,  and  a  motorcycle  seems  like  a  better  approximation  to  a 
ray  of  light  than  a  car.  The  whole  thing  is  just  a  model,  not  a 
description  of  physical  reality. 


j  /  A  derivation  of  Snell’s  law. 


A  derivation  of  Snell’s  law 

However  intuitively  appealing  the  mechanical  model  may  be, 
light  is  a  wave,  and  we  should  be  using  wave  models  to  describe 
refraction.  In  fact  Snell’s  law  can  be  derived  quite  simply  from 
wave  concepts.  Figure  j  shows  the  refraction  of  a  water  wave.  The 
water  in  the  upper  left  part  of  the  tank  is  shallower,  so  the  speed 
of  the  waves  is  slower  there,  and  their  wavelengths  is  shorter.  The 
reflected  part  of  the  wave  is  also  very  faintly  visible. 

In  the  close-up  view  on  the  right,  the  dashed  lines  are  normals 
to  the  interface.  The  two  marked  angles  on  the  right  side  are  both 
equal  to  9\,  and  the  two  on  the  left  to  62. 

Trigonometry  gives 

sin  6\  =  \\/h  and 
sin  02  =  A2 /h. 

Eliminating  h  by  dividing  the  equations,  we  find 

sin  61  Ai 
sin  02  A2  ’ 

The  frequencies  of  the  two  waves  must  be  equal  or  else  they  would 
get  out  of  step,  so  by  v  =  / A  we  know  that  their  wavelengths  are 
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proportional  to  their  velocities.  Combining  A  oc  v  with  v  oc  1/n 
gives  A  oc  1/n,  so  we  find 

sin  6\  77,2 

sin  82  77 1  ’ 

which  is  one  form  of  Snell’s  law. 

Ocean  waves  near  and  far  from  shore  example  1 1 

Ocean  waves  are  formed  by  winds,  typically  on  the  open  sea,  and 
the  wavefronts  are  perpendicular  to  the  direction  of  the  wind  that 
formed  them.  At  the  beach,  however,  you  have  undoubtedly  ob¬ 
served  that  waves  tend  come  in  with  their  wavefronts  very  nearly 
(but  not  exactly)  parallel  to  the  shoreline.  This  is  because  the 
speed  of  water  waves  in  shallow  water  depends  on  depth:  the 
shallower  the  water,  the  slower  the  wave.  Although  the  change 
from  the  fast-wave  region  to  the  slow-wave  region  is  gradual  rather 
than  abrupt,  there  is  still  refraction,  and  the  wave  motion  is  nearly 
perpendicular  to  the  normal  in  the  slow  region. 

Color  and  refraction 

In  general,  the  speed  of  light  in  a  medium  depends  both  on  the 
medium  and  on  the  wavelength  of  the  light.  Another  way  of  saying  it 
is  that  a  medium’s  index  of  refraction  varies  with  wavelength.  This 
is  why  a  prism  can  be  used  to  split  up  a  beam  of  white  light  into  a 
rainbow.  Each  wavelength  of  light  is  refracted  through  a  different 
angle. 

How  much  light  is  reflected,  and  how  much  is  transmitted? 

In  section  6.2  we  developed  an  equation  for  the  percentage  of 
the  wave  energy  that  is  transmitted  and  the  percentage  reflected  at 
a  boundary  between  media.  This  was  only  done  in  the  case  of  waves 
in  one  dimension,  however,  and  rather  than  discuss  the  full  three  di¬ 
mensional  generalization  it  will  be  more  useful  to  go  into  some  qual¬ 
itative  observations  about  what  happens.  First,  reflection  happens 
only  at  the  interface  between  two  media,  and  two  media  with  the 
same  index  of  refraction  act  as  if  they  were  a  single  medium.  Thus, 
at  the  interface  between  media  with  the  same  index  of  refraction, 
there  is  no  reflection,  and  the  ray  keeps  going  straight.  Continuing 
this  line  of  thought,  it  is  not  surprising  that  we  observe  very  lit¬ 
tle  reflection  at  an  interface  between  media  with  similar  indices  of 
refraction. 

The  next  thing  to  note  is  that  it  is  possible  to  have  situations 
where  no  possible  angle  for  the  refracted  ray  can  satisfy  Snell’s  law. 
Solving  Snell’s  law  for  82,  we  find 

82  =  sin-1  sin  , 

and  if  77 1  is  greater  than  772,  then  there  will  be  large  values  of  8\ 
for  which  the  quantity  (711/772)  sin  8  is  greater  than  one,  meaning 


k  /  Total  internal  reflection  in 
a  fiber-optic  cable. 


I/A  simplified  drawing  of  a 
surgical  endoscope.  The  first 
lens  forms  a  real  image  at 
one  end  of  a  bundle  of  optical 
fibers.  The  light  is  transmitted 
through  the  bundle,  and  is  finally 
magnified  by  the  eyepiece. 


m  /  Endoscopic  images  of  a 
duodenal  ulcer. 
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that  your  calculator  will  flash  an  error  message  at  you  when  you 
try  to  take  the  inverse  sine.  What  can  happen  physically  in  such 
a  situation?  The  answer  is  that  all  the  light  is  reflected,  so  there 
is  no  refracted  ray.  This  phenomenon  is  known  as  total  internal 
reflection ,  and  is  used  in  the  fiber-optic  cables  that  nowadays  carry 
almost  all  long-distance  telephone  calls.  The  electrical  signals  from 
your  phone  travel  to  a  switching  center,  where  they  are  converted 
from  electricity  into  light.  From  there,  the  light  is  sent  across  the 
country  in  a  thin  transparent  fiber.  The  light  is  aimed  straight  into 
the  end  of  the  fiber,  and  as  long  as  the  fiber  never  goes  through  any 
turns  that  are  too  sharp,  the  light  will  always  encounter  the  edge 
of  the  fiber  at  an  angle  sufficiently  oblique  to  give  total  internal 
reflection.  If  the  fiber-optic  cable  is  thick  enough,  one  can  see  an 
image  at  one  end  of  whatever  the  other  end  is  pointed  at. 

Alternatively,  a  bundle  of  cables  can  be  used,  since  a  single  thick 
cable  is  too  hard  to  bend.  This  technique  for  seeing  around  corners 
is  useful  for  making  surgery  less  traumatic.  Instead  of  cutting  a 
person  wide  open,  a  surgeon  can  make  a  small  “keyhole”  incision 
and  insert  a  bundle  of  fiber-optic  cable  (known  as  an  endoscope) 
into  the  body. 

Since  rays  at  sufficiently  large  angles  with  respect  to  the  normal 
may  be  completely  reflected,  it  is  not  surprising  that  the  relative 
amount  of  reflection  changes  depending  on  the  angle  of  incidence, 
and  is  greatest  for  large  angles  of  incidence. 

Discussion  Questions 

A  What  index  of  refraction  should  a  fish  have  in  order  to  be  invisible  to 
other  fish? 

B  Does  a  surgeon  using  an  endoscope  need  a  source  of  light  inside 
the  body  cavity?  If  so,  how  could  this  be  done  without  inserting  a  light 
bulb  through  the  incision? 

C  A  denser  sample  of  a  gas  has  a  higher  index  of  refraction  than  a 
less  dense  sample  (i.e.,  a  sample  under  lower  pressure),  but  why  would 
it  not  make  sense  for  the  index  of  refraction  of  a  gas  to  be  proportional  to 
density? 

D  The  earth’s  atmosphere  gets  thinner  and  thinner  as  you  go  higher  in 
altitude.  If  a  ray  of  light  comes  from  a  star  that  is  below  the  zenith,  what 
will  happen  to  it  as  it  comes  into  the  earth’s  atmosphere? 

E  Does  total  internal  reflection  occur  when  light  in  a  denser  medium 
encounters  a  less  dense  medium,  or  the  other  way  around?  Or  can  it 
occur  in  either  case? 

12.4.2  Lenses 

Figures  n/1  and  n/2  show  examples  of  lenses  forming  images. 
There  is  essentially  nothing  for  you  to  learn  about  imaging  with 
lenses  that  is  truly  new.  You  already  know  how  to  construct  and 
use  ray  diagrams,  and  you  know  about  real  and  virtual  images.  The 
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concept  of  the  focal  length  of  a  lens  is  the  same  as  for  a  curved 
mirror.  The  equations  for  locating  images  and  determining  magni¬ 
fications  are  of  the  same  form.  It’s  really  just  a  question  of  flexing 
your  mental  muscles  on  a  few  examples.  The  following  self-checks 
and  discussion  questions  will  get  you  started. 


Hn  / 1.  A  converging  lens  forms  an 

image  of  a  candle  flame.  2.  A  di¬ 
verging  lens. 


self-check  F 

(1)  In  figures  n/1  and  n/2,  classify  the  images  as  real  or  virtual. 

(2)  Glass  has  an  index  of  refraction  that  is  greater  than  that  of  air.  Con¬ 
sider  the  topmost  ray  in  figure  n/1.  Explain  why  the  ray  makes  a  slight 
left  turn  upon  entering  the  lens,  and  another  left  turn  when  it  exits. 

(3)  If  the  flame  in  figure  n/2  was  moved  closer  to  the  lens,  what  would 

happen  to  the  location  of  the  image?  i>  Answer,  p.  968 

Discussion  Questions 

A  In  figures  n/1  and  n/2,  the  front  and  back  surfaces  are  parallel  to  each 
other  at  the  center  of  the  lens.  What  will  happen  to  a  ray  that  enters  near 
the  center,  but  not  necessarily  along  the  axis  of  the  lens?  Draw  a  BIG  ray 
diagram,  and  show  a  ray  that  comes  from  off  axis. 

In  discussion  questions  B-F,  don’t  draw  ultra- detailed  ray  dia¬ 
grams  as  in  A. 

B  Suppose  you  wanted  to  change  the  setup  in  figure  n/1  so  that  the 
location  of  the  actual  flame  in  the  figure  would  instead  be  occupied  by  an 
image  of  a  flame.  Where  would  you  have  to  move  the  candle  to  achieve 
this?  What  about  in  n/2? 

C  There  are  three  qualitatively  different  types  of  image  formation  that 
can  occur  with  lenses,  of  which  figures  n/1  and  n/2  exhaust  only  two. 
Figure  out  what  the  third  possibility  is.  Which  of  the  three  possibilities  can 
result  in  a  magnification  greater  than  one?  Cf.  problem  1 0,  p.  805. 
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D  Classify  the  examples  shown  in  figure  o  according  to  the  types  of 
images  delineated  in  discussion  question  C. 

E  In  figures  n/1  and  n/2,  the  only  rays  drawn  were  those  that  happened 
to  enter  the  lenses.  Discuss  this  in  relation  to  figure  o. 

F  In  the  right-hand  side  of  figure  o,  the  image  viewed  through  the  lens 
is  in  focus,  but  the  side  of  the  rose  that  sticks  out  from  behind  the  lens  is 
not.  Why? 


o  /  Two  images  of  a  rose  created  by  the  same  lens  and  recorded  with  the  same  camera. 


p/The  radii  of  curvature  ap¬ 
pearing  in  the  lensmaker’s 
equation. 


12.4.3  *  The  lensmaker’s  equation 

The  focal  length  of  a  spherical  mirror  is  simply  r / 2,  but  we  can¬ 
not  expect  the  focal  length  of  a  lens  to  be  given  by  pure  geometry, 
since  it  also  depends  on  the  index  of  refraction  of  the  lens.  Suppose 
we  have  a  lens  whose  front  and  back  surfaces  are  both  spherical. 
(This  is  no  great  loss  of  generality,  since  any  surface  with  a  suffi¬ 
ciently  shallow  curvature  can  be  approximated  with  a  sphere.)  Then 
if  the  lens  is  immersed  in  a  medium  with  an  index  of  refraction  of 
1,  its  focal  length  is  given  approximately  by 


" 

1 

1  1 

(n-  i) 

—  ±  — 

r  i 

r  2  J 

where  n  is  the  index  of  refraction  and  rq  and  r2  are  the  radii  of 
curvature  of  the  two  surfaces  of  the  lens.  This  is  known  as  the 
lensmaker’s  equation.  In  my  opinion  it  is  not  particularly  worthy 
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of  memorization.  The  positive  sign  is  used  when  both  surfaces  are 
curved  outward  or  both  are  curved  inward;  otherwise  a  negative 
sign  applies.  The  proof  of  this  equation  is  left  as  an  exercise  to 
those  readers  who  are  sufficiently  brave  and  motivated. 


12.4.4  Dispersion 

For  most  materials,  we  observe  that  the  index  of  refraction  de¬ 
pends  slightly  on  wavelength,  being  highest  at  the  blue  end  of  the 
visible  spectrum  and  lowest  at  the  red.  For  example,  white  light 
disperses  into  a  rainbow  when  it  passes  through  a  prism,  q.  Even 
when  the  waves  involved  aren’t  light  waves,  and  even  when  refrac¬ 
tion  isn’t  of  interest,  the  dependence  of  wave  speed  on  wavelength 
is  referred  to  as  dispersion.  Dispersion  inside  spherical  raindrops  is 
responsible  for  the  creation  of  rainbows  in  the  sky,  and  in  an  optical 
instrument  such  as  the  eye  or  a  camera  it  is  responsible  for  a  type  of 
aberration  called  chromatic  aberration  (subsection  12.3.3  and  prob¬ 
lem  28).  As  we’ll  see  in  subsection  13.3.2,  dispersion  causes  a  wave 
that  is  not  a  pure  sine  wave  to  have  its  shape  distorted  as  it  trav¬ 
els,  and  also  causes  the  speed  at  which  energy  and  information  are 
transported  by  the  wave  to  be  different  from  what  one  might  expect 
from  a  naive  calculation.  The  microscopic  reasons  for  dispersion  of 
light  in  matter  are  discussed  in  optional  subsection  12.4.6. 


q  /  Dispersion  of  white  light 
by  a  prism.  White  light  is  a 
mixture  of  all  the  wavelengths  of 
the  visible  spectrum.  Waves  of 
different  wavelengths  undergo 
different  amounts  of  refraction. 


12.4.5  *  The  principle  of  least  time  for  refraction 

We  have  seen  previously  how  the  rules  governing  straight-line 
motion  of  light  and  reflection  of  light  can  be  derived  from  the  prin¬ 
ciple  of  least  time.  What  about  refraction?  In  the  figure,  it  is  indeed 
plausible  that  the  bending  of  the  ray  serves  to  minimize  the  time 
required  to  get  from  a  point  A  to  point  B.  If  the  ray  followed  the  un¬ 
bent  path  shown  with  a  dashed  line,  it  would  have  to  travel  a  longer 
distance  in  the  medium  in  which  its  speed  is  slower.  By  bending 
the  correct  amount,  it  can  reduce  the  distance  it  has  to  cover  in  the 
slower  medium  without  going  too  far  out  of  its  way.  It  is  true  that 
Snell’s  law  gives  exactly  the  set  of  angles  that  minimizes  the  time 
required  for  light  to  get  from  one  point  to  another.  The  proof  of 
this  fact  is  left  as  an  exercise  (problem  38,  p.  810). 


r/The  principle  of  least  time 
applied  to  refraction. 
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12.4.6  *  Microscopic  description  of  refraction 

Given  that  the  speed  of  light  is  different  in  different  media,  we’ve 
seen  two  different  explanations  (on  p.  782  and  in  subsection  12.4.5 
above)  of  why  refraction  must  occur.  What  we  haven’t  yet  explained 
is  why  the  speed  of  light  does  depend  on  the  medium. 


s  /  Index  of  refraction  of  sil¬ 
ica  glass,  redrawn  from  Ki- 
tamura,  Pilon,  and  Jonasz, 
Applied  Optics  46  (2007) 

8118,  reprinted  online  at  http: 

/ / www . seas . ucla . edu/~pilon/ 
Publications/A02007-1 .pdf. 


c 

o 

1 


o 

■O 


frequency  (Hz) 


A  good  clue  as  to  what’s  going  on  comes  from  the  figure  s.  The 
relatively  minor  variation  of  the  index  of  refraction  within  the  visible 
spectrum  was  misleading.  At  certain  specific  frequencies,  n  exhibits 
wild  swings  in  the  positive  and  negative  directions.  After  each  such 
swing,  we  reach  a  new,  lower  plateau  on  the  graph.  These  frequen¬ 
cies  are  resonances.  For  example,  the  visible  part  of  the  spectrum 
lies  on  the  left-hand  tail  of  a  resonance  at  about  2  x  1015  Hz,  cor¬ 
responding  to  the  ultraviolet  part  of  the  spectrum.  This  resonance 
arises  from  the  vibration  of  the  electrons,  which  are  bound  to  the 
nuclei  as  if  by  little  springs.  Because  this  resonance  is  narrow,  the 
effect  on  visible-light  frequencies  is  relatively  small,  but  it  is  stronger 
at  the  blue  end  of  the  spectrum  than  at  the  red  end.  Near  each  reso¬ 
nance,  not  only  does  the  index  of  refraction  fluctuate  wildly,  but  the 
glass  becomes  nearly  opaque;  this  is  because  the  vibration  becomes 
very  strong,  causing  energy  to  be  dissipated  as  heat.  The  “stair¬ 
case”  effect  is  the  same  one  visible  in  any  resonance,  e.g.,  figure  k 
on  p.  180:  oscillators  have  a  finite  response  for  /  <C  /o,  but  the 
response  approaches  zero  for  /  3>  /o- 

So  far,  we  have  a  qualitative  explanation  of  the  frequency- variation 
of  the  loosely  defined  “strength”  of  the  glass’s  effect  on  a  light  wave, 
but  we  haven’t  explained  why  the  effect  is  observed  as  a  change  in 
speed,  or  why  each  resonance  is  an  up-down  swing  rather  than  a 
single  positive  peak.  To  understand  these  effects  in  more  detail,  we 
need  to  consider  the  phase  response  of  the  oscillator.  As  shown  in 
the  bottom  panel  of  figure  j  on  p.  181,  the  phase  response  reverses 
itself  as  we  pass  through  a  resonance. 

Suppose  that  a  plane  wave  is  normally  incident  on  the  left  side  of 
a  thin  sheet  of  glass,  t/1,  at  /  <C  /o-  The  light  wave  observed  on  the 
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right  side  consists  of  a  superposition  of  the  incident  wave  consisting 
of  Eo  and  Bo  with  a  secondary  wave  E*  and  B*  generated  by  the 
oscillating  charges  in  the  glass.  Since  the  frequency  is  far  below 
resonance,  the  response  qx  of  a  vibrating  charge  q  is  in  phase  with 
the  driving  force  Eo-  The  current  is  the  derivative  of  this  quantity, 
and  therefore  90  degrees  ahead  of  it  in  phase.  The  magnetic  field 
generated  by  a  sheet  of  current  has  been  analyzed  in  subsection 
11.2.1,  and  the  result,  shown  in  figure  e  on  p.  670,  is  just  what 
we  would  expect  from  the  right-hand  rule.  We  find,  t/1,  that  the 
secondary  wave  is  90  degrees  ahead  of  the  incident  one  in  phase. 
The  incident  wave  still  exists  on  the  right  side  of  the  sheet,  but  it  is 
superposed  with  the  secondary  one.  Their  addition  is  shown  in  t/2 
using  the  complex  number  representation  introduced  in  subsection 
10.5.7.  The  superposition  of  the  two  fields  lags  behind  the  incident 
wave,  which  is  the  effect  we  would  expect  if  the  wave  had  traveled 
more  slowly  through  the  glass. 

In  the  case  /  3>  0,  the  same  analysis  applies  except  that  the 
phase  of  the  secondary  wave  is  reversed.  The  transmitted  wave 
is  advanced  rather  than  retarded  in  phase.  This  explains  the  dip 
observed  in  figure  s  after  each  spike. 

All  of  this  is  in  accord  with  our  understanding  of  relativity,  ch.  7, 
in  which  we  saw  that  the  universal  speed  c  was  to  be  understood  fun¬ 
damentally  as  a  conversion  factor  between  the  units  used  to  measure 
time  and  space  —  not  as  the  speed  of  light.  Since  c  isn’t  defined  as 
the  speed  of  light,  it’s  of  no  fundamental  importance  whether  light 
has  a  different  speed  in  matter  than  it  does  in  vacuum.  In  fact,  the 
picture  we’ve  built  up  here  is  one  in  which  all  of  our  electromagnetic 
waves  travel  at  c;  propagation  at  some  other  speed  is  only  what  ap¬ 
pears  to  happen  because  of  the  superposition  of  the  (Eo,Bo)  and 
(E*,B*)  waves,  both  of  which  move  at  c. 

But  it  is  worrisome  that  at  the  frequencies  where  n  <  1,  the 
speed  of  the  wave  is  greater  than  c.  According  to  special  relativity, 
information  is  never  supposed  to  be  transmitted  at  speeds  greater 
than  c,  since  this  would  produce  situations  in  which  a  signal  could 
be  received  before  it  was  transmitted!  This  difficulty  is  resolved 
in  subsection  13.3.2,  where  we  show  that  there  are  two  different 
velocities  that  can  be  defined  for  a  wave  in  a  dispersive  medium, 
the  phase  velocity  and  the  group  velocity.  The  group  velocity  is  the 
velocity  at  which  information  is  transmitted,  and  it  is  always  less 
than  c. 


t  / 1 .  A  wave  incident  on  a 
sheet  of  glass  excites  current  in 
the  glass,  which  produce  a  sec¬ 
ondary  wave.  2.  The  secondary 
wave  superposes  with  the  origi¬ 
nal  wave,  as  represented  in  the 
complex-number  representation 
introduced  in  subsection  10.5.7. 
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a  /In  this  view  from  overhead,  a 
straight,  sinusoidal  water  wave 
encounters  a  barrier  with  two 
gaps  in  it.  Strong  wave  vibration 
occurs  at  angles  X  and  Z,  but 
there  is  none  at  all  at  angle  Y. 
(The  figure  has  been  retouched 
from  a  real  photo  of  water  waves. 
In  reality,  the  waves  beyond  the 
barrier  would  be  much  weaker 
than  the  ones  before  it,  and  they 
would  therefore  be  difficult  to 
see.) 


b  /  This  doesn’t  happen. 


12.5  Wave  Optics 

Electron  microscopes  can  make  images  of  individual  atoms,  but  why 
will  a  visible-light  microscope  never  be  able  to?  Stereo  speakers 
create  the  illusion  of  music  that  comes  from  a  band  arranged  in 
your  living  room,  but  why  doesn’t  the  stereo  illusion  work  with  bass 
notes?  Why  are  computer  chip  manufacturers  investing  billions  of 
dollars  in  equipment  to  etch  chips  with  x-rays  instead  of  visible 
light? 

The  answers  to  all  of  these  questions  have  to  do  with  the  subject 
of  wave  optics.  So  far  this  book  has  discussed  the  interaction  of 
light  waves  with  matter,  and  its  practical  applications  to  optical 
devices  like  mirrors,  but  we  have  used  the  ray  model  of  light  almost 
exclusively.  Hardly  ever  have  we  explicitly  made  use  of  the  fact  that 
light  is  an  electromagnetic  wave.  We  were  able  to  get  away  with  the 
simple  ray  model  because  the  chunks  of  matter  we  were  discussing, 
such  as  lenses  and  mirrors,  were  thousands  of  times  larger  than  a 
wavelength  of  light.  We  now  turn  to  phenomena  and  devices  that 
can  only  be  understood  using  the  wave  model  of  light. 

12.5.1  Diffraction 

Figure  a  shows  a  typical  problem  in  wave  optics,  enacted  with 
water  waves.  It  may  seem  surprising  that  we  don’t  get  a  simple 
pattern  like  figure  b,  but  the  pattern  would  only  be  that  simple 
if  the  wavelength  was  hundreds  of  times  shorter  than  the  distance 
between  the  gaps  in  the  barrier  and  the  widths  of  the  gaps. 

Wave  optics  is  a  broad  subject,  but  this  example  will  help  us 
to  pick  out  a  reasonable  set  of  restrictions  to  make  things  more 
manageable: 

(1)  We  restrict  ourselves  to  cases  in  which  a  wave  travels  through 
a  uniform  medium,  encounters  a  certain  area  in  which  the  medium 
has  different  properties,  and  then  emerges  on  the  other  side  into  a 
second  uniform  region. 

(2)  We  assume  that  the  incoming  wave  is  a  nice  tidy  sine-wave 
pattern  with  wavefronts  that  are  lines  (or,  in  three  dimensions, 
planes). 

(3)  In  figure  a  we  can  see  that  the  wave  pattern  immediately 
beyond  the  barrier  is  rather  complex,  but  farther  on  it  sorts  itself 
out  into  a  set  of  wedges  separated  by  gaps  in  which  the  water  is 
still.  We  will  restrict  ourselves  to  studying  the  simpler  wave  patterns 
that  occur  farther  away,  so  that  the  main  question  of  interest  is  how 
intense  the  outgoing  wave  is  at  a  given  angle. 

The  kind  of  phenomenon  described  by  restriction  (1)  is  called 
diffraction.  Diffraction  can  be  defined  as  the  behavior  of  a  wave 
when  it  encounters  an  obstacle  or  a  nonuniformity  in  its  medium. 
In  general,  diffraction  causes  a  wave  to  bend  around  obstacles  and 
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make  patterns  of  strong  and  weak  waves  radiating  out  beyond  the 
obstacle.  Understanding  diffraction  is  the  central  problem  of  wave 
optics.  If  you  understand  diffraction,  even  the  subset  of  diffraction 
problems  that  fall  within  restrictions  (2)  and  (3),  the  rest  of  wave 
optics  is  icing  on  the  cake. 

Diffraction  can  be  used  to  find  the  structure  of  an  unknown 
diffracting  object:  even  if  the  object  is  too  small  to  study  with 
ordinary  imaging,  it  may  be  possible  to  work  backward  from  the 
diffraction  pattern  to  learn  about  the  object.  The  structure  of  a 
crystal,  for  example,  can  be  determined  from  its  x-ray  diffraction 
pattern. 

Diffraction  can  also  be  a  bad  thing.  In  a  telescope,  for  example, 
light  waves  are  diffracted  by  all  the  parts  of  the  instrument.  This  will 
cause  the  image  of  a  star  to  appear  fuzzy  even  when  the  focus  has 
been  adjusted  correctly.  By  understanding  diffraction,  one  can  learn 
how  a  telescope  must  be  designed  in  order  to  reduce  this  problem 
-  essentially,  it  should  have  the  biggest  possible  diameter. 

There  are  two  ways  in  which  restriction  (2)  might  commonly  be 
violated.  First,  the  light  might  be  a  mixture  of  wavelengths.  If  we 
simply  want  to  observe  a  diffraction  pattern  or  to  use  diffraction  as 
a  technique  for  studying  the  object  doing  the  diffracting  (e.g.,  if  the 
object  is  too  small  to  see  with  a  microscope),  then  we  can  pass  the 
light  through  a  colored  filter  before  diffracting  it. 

A  second  issue  is  that  light  from  sources  such  as  the  sun  or  a 
lightbulb  does  not  consist  of  a  nice  neat  plane  wave,  except  over- 
very  small  regions  of  space.  Different  parts  of  the  wave  are  out  of 
step  with  each  other,  and  the  wave  is  referred  to  as  incoherent.  One 
way  of  dealing  with  this  is  shown  in  figure  c.  After  filtering  to  select 
a  certain  wavelength  of  red  light,  we  pass  the  light  through  a  small 
pinhole.  The  region  of  the  light  that  is  intercepted  by  the  pinhole  is 
so  small  that  one  part  of  it  is  not  out  of  step  with  another.  Beyond 
the  pinhole,  light  spreads  out  in  a  spherical  wave;  this  is  analogous 
to  what  happens  when  you  speak  into  one  end  of  a  paper  towel  roll 
and  the  sound  waves  spread  out  in  all  directions  from  the  other  end. 
By  the  time  the  spherical  wave  gets  to  the  double  slit  it  has  spread 
out  and  reduced  its  curvature,  so  that  we  can  now  think  of  it  as  a 
simple  plane  wave. 

If  this  seems  laborious,  you  may  be  relieved  to  know  that  modern 
technology  gives  us  an  easier  way  to  produce  a  single-wavelength, 
coherent  beam  of  light:  the  laser. 

The  parts  of  the  final  image  on  the  screen  in  c  are  called  diffrac¬ 
tion  fringes.  The  center  of  each  fringe  is  a  point  of  maximum  bright¬ 
ness,  and  halfway  between  two  fringes  is  a  minimum. 

Discussion  Question 

A  Why  would  x-rays  rather  than  visible  light  be  used  to  find  the  structure 
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c  /  A  practical,  low-tech  setup  for 
observing  diffraction  of  light. 


/ 


d/The  bottom  figure  is  sim¬ 
ply  a  copy  of  the  middle  portion 
of  the  top  one,  scaled  up  by  a 
factor  of  two.  All  the  angles  are 
the  same.  Physically,  the  angular 
pattern  of  the  diffraction  fringes 
can’t  be  any  different  if  we  scale 
both  A  and  d  by  the  same  factor, 
leaving  A /d  unchanged. 
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e  /  Christiaan  Huygens  (1629- 
1695). 


of  a  crystal?  Sound  waves  are  used  to  make  images  of  fetuses  in  the 
womb.  What  would  influence  the  choice  of  wavelength? 

12.5.2  Scaling  of  diffraction 

This  chapter  has  “optics”  in  its  title,  so  it  is  nominally  about 
light,  but  we  started  out  with  an  example  involving  water  waves. 
Water  waves  are  certainly  easier  to  visualize,  but  is  this  a  legitimate 
comparison?  In  fact  the  analogy  works  quite  well,  despite  the  fact 
that  a  light  wave  has  a  wavelength  about  a  million  times  shorter. 
This  is  because  diffraction  effects  scale  uniformly.  That  is,  if  we 
enlarge  or  reduce  the  whole  diffraction  situation  by  the  same  factor, 
including  both  the  wavelengths  and  the  sizes  of  the  obstacles  the 
wave  encounters,  the  result  is  still  a  valid  solution. 

This  is  unusually  simple  behavior!  In  subsection  0.2.2  we  saw 
many  examples  of  more  complex  scaling,  such  as  the  impossibility 
of  bacteria  the  size  of  dogs,  or  the  need  for  an  elephant  to  eliminate 
heat  through  its  ears  because  of  its  small  surface-to-volume  ratio, 
whereas  a  tiny  shrew’s  life-style  centers  around  conserving  its  body 
heat. 

Of  course  water  waves  and  light  waves  differ  in  many  ways,  not 
just  in  scale,  but  the  general  facts  you  will  learn  about  diffraction  are 
applicable  to  all  waves.  In  some  ways  it  might  have  been  more  ap¬ 
propriate  to  insert  this  chapter  after  section  6.2  on  bounded  waves, 
but  many  of  the  important  applications  are  to  light  waves,  and  you 
would  probably  have  found  these  much  more  difficult  without  any 
background  in  optics. 

Another  way  of  stating  the  simple  scaling  behavior  of  diffraction 
is  that  the  diffraction  angles  we  get  depend  only  on  the  unitless  ratio 
A/d,  where  A  is  the  wavelength  of  the  wave  and  d  is  some  dimension 
of  the  diffracting  objects,  e.g.,  the  center-to-center  spacing  between 
the  slits  in  figure  a.  If,  for  instance,  we  scale  up  both  A  and  d  by  a 
factor  of  37,  the  ratio  A/d  will  be  unchanged. 

12.5.3  The  correspondence  principle 

The  only  reason  we  don’t  usually  notice  diffraction  of  light  in 
everyday  life  is  that  we  don’t  normally  deal  with  objects  that  are 
comparable  in  size  to  a  wavelength  of  visible  light,  which  is  about  a 
millionth  of  a  meter.  Does  this  mean  that  wave  optics  contradicts 
ray  optics,  or  that  wave  optics  sometimes  gives  wrong  results?  No. 
If  you  hold  three  fingers  out  in  the  sunlight  and  cast  a  shadow 
with  them,  either  wave  optics  or  ray  optics  can  be  used  to  predict 
the  straightforward  result:  a  shadow  pattern  with  two  bright  lines 
where  the  light  has  gone  through  the  gaps  between  your  fingers. 
Wave  optics  is  a  more  general  theory  than  ray  optics,  so  in  any  case 
where  ray  optics  is  valid,  the  two  theories  will  agree.  This  is  an 
example  of  a  general  idea  enunciated  by  the  physicist  Niels  Bohr, 
called  the  correspondence  principle:  when  flaws  in  a  physical  theory 
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lead  to  the  creation  of  a  new  and  more  general  theory,  the  new 
theory  must  still  agree  with  the  old  theory  within  its  more  restricted 
area  of  applicability.  After  all,  a  theory  is  only  created  as  a  way  of 
describing  experimental  observations.  If  the  original  theory  had  not 
worked  in  any  cases  at  all,  it  would  never  have  become  accepted. 

In  the  case  of  optics,  the  correspondence  principle  tells  us  that 
when  X/d  is  small,  both  the  ray  and  the  wave  model  of  light  must 
give  approximately  the  same  result.  Suppose  you  spread  your  fingers 
and  cast  a  shadow  with  them  using  a  coherent  light  source.  The 
quantity  X/d  is  about  10  ,  so  the  two  models  will  agree  very  closely. 

(To  be  specific,  the  shadows  of  your  fingers  will  be  outlined  by  a 
series  of  light  and  dark  fringes,  but  the  angle  subtended  by  a  fringe 
will  be  on  the  order  of  10-4  radians,  so  they  will  be  too  tiny  to  be 
visible. 

self-check  G 

What  kind  of  wavelength  would  an  electromagnetic  wave  have  to  have 
in  order  to  diffract  dramatically  around  your  body?  Does  this  contradict 
the  correspondence  principle?  >  Answer,  p.  968 

12.5.4  Huygens’  principle 

Returning  to  the  example  of  double-slit  diffraction,  f,  note  the 
strong  visual  impression  of  two  overlapping  sets  of  concentric  semi¬ 
circles.  This  is  an  example  of  Huygens’  principle ,  named  after  a 
Dutch  physicist  and  astronomer.  (The  first  syllable  rhymes  with 
“boy.” )  Huygens’  principle  states  that  any  wavefront  can  be  broken 
down  into  many  small  side-by-side  wave  peaks,  g,  which  then  spread 
out  as  circular  ripples,  h,  and  by  the  principle  of  superposition,  the 
result  of  adding  up  these  sets  of  ripples  must  give  the  same  result 
as  allowing  the  wave  to  propagate  forward,  i.  In  the  case  of  sound 
or  light  waves,  which  propagate  in  three  dimensions,  the  “ripples” 
are  actually  spherical  rather  than  circular,  but  we  can  often  imagine 
things  in  two  dimensions  for  simplicity. 

In  double-slit  diffraction  the  application  of  Huygens’  principle  is 
visually  convincing:  it  is  as  though  all  the  sets  of  ripples  have  been 
blocked  except  for  two.  It  is  a  rather  surprising  mathematical  fact, 
however,  that  Huygens’  principle  gives  the  right  result  in  the  case  of 
an  unobstructed  linear  wave,  h  and  i.  A  theoretically  infinite  number 
of  circular  wave  patterns  somehow  conspire  to  add  together  and 
produce  the  simple  linear  wave  motion  with  which  we  are  familiar. 

Since  Huygens’  principle  is  equivalent  to  the  principle  of  super¬ 
position,  and  superposition  is  a  property  of  waves,  what  Huygens 
had  created  was  essentially  the  first  wave  theory  of  light.  However, 
he  imagined  light  as  a  series  of  pulses,  like  hand  claps,  rather  than 
as  a  sinusoidal  wave. 

The  history  is  interesting.  Isaac  Newton  loved  the  atomic  theory 
of  matter  so  much  that  he  searched  enthusiastically  for  evidence  that 
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f  /  Double-slit  diffraction. 


+r 

+3 

g  /  A  wavefront  can  be  analyzed 
by  the  principle  of  superposition, 
breaking  it  down  into  many  small 
parts. 


h  /  If  it  was  by  itself,  each  of 
the  parts  would  spread  out  as  a 
circular  ripple. 


i  /  Adding  up  the  ripples  pro¬ 
duces  a  new  wavefront. 
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j  /  Thomas  Young 


k  /  Double-slit  diffraction. 


x  Y  Z 


I  /  Use  of  Huygens’  principle. 


x  y  z 


m  /  Constructive  interference 
along  the  center-line. 


light  was  also  made  of  tiny  particles.  The  paths  of  his  light,  particles 
would  correspond  to  rays  in  our  description;  the  only  significant 
difference  between  a  ray  model  and  a  particle  model  of  light  would 
occur  if  one  could  isolate  individual  particles  and  show  that  light 
had  a  “graininess”  to  it.  Newton  never  did  this,  so  although  he 
thought  of  his  model  as  a  particle  model,  it  is  more  accurate  to  say 
he  was  one  of  the  builders  of  the  ray  model. 

Almost  all  that  was  known  about  reflection  and  refraction  of 
light  could  be  interpreted  equally  well  in  terms  of  a  particle  model 
or  a  wave  model,  but  Newton  had  one  reason  for  strongly  opposing 
Huygens’  wave  theory.  Newton  knew  that  waves  exhibited  diffrac¬ 
tion,  but  diffraction  of  light  is  difficult  to  observe,  so  Newton  be¬ 
lieved  that  light  did  not  exhibit  diffraction,  and  therefore  must  not 
be  a  wave.  Although  Newton’s  criticisms  were  fair  enough,  the  de¬ 
bate  also  took  on  the  overtones  of  a  nationalistic  dispute  between 
England  and  continental  Europe,  fueled  by  English  resentment  over 
Leibniz’s  supposed  plagiarism  of  Newton’s  calculus.  Newton  wrote 
a  book  on  optics,  and  his  prestige  and  political  prominence  tended 
to  discourage  questioning  of  his  model. 

Thomas  Young  (1773-1829)  was  the  person  who  finally,  a  hun¬ 
dred  years  later,  did  a  careful  search  for  wave  interference  effects 
with  light  and  analyzed  the  results  correctly.  He  observed  double- 
slit  diffraction  of  light  as  well  as  a  variety  of  other  diffraction  ef¬ 
fects,  all  of  which  showed  that  light  exhibited  wave  interference  ef¬ 
fects,  and  that  the  wavelengths  of  visible  light  waves  were  extremely 
short.  The  crowning  achievement  was  the  demonstration  by  the  ex¬ 
perimentalist  Heinrich  Hertz  and  the  theorist  James  Clerk  Maxwell 
that  light  was  an  electromagnetic  wave.  Maxwell  is  said  to  have  re¬ 
lated  his  discovery  to  his  wife  one  starry  evening  and  told  her  that 
she  was  the  only  other  person  in  the  world  who  knew  what  starlight 
was. 

12.5.5  Double-slit  diffraction 

Let’s  now  analyze  double-slit  diffraction,  k,  using  Huygens’  prin¬ 
ciple.  The  most  interesting  question  is  how  to  compute  the  angles 
such  as  X  and  Z  where  the  wave  intensity  is  at  a  maximum,  and 
the  in-between  angles  like  Y  where  it  is  minimized.  Let’s  measure 
all  our  angles  with  respect  to  the  vertical  center  line  of  the  figure, 
which  was  the  original  direction  of  propagation  of  the  wave. 

If  we  assume  that  the  width  of  the  slits  is  small  (on  the  order 
of  the  wavelength  of  the  wave  or  less),  then  we  can  imagine  only  a 
single  set  of  Huygens  ripples  spreading  out  from  each  one,  1.  White 
lines  represent  peaks,  black  ones  troughs.  The  only  dimension  of  the 
diffracting  slits  that  has  any  effect  on  the  geometric  pattern  of  the 
overlapping  ripples  is  then  the  center-to-center  distance,  d,  between 
the  slits. 
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We  know  from  our  discussion  of  the  scaling  of  diffraction  that 
there  must  be  some  equation  that  relates  an  angle  like  9z  to  the 
ratio  X/d, 

1  °  ® z ' 
a 

If  the  equation  for  6z  depended  on  some  other  expression  such  as 
A  +  d  or  A2/d,  then  it  would  change  when  we  scaled  A  and  d  by  the 
same  factor,  which  would  violate  what  we  know  about  the  scaling 
of  diffraction. 

Along  the  central  maximum  line,  X,  we  always  have  positive 
waves  coinciding  with  positive  ones  and  negative  waves  coinciding 
with  negative  ones.  (I  have  arbitrarily  chosen  to  take  a  snapshot  of 
the  pattern  at  a  moment  when  the  waves  emerging  from  the  slit  are 
experiencing  a  positive  peak.)  The  superposition  of  the  two  sets  of 
ripples  therefore  results  in  a  doubling  of  the  wave  amplitude  along 
this  line.  There  is  constructive  interference.  This  is  easy  to  explain, 
because  by  symmetry,  each  wave  has  had  to  travel  an  equal  number 
of  wavelengths  to  get  from  its  slit  to  the  center  line,  m:  Because 
both  sets  of  ripples  have  ten  wavelengths  to  cover  in  order  to  reach 
the  point  along  direction  X,  they  will  be  in  step  when  they  get  there. 

At  the  point  along  direction  Y  shown  in  the  same  figure,  one 
wave  has  traveled  ten  wavelengths,  and  is  therefore  at  a  positive 
extreme,  but  the  other  has  traveled  only  nine  and  a  half  wavelengths, 
so  it  at  a  negative  extreme.  There  is  perfect  cancellation,  so  points 
along  this  line  experience  no  wave  motion. 

But  the  distance  traveled  does  not  have  to  be  equal  in  order  to 
get  constructive  interference.  At  the  point  along  direction  Z,  one 
wave  has  gone  nine  wavelengths  and  the  other  ten.  They  are  both 
at  a  positive  extreme. 

self-check  H 

At  a  point  half  a  wavelength  below  the  point  marked  along  direction  X, 
carry  out  a  similar  analysis.  c>  Answer,  p.  968 

To  summarize,  we  will  have  perfect  constructive  interference  at 
any  point  where  the  distance  to  one  slit  differs  from  the  distance  to 
the  other  slit  by  an  integer  number  of  wavelengths.  Perfect  destruc¬ 
tive  interference  will  occur  when  the  number  of  wavelengths  of  path 
length  difference  equals  an  integer  plus  a  half. 

Now  we  are  ready  to  find  the  equation  that  predicts  the  angles 
of  the  maxima  and  minima.  The  waves  travel  different  distances 
to  get  to  the  same  point  in  space,  n.  We  need  to  find  whether  the 
waves  are  in  phase  (in  step)  or  out  of  phase  at  this  point  in  order  to 
predict  whether  there  will  be  constructive  interference,  destructive 
interference,  or  something  in  between. 

One  of  our  basic  assumptions  in  this  chapter  is  that  we  will  only 
be  dealing  with  the  diffracted  wave  in  regions  very  far  away  from  the 
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n  /  The  waves  travel  distances  L 
and  L!  from  the  two  slits  to  get 
to  the  same  point  in  space,  at  an 
angle  0  from  the  center  line. 


d 


o/A  close-up  view  of  figure 
n,  showing  how  the  path  length 
difference  L  -  L'  is  related  to  d 
and  to  the  angle  0. 
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object  that  diffracts  it,  so  the  triangle  is  long  and  skinny.  Most  real- 
world  examples  with  diffraction  of  light,  in  fact,  would  have  triangles 
with  even  skinner  proportions  than  this  one.  The  two  long  sides  are 
therefore  very  nearly  parallel,  and  we  are  justified  in  drawing  the 
right  triangle  shown  in  figure  o,  labeling  one  leg  of  the  right  triangle 
as  the  difference  in  path  length  ,  L  —  L' ,  and  labeling  the  acute  angle 
as  #.  (In  reality  this  angle  is  a  tiny  bit  greater  than  the  one  labeled 
#  in  figure  n.) 

The  difference  in  path  length  is  related  to  d  and  6  by  the  equation 


i 

m=o  2  l 


p  /  Cutting  d  in  half  doubles 
the  angles  of  the  diffraction 
fringes. 


- - —  =  sin#. 

d 

Constructive  interference  will  result  in  a  maximum  at  angles  for 
which  L  —  L'  is  an  integer  number  of  wavelengths, 


L  —  L'  =  mX.  [condition  for  a  maximum; 

m  is  an  integer] 

Here  m  equals  0  for  the  central  maximum,  —1  for  the  first  maximum 
to  its  left,  +2  for  the  second  maximum  on  the  right,  etc.  Putting 
all  the  ingredients  together,  we  find  mX/d  =  sin#,  or 


A  sin# 
d  m 


[condition  for  a  maximum; 

m  is  an  integer] 


Similarly,  the  condition  for  a  minimum  is 
A  sin# 


d 


m 


[condition  for  a  minimum; 
m  is  an  integer  plus  1/2] 


That  is,  the  minima  are  about  halfway  between  the  maxima. 


Illlllllllll 


q  /  Double-slit  diffraction  pat¬ 
terns  of  long-wavelength  red  light 
(top)  and  short-wavelength  blue 
light  (bottom). 


As  expected  based  on  scaling,  this  equation  relates  angles  to  the 
unitless  ratio  X/d.  Alternatively,  we  could  say  that  we  have  proven 
the  scaling  property  in  the  special  case  of  double-slit  diffraction.  It 
was  inevitable  that  the  result  would  have  these  scaling  properties, 
since  the  whole  proof  was  geometric,  and  would  have  been  equally 
valid  when  enlarged  or  reduced  on  a  photocopying  machine! 

Counterintuitively,  this  means  that  a  diffracting  object  with 
smaller  dimensions  produces  a  bigger  diffraction  pattern,  p. 
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Double-slit  diffraction  of  blue  and  red  light  example  12 

Blue  light  has  a  shorter  wavelength  than  red.  For  a  given  double¬ 
slit  spacing  d,  the  smaller  value  of  A /d  for  leads  to  smaller  values 
of  sin  0,  and  therefore  to  a  more  closely  spaced  set  of  diffraction 
fringes,  (g) 

The  correspondence  principle  example  13 

Let’s  also  consider  how  the  equations  for  double-slit  diffraction 
relate  to  the  correspondence  principle.  When  the  ratio  A /d  is  very 
small,  we  should  recover  the  case  of  simple  ray  optics.  Now  if  A /d 
is  small,  sine  must  be  small  as  well,  and  the  spacing  between 
the  diffraction  fringes  will  be  small  as  well.  Although  we  have  not 
proven  it,  the  central  fringe  is  always  the  brightest,  and  the  fringes 
get  dimmer  and  dimmer  as  we  go  farther  from  it.  For  small  values 
of  A /d,  the  part  of  the  diffraction  pattern  that  is  bright  enough  to 
be  detectable  covers  only  a  small  range  of  angles.  This  is  exactly 
what  we  would  expect  from  ray  optics:  the  rays  passing  through 
the  two  slits  would  remain  parallel,  and  would  continue  moving 
in  the  0  =  0  direction.  (In  fact  there  would  be  images  of  the  two 
separate  slits  on  the  screen,  but  our  analysis  was  all  in  terms  of 
angles,  so  we  should  not  expect  it  to  address  the  issue  of  whether 
there  is  structure  within  a  set  of  rays  that  are  all  traveling  in  the 
0  =  0  direction.) 

Spacing  of  the  fringes  at  small  angles  example  14 

At  small  angles,  we  can  use  the  approximation  sin  0  «  0,  which 
is  valid  if  0  is  measured  in  radians.  The  equation  for  double-slit 
diffraction  becomes  simply 


A  _  0 
d~  m’ 

which  can  be  solved  for  0  to  give 


The  difference  in  angle  between  successive  fringes  is  the  change 
in  0  that  results  from  changing  m  by  plus  or  minus  one, 


For  example,  if  we  write  07  for  the  angle  of  the  seventh  bright 
fringe  on  one  side  of  the  central  maximum  and  08  for  the  neigh¬ 
boring  one,  we  have 


08  -  07  = 


8A 

A 


d’ 


7A 


and  similarly  for  any  other  neighboring  pair  of  fringes. 
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III 


r  /  Interpretation  of  the  angu¬ 
lar  spacing  A0  in  example  14. 
It  can  be  defined  either  from 
maximum  to  maximum  or  from 
minimum  to  minimum.  Either  way, 
the  result  is  the  same.  It  does  not 
make  sense  to  try  to  interpret  A0 
as  the  width  of  a  fringe;  one  can 
see  from  the  graph  and  from  the 
image  below  that  it  is  not  obvious 
either  that  such  a  thing  is  well 
defined  or  that  it  would  be  the 
same  for  all  fringes. 
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t  /  A  double-slit  diffraction  pattern 
(top),  and  a  pattern  made  by  five 
slits  (bottom). 


Although  the  equation  X/d  =  sin  9/m  is  only  valid  for  a  double 
slit,  it  is  can  still  be  a  guide  to  our  thinking  even  if  we  are  observing 
diffraction  of  light  by  a  virus  or  a  flea’s  leg:  it  is  always  true  that 

(1)  large  values  of  X/d  lead  to  a  broad  diffraction  pattern,  and 

(2)  diffraction  patterns  are  repetitive. 

In  many  cases  the  equation  looks  just  like  X/d  =  sin  d/m  but 
with  an  extra  numerical  factor  thrown  in,  and  with  d  interpreted  as 
some  other  dimension  of  the  object,  e.g.,  the  diameter  of  a  piece  of 
wire. 

12.5.6  Repetition 

Suppose  we  replace  a  double  slit  with  a  triple  slit,  s.  We  can 
think  of  this  as  a  third  repetition  of  the  structures  that  were  present 
in  the  double  slit.  Will  this  device  be  an  improvement  over  the 
double  slit  for  any  practical  reasons? 

The  answer  is  yes,  as  can  be  shown  using  figure  u.  For  ease 
of  visualization,  I  have  violated  our  usual  rule  of  only  considering 
points  very  far  from  the  diffracting  object.  The  scale  of  the  drawing 
is  such  that  a  wavelengths  is  one  cm.  In  u/1,  all  three  waves  travel 
an  integer  number  of  wavelengths  to  reach  the  same  point,  so  there 
is  a  bright  central  spot,  as  we  would  expect  from  our  experience 
with  the  double  slit.  In  figure  u/2,  we  show  the  path  lengths  to 
a  new  point.  This  point  is  farther  from  slit  A  by  a  quarter  of  a 
wavelength,  and  correspondingly  closer  to  slit  C.  The  distance  from 
slit  B  has  hardly  changed  at  all.  Because  the  paths  lengths  traveled 
from  slits  A  and  C  differ  by  half  a  wavelength,  there  will  be  perfect 
destructive  interference  between  these  two  waves.  There  is  still  some 
uncanceled  wave  intensity  because  of  slit  B,  but  the  amplitude  will 
be  three  times  less  than  in  figure  u/1,  resulting  in  a  factor  of  9 
decrease  in  brightness.  Thus,  by  moving  off  to  the  right  a  little,  we 
have  gone  from  the  bright  central  maximum  to  a  point  that  is  quite 
dark. 

Now  let’s  compare  with  what  would  have  happened  if  slit  C  had 
been  covered,  creating  a  plain  old  double  slit.  The  waves  coming 
from  slits  A  and  B  would  have  been  out  of  phase  by  0.23  wavelengths, 
but  this  would  not  have  caused  very  severe  interference.  The  point 
in  figure  u/2  would  have  been  quite  brightly  lit  up. 

To  summarize,  we  have  found  that  adding  a  third  slit  narrows 
down  the  central  fringe  dramatically.  The  same  is  true  for  all  the 
other  fringes  as  well,  and  since  the  same  amount  of  energy  is  con- 
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u  / 1 .  There  is  a  bright  central  maximum.  2.  At  this  point  just  off  the  central  maximum,  the  path  lengths  traveled 
by  the  three  waves  have  changed. 


centrated  in  narrower  diffraction  fringes,  each  fringe  is  brighter  and 
easier  to  see,  t. 

This  is  an  example  of  a  more  general  fact  about  diffraction:  if 
some  feature  of  the  diffracting  object  is  repeated,  the  locations  of 
the  maxima  and  minima  are  unchanged,  but  they  become  narrower. 

Taking  this  reasoning  to  its  logical  conclusion,  a  diffracting  ob¬ 
ject  with  thousands  of  slits  would  produce  extremely  narrow  fringes. 
Such  an  object  is  called  a  diffraction  grating. 

12.5.7  Single-slit  diffraction 

If  we  use  only  a  single  slit,  is  there  diffraction?  If  the  slit  is  not 
wide  compared  to  a  wavelength  of  light,  then  we  can  approximate 
its  behavior  by  using  only  a  single  set  of  Huygens  ripples.  There 
are  no  other  sets  of  ripples  to  add  to  it,  so  there  are  no  constructive 
or  destructive  interference  effects,  and  no  maxima  or  minima.  The 
result  will  be  a  uniform  spherical  wave  of  light  spreading  out  in  all 
directions,  like  what  we  would  expect  from  a  tiny  lightbulb.  We 
could  call  this  a  diffraction  pattern,  but  it  is  a  completely  feature¬ 
less  one,  and  it  could  not  be  used,  for  instance,  to  determine  the 
wavelength  of  the  light,  as  other  diffraction  patterns  could. 

All  of  this,  however,  assumes  that  the  slit  is  narrow  compared  to 
a  wavelength  of  light.  If,  on  the  other  hand,  the  slit  is  broader,  there 
will  indeed  be  interference  among  the  sets  of  ripples  spreading  out 
from  various  points  along  the  opening.  Figure  v  shows  an  example 
with  water  waves,  and  figure  w  with  light. 

self-check  I 

How  does  the  wavelength  of  the  waves  compare  with  the  width  of  the 
slit  in  figure  v?  >  Answer,  p.  968 

We  will  not  go  into  the  details  of  the  analysis  of  single-slit  diffrac¬ 
tion,  but  let  us  see  how  its  properties  can  be  related  to  the  general 


v  /  Single-slit  diffraction  of 
water  waves. 


w  /  Single-slit  diffraction  of 
red  light.  Note  the  double  width 
of  the  central  maximum. 


x  /  A  pretty  good  simulation 
of  the  single-slit  pattern  of  figure 
v,  made  by  using  three  motors  to 
produce  overlapping  ripples  from 
three  neighboring  points  in  the 
water. 
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y  /  An  image  of  the  Pleiades 
star  cluster.  The  circular  rings 
around  the  bright  stars  are  due  to 
single-slit  diffraction  at  the  mouth 
of  the  telescope’s  tube. 


■ 


z  /  A  radio  telescope. 


things  we’ve  learned  about  diffraction.  We  know  based  on  scaling 
arguments  that  the  angular  sizes  of  features  in  the  diffraction  pat¬ 
tern  must  be  related  to  the  wavelength  and  the  width,  a,  of  the  slit 
by  some  relationship  of  the  form 

-aO. 

a 

This  is  indeed  true,  and  for  instance  the  angle  between  the  maximum 
of  the  central  fringe  and  the  maximum  of  the  next  fringe  on  one  side 
equals  1.5A/a.  Scaling  arguments  will  never  produce  factors  such  as 
the  1.5,  but  they  tell  us  that  the  answer  must  involve  A/a,  so  all  the 
familiar  qualitative  facts  are  true.  For  instance,  shorter-wavelength 
light  will  produce  a  more  closely  spaced  diffraction  pattern. 

An  important  scientific  example  of  single-slit  diffraction  is  in 
telescopes.  Images  of  individual  stars,  as  in  figure  y,  are  a  good  way 
to  examine  diffraction  effects,  because  all  stars  except  the  sun  are  so 
far  away  that  no  telescope,  even  at  the  highest  magnification,  can 
image  their  disks  or  surface  features.  Thus  any  features  of  a  star’s 
image  must  be  due  purely  to  optical  effects  such  as  diffraction.  A 
prominent  cross  appears  around  the  brightest  star,  and  dimmer  ones 
surround  the  dimmer  stars.  Something  like  this  is  seen  in  most  tele¬ 
scope  photos,  and  indicates  that  inside  the  tube  of  the  telescope 
there  were  two  perpendicular  struts  or  supports.  Light  diffracted 
around  these  struts.  You  might  think  that  diffraction  could  be  elim¬ 
inated  entirely  by  getting  rid  of  all  obstructions  in  the  tube,  but  the 
circles  around  the  stars  are  diffraction  effects  arising  from  single¬ 
slit  diffraction  at  the  mouth  of  the  telescope’s  tube!  (Actually  we 
have  not  even  talked  about  diffraction  through  a  circular  opening, 
but  the  idea  is  the  same.)  Since  the  angular  sizes  of  the  diffracted 
images  depend  on  A/a,  the  only  way  to  improve  the  resolution  of 
the  images  is  to  increase  the  diameter,  a,  of  the  tube.  This  is  one 
of  the  main  reasons  (in  addition  to  light-gathering  power)  why  the 
best  telescopes  must  be  very  large  in  diameter. 

self-check  J 

What  would  this  imply  about  radio  telescopes  as  compared  with  visible- 

light  telescopes?  >  Answer,  p. 

968 

Double-slit  diffraction  is  easier  to  understand  conceptually  than 
single-slit  diffraction,  but  if  you  do  a  double-slit  diffraction  experi¬ 
ment  in  real  life,  you  are  likely  to  encounter  a  complicated  pattern 
like  figure  aa/1,  rather  than  the  simpler  one,  2,  you  were  expecting. 
This  is  because  the  slits  are  fairly  big  compared  to  the  wavelength 
of  the  light  being  used.  We  really  have  two  different  distances  in 
our  pair  of  slits:  d,  the  distance  between  the  slits,  and  w,  the  width 
of  each  slit.  Remember  that  smaller  distances  on  the  object  the 
light  diffracts  around  correspond  to  larger  features  of  the  diffraction 
pattern.  The  pattern  1  thus  has  two  spacings  in  it:  a  short  spac- 
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ing  corresponding  to  the  large  distance  d,  and  a  long  spacing  that 
relates  to  the  small  dimension  w. 


aa/1.  A  diffraction  pattern  formed  by  a  real  double  slit.  The  width  of  each  slit  is  fairly  big  compared  to 
the  wavelength  of  the  light.  This  is  a  real  photo.  2.  This  idealized  pattern  is  not  likely  to  occur  in  real  life.  To  get 
it,  you  would  need  each  slit  to  be  so  narrow  that  its  width  was  comparable  to  the  wavelength  of  the  light,  but 
that’s  not  usually  possible.  This  is  not  a  real  photo.  3.  A  real  photo  of  a  single-slit  diffraction  pattern  caused  by 
a  slit  whose  width  is  the  same  as  the  widths  of  the  slits  used  to  make  the  top  pattern. 

Discussion  Question 

A  Why  is  it  optically  impossible  for  bacteria  to  evolve  eyes  that  use 
visible  light  to  form  images? 

12.5.8  *  The  principle  of  least  time 

In  subsection  12.1.5  and  12.4.5,  we  saw  how  in  the  ray  model 
of  light,  both  refraction  and  reflection  can  be  described  in  an  ele¬ 
gant  and  beautiful  way  by  a  single  principle,  the  principle  of  least 
time.  We  can  now  justify  the  principle  of  least  time  based  on  the 
wave  model  of  light.  Consider  an  example  involving  reflection,  ab. 

Starting  at  point  A,  Huygens’  principle  for  waves  tells  us  that  we 
can  think  of  the  wave  as  spreading  out  in  all  directions.  Suppose  we 
imagine  all  the  possible  ways  that  a  ray  could  travel  from  A  to  B. 

We  show  this  by  drawing  25  possible  paths,  of  which  the  central  one 
is  the  shortest.  Since  the  principle  of  least  time  connects  the  wave 
model  to  the  ray  model,  we  should  expect  to  get  the  most  accurate 
results  when  the  wavelength  is  much  shorter  than  the  distances  in¬ 
volved  —  for  the  sake  of  this  numerical  example,  let’s  say  that  a 
wavelength  is  1/10  of  the  shortest  reflected  path  from  A  to  B.  The 
table,  2,  shows  the  distances  traveled  by  the  25  rays. 

Note  how  similar  are  the  distances  traveled  by  the  group  of  7 
rays,  indicated  with  a  bracket,  that  come  closest  to  obeying  the 
principle  of  least  time.  If  we  think  of  each  one  as  a  wave,  then 
all  7  are  again  nearly  in  phase  at  point  B.  However,  the  rays  that 
are  farther  from  satisfying  the  principle  of  least  time  show  more 
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rapidly  changing  distances;  on  reuniting  at  point  B,  their  phases 
are  a  random  jumble,  and  they  will  very  nearly  cancel  each  other 
out.  Thus,  almost  none  of  the  wave  energy  delivered  to  point  B 
goes  by  these  longer  paths.  Physically  we  find,  for  instance,  that 
a  wave  pulse  emitted  at  A  is  observed  at  B  after  a  time  interval 
corresponding  very  nearly  to  the  shortest  possible  path,  and  the 
pulse  is  not  very  “smeared  out”  when  it  gets  there.  The  shorter 
the  wavelength  compared  to  the  dimensions  of  the  figure,  the  more 
accurate  these  approximate  statements  become. 

Instead  of  drawing  a  finite  number  of  rays,  such  25,  what  hap¬ 
pens  if  we  think  of  the  angle,  9,  of  emission  of  the  ray  as  a  continu¬ 
ously  varying  variable?  Minimizing  the  distance  L  requires 


Because  L  is  changing  slowly  in  the  vicinity  of  the  angle  that 
satisfies  the  principle  of  least  time,  all  the  rays  that  come  out  close 
to  this  angle  have  very  nearly  the  same  L,  and  remain  very  nearly  in 
phase  when  they  reach  B.  This  is  the  basic  reason  why  the  discrete 
table,  ab/2,  turned  out  to  have  a  group  of  rays  that  all  traveled 
nearly  the  same  distance. 

As  discussed  in  subsection  12.1.5,  the  principle  of  least  time 
is  really  a  principle  of  least  or  greatest  time.  This  makes  perfect 
sense,  since  d L/  &6  =  0  can  in  general  describe  either  a  minimum  or 
a  maximum 


ab  /  Light  could  take  many  The  Prin<:‘iple  of  least  time  is  very  general.  It  does  not  apply  just 

different  paths  from  A  to  B.  to  refraction  and  reflection  —  it  can  even  be  used  to  prove  that  light 

rays  travel  in  a  straight  line  through  empty  space,  without  taking 
detours!  This  general  approach  to  wave  motion  was  used  by  Richard 
Feynman,  one  of  the  pioneers  who  in  the  1950’s  reconciled  quantum 
mechanics  with  relativity.  A  very  readable  explanation  is  given  in 
a  book  Feynman  wrote  for  laypeople,  QED:  The  Strange  Theory  of 
Light  and  Matter. 
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Problems 


The  symbols  - ,  etc.  are  explained  on  page  818. 

1  Draw  a  ray  diagram  showing  why  a  small  light  source  (a 

candle,  say)  produces  sharper  shadows  than  a  large  one  (e.g.,  a  long 
fluorescent  bulb) .  - 

2  A  Global  Positioning  System  (GPS)  receiver  is  a  device  that 

lets  you  figure  out  where  you  are  by  receiving  timed  radio  signals 
from  satellites.  It  works  by  measuring  the  travel  time  for  the  signals, 
which  is  related  to  the  distance  between  you  and  the  satellite.  By 
finding  the  ranges  to  several  different  satellites  in  this  way,  it  can 
pin  down  your  location  in  three  dimensions  to  within  a  few  meters. 
How  accurate  does  the  measurement  of  the  time  delay  have  to  be  to 
determine  your  position  to  this  accuracy?  - 

3  Estimate  the  frequency  of  an  electromagnetic  wave  whose 

wavelength  is  similar  in  size  to  an  atom  (about  a  nm).  Referring 
back  to  figure  o  on  p.  709,  in  what  part  of  the  electromagnetic 
spectrum  would  such  a  wave  lie  (infrared,  gamma-rays,  . . . )?  - 

4  The  Stealth  Bomber  is  designed  with  flat,  smooth  surfaces. 
Why  would  this  make  it  difficult  to  detect  using  radar? 

5  The  natives  of  planet  Wumpus  play  pool  using  light  rays  on 
an  eleven-sided  table  with  mirrors  for  bumpers,  shown  in  the  figure 
on  the  next  page.  Trace  this  shot  accurately  with  a  ruler  to  reveal 
the  hidden  message.  To  get  good  enough  accuracy,  you’ll  need  to 
photocopy  the  page  (or  download  the  book  and  print  the  page)  and 
construct  each  reflection  using  a  protractor. 


Problem  5. 
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6  The  figure  on  the  next  page  shows  a  curved  (parabolic)  mir¬ 
ror,  with  three  parallel  light  rays  coming  toward  it.  One  ray  is 
approaching  along  the  mirror’s  center  line,  (a)  Continue  the  light 
rays  until  they  are  about  to  undergo  their  second  reflection.  To  get 
good  enough  accuracy,  you’ll  need  to  photocopy  the  page  (or  down¬ 
load  the  book  and  print  the  page)  and  draw  in  the  normal  at  each 
place  where  a  ray  is  reflected.  What  do  you  notice?  (b)  Make  up 
an  example  of  a  practical  use  for  this  device,  (c)  How  could  you 
use  this  mirror  with  a  small  lightbulb  to  produce  a  parallel  beam  of 
light  rays  going  off  to  the  right?  ■ 


7  A  man  is  walking  at  1.0  m/s  directly  towards  a  flat  mirror.  At 
what  speed  is  his  separation  from  his  image  decreasing?  V  ^ 

8  If  a  mirror  on  a  wall  is  only  big  enough  for  you  to  see  your¬ 
self  from  your  head  down  to  your  waist,  can  you  see  your  entire 
body  by  backing  up?  Test  this  experimentally  and  come  up  with  an 
explanation  for  your  observations,  including  a  ray  diagram. 

Note  that  when  you  do  the  experiment,  it’s  easy  to  confuse  your¬ 
self  if  the  mirror  is  even  a  tiny  bit  off  of  vertical.  One  way  to  check 
yourself  is  to  artificially  lower  the  top  of  the  mirror  by  putting  a 
piece  of  tape  or  a  post-it  note  where  it  blocks  your  view  of  the  top 
of  your  head.  You  can  then  check  whether  you  are  able  to  see  more 
of  yourself  both  above  and  below  by  backing  up. 
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9  In  section  12.2  we’ve  only  done  examples  of  mirrors  with 

hollowed-out  shapes  (called  concave  mirrors).  Now  draw  a  ray  dia¬ 
gram  for  a  curved  mirror  that  has  a  bulging  outward  shape  (called  a 
convex  mirror),  (a)  How  does  the  image’s  distance  from  the  mirror 
compare  with  the  actual  object’s  distance  from  the  mirror?  From 
this  comparison,  determine  whether  the  magnification  is  greater 
than  or  less  than  one.  (b)  Is  the  image  real,  or  virtual?  Could 
this  mirror  ever  make  the  other  type  of  image?  ■ 

10  As  discussed  in  question  9,  there  are  two  types  of  curved 
mirrors,  concave  and  convex.  Make  a  list  of  all  the  possible  com¬ 
binations  of  types  of  images  (virtual  or  real)  with  types  of  mirrors 
(concave  and  convex).  (Not  all  of  the  four  combinations  are  phys¬ 
ically  possible.)  Now  for  each  one,  use  ray  diagrams  to  determine 
whether  increasing  the  distance  of  the  object  from  the  mirror  leads 
to  an  increase  or  a  decrease  in  the  distance  of  the  image  from  the 
mirror. 

Draw  BIG  ray  diagrams!  Each  diagram  should  use  up  about  half 
a  page  of  paper. 

Some  tips:  To  draw  a  ray  diagram,  you  need  two  rays.  For  one 
of  these,  pick  the  ray  that  comes  straight  along  the  mirror’s  axis, 
since  its  reflection  is  easy  to  draw.  After  you  draw  the  two  rays  and 
locate  the  image  for  the  original  object  position,  pick  a  new  object 
position  that  results  in  the  same  type  of  image,  and  start  a  new  ray 
diagram,  in  a  different  color  of  pen,  right  on  top  of  the  first  one.  For 
the  two  new  rays,  pick  the  ones  that  just  happen  to  hit  the  mirror  at 
the  same  two  places;  this  makes  it  much  easier  to  get  the  result  right 
without  depending  on  extreme  accuracy  in  your  ability  to  draw  the 
reflected  rays.  - 

11  If  the  user  of  an  astronomical  telescope  moves  her  head 
closer  to  or  farther  away  from  the  image  she  is  looking  at,  does 
the  magnification  change?  Does  the  angular  magnification  change? 
Explain.  (For  simplicity,  assume  that  no  eyepiece  is  being  used.) 

■ 

12  In  figure  g/2  in  on  page  760,  only  the  image  of  my  fore¬ 

head  was  located  by  drawing  rays.  Either  photocopy  the  figure  or 
download  the  book  and  print  out  the  relevant  page.  On  this  copy 
of  the  figure,  make  a  new  set  of  rays  coming  from  my  chin,  and 
locate  its  image.  To  make  it  easier  to  judge  the  angles  accurately, 
draw  rays  from  the  chin  that  happen  to  hit  the  mirror  at  the  same 
points  where  the  two  rays  from  the  forehead  were  shown  hitting  it. 
By  comparing  the  locations  of  the  chin’s  image  and  the  forehead’s 
image,  verify  that  the  image  is  actually  upside-down,  as  shown  in 
the  original  figure.  S 

13  The  figure  shows  four  points  where  rays  cross.  Of  these, 

which  are  image  points?  Explain.  ^ 


Problem  13. 
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Problem  18. 


14  Here’s  a  game  my  kids  like  to  play.  I  sit  next  to  a  sunny 
window,  and  the  sun  reflects  from  the  glass  on  my  watch,  making  a 
disk  of  light  on  the  wall  or  floor,  which  they  pretend  to  chase  as  I 
move  it  around.  Is  the  spot  a  disk  because  that’s  the  shape  of  the 
sun,  or  because  it’s  the  shape  of  my  watch?  In  other  words,  would 
a  square  watch  make  a  square  spot,  or  do  we  just  have  a  circular 
image  of  the  circular  sun,  which  will  be  circular  no  matter  what? 

a 

15  Apply  the  equation  M  =  di/d0  to  the  case  of  a  flat  mirror. 

a 

16  Use  the  method  described  in  the  text  to  derive  the  equation 

relating  object  distance  to  image  distance  for  the  case  of  a  virtual 
image  produced  by  a  converging  mirror.  >  Solution,  p.  981  - 

17  Find  the  focal  length  of  the  mirror  in  problem  6  .  V  M 

18  Rank  the  focal  lengths  of  the  mirrors  in  the  figure,  from 
shortest  to  longest.  Explain. 

19  (a)  A  converging  mirror  with  a  focal  length  of  20  cm  is  used 

to  create  an  image,  using  an  object  at  a  distance  of  10  cm.  Is  the 
image  real,  or  is  it  virtual?  (b)  How  about  /  =  20  cm  and  dQ  =  30 
cm?  (c)  What  if  it  was  a  diverging  mirror  with  /  =  20  cm  and 
da  =  10  cm?  (d)  A  diverging  mirror  with  /  =  20  cm  and  dQ  =  30 
cm?  >  Solution,  p.  982  ^ 

20  (a)  Make  up  a  numerical  example  of  a  virtual  image  formed  by 
a  converging  mirror  with  a  certain  focal  length,  and  determine  the 
magnification.  (You  will  need  the  result  of  problem  16.)  Make  sure 
to  choose  values  of  dQ  and  /  that  would  actually  produce  a  virtual 
image,  uot  a  real  one.  Now  change  the  location  of  the  object  a 
little  bit  and  redetermine  the  magnification,  showing  that  it  changes. 
At  my  local  department  store,  the  cosmetics  department  sells  hand 
mirrors  advertised  as  giving  a  magnification  of  5  times.  How  would 
you  interpret  this? 

(b)  Suppose  a  Newtonian  telescope  is  being  used  for  astronom¬ 
ical  observing.  Assume  for  simplicity  that  uo  eyepiece  is  used,  and 
assume  a  value  for  the  focal  length  of  the  mirror  that  would  be 
reasonable  for  an  amateur  instrument  that  is  to  fit  in  a  closet.  Is 
the  angular  magnification  different  for  objects  at  different  distances? 
For  example,  you  could  consider  two  planets,  one  of  which  is  twice 
as  far  as  the  other.  ^ 

21  (a)  Find  a  case  where  the  magnification  of  a  curved  mirror 
is  infinite.  Is  the  angular  magnification  infinite  from  any  realistic 
viewing  position?  (b)  Explain  why  an  arbitrarily  large  magnification 
can’t  be  achieved  by  having  a  sufficiently  small  value  of  dQ. 
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22  A  concave  surface  that  reflects  sound  waves  can  act  just  like 

a  converging  mirror.  Suppose  that,  standing  near  such  a  surface, 
you  are  able  to  find  a  point  where  you  can  place  your  head  so  that 
your  own  whispers  are  focused  back  on  your  head,  so  that  they 
sound  loud  to  you.  Given  your  distance  to  the  surface,  what  is  the 
surface’s  focal  length?  v?  ^ 

23  The  figure  shows  a  device  for  constructing  a  realistic  optical 
illusion.  Two  mirrors  of  equal  focal  length  are  put  against  each 
other  with  their  silvered  surfaces  facing  inward.  A  small  object 
placed  in  the  bottom  of  the  cavity  will  have  its  image  projected  in 
the  air  above.  The  way  it  works  is  that  the  top  mirror  produces  a 
virtual  image,  and  the  bottom  mirror  then  creates  a  real  image  of 
the  virtual  image,  (a)  Show  that  if  the  image  is  to  be  positioned 
as  shown,  at  the  mouth  of  the  cavity,  then  the  focal  length  of  the 
mirrors  is  related  to  the  dimension  h  via  the  equation 


Problem  23. 


1 

7 


h+  U  “  7 


-l  • 


(b)  Restate  the  equation  in  terms  of  a  single  variable  x  =  h/ /,  and 
show  that  there  are  two  solutions  for  x.  Which  solution  is  physically 
consistent  with  the  assumptions  of  the  calculation? 

24  (a)  A  converging  mirror  is  being  used  to  create  a  virtual 

image.  What  is  the  range  of  possible  magnifications?  (b)  Do  the 
same  for  the  other  types  of  images  that  can  be  formed  by  curved 
mirrors  (both  converging  and  diverging).  - 

25  A  diverging  mirror  of  focal  length  /  is  fixed,  and  faces  down. 
An  object  is  dropped  from  the  surface  of  the  mirror,  and  falls  away 
from  it  with  acceleration  g.  The  goal  of  the  problem  is  to  find  the 
maximum  velocity  of  the  image. 

(a)  Describe  the  motion  of  the  image  verbally,  and  explain  why  we 
should  expect  there  to  be  a  maximum  velocity. 

(b)  Use  arguments  based  on  units  to  determine  the  form  of  the 
solution,  up  to  an  unknown  unitless  multiplicative  constant. 

(c)  Complete  the  solution  by  determining  the  unitless  constant. 

a 

26  Diamond  has  an  index  of  refraction  of  2.42,  and  part  of 

the  reason  diamonds  sparkle  is  that  this  encourages  a  light  ray  to 
undergo  many  total  internal  reflections  before  it  emerges,  (a)  Cal¬ 
culate  the  critical  angle  at  which  total  internal  reflection  occurs  in 
diamond,  (b)  Explain  the  interpretation  of  your  result:  Is  it  mea¬ 
sured  from  the  normal,  or  from  the  surface?  Is  it  a  minimum  angle 
for  total  internal  reflection,  or  is  it  a  maximum?  How  would  the 
critical  angle  have  been  different  for  a  substance  such  as  glass  or 
plastic,  with  a  lower  index  of  refraction?  V  ^ 
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27  Suppose  a  converging  lens  is  constructed  of  a  type  of  plastic 
whose  index  of  refraction  is  less  than  that  of  water.  How  will  the 
lens’s  behavior  be  different  if  it  is  placed  underwater? 

28  There  are  two  main  types  of  telescopes,  refracting  (using  a 

lens)  and  reflecting  (using  a  mirror,  as  in  figure  i  on  p.  762).  (Some 
telescopes  use  a  mixture  of  the  two  types  of  elements:  the  light  first 
encounters  a  large  curved  mirror,  and  then  goes  through  an  eyepiece 
that  is  a  lens.  To  keep  things  simple,  assume  no  eyepiece  is  used.) 
What  implications  would  the  color-dependence  of  focal  length  have 
for  the  relative  merits  of  the  two  types  of  telescopes?  Describe  the 
case  where  an  image  is  formed  of  a  white  star.  You  may  find  it 
helpful  to  draw  a  ray  diagram.  ■ 

29  Based  on  Snell’s  law,  explain  why  rays  of  light  passing  through 
the  edges  of  a  converging  lens  are  bent  more  than  rays  passing 
through  parts  closer  to  the  center.  It  might  seem  like  it  should 
be  the  other  way  around,  since  the  rays  at  the  edge  pass  through 
less  glass  —  shouldn’t  they  be  affected  less?  In  your  answer: 

•  Include  a  ray  diagram  showing  a  huge,  full-page,  close-up  view 
of  the  relevant  part  of  the  lens. 

•  Make  use  of  the  fact  that  the  front  and  back  surfaces  aren’t 
always  parallel;  a  lens  in  which  the  front  and  back  surfaces  are 
always  parallel  doesn’t  focus  light  at  all,  so  if  your  explanation 
doesn’t  make  use  of  this  fact,  your  argument  must  be  incorrect. 

•  Make  sure  your  argument  still  works  even  if  the  rays  don’t 
come  in  parallel  to  the  axis  or  from  a  point  on  the  axis. 


30  When  you  take  pictures  with  a  camera,  the  distance  between 
the  lens  and  the  film  has  to  be  adjusted,  depending  on  the  distance 
at  which  you  want  to  focus.  This  is  done  by  moving  the  lens.  If 
you  want  to  change  your  focus  so  that  you  can  take  a  picture  of 
something  farther  away,  which  way  do  you  have  to  move  the  lens? 
Explain  using  ray  diagrams.  [Based  on  a  problem  by  Eric  Mazur.] 

9 

31  When  swimming  underwater,  why  is  your  vision  made  much 

clearer  by  wearing  goggles  with  flat  pieces  of  glass  that  trap  air 
behind  them?  [Hint:  You  can  simplify  your  reasoning  by  considering 
the  special  case  where  you  are  looking  at  an  object  far  away,  and 
along  the  optic  axis  of  the  eye.]  9 

32  An  object  is  more  than  one  focal  length  from  a  converging 
lens,  (a)  Draw  a  ray  diagram,  (b)  Using  reasoning  like  that  devel¬ 
oped  in  section  12.3,  determine  the  positive  and  negative  signs  in 
the  equation  1/f  =  ±l/dj  ±  l/d0.  (c)  The  images  of  the  rose  in 
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section  4.2  were  made  using  a  lens  with  a  focal  length  of  23  cm.  If 
the  lens  is  placed  80  cm  from  the  rose,  locate  the  image.  V  ^ 

33  The  figure  shows  four  lenses.  Lens  1  has  two  spherical  sur¬ 
faces.  Lens  2  is  the  same  as  lens  1  but  turned  around.  Lens  3  is 
made  by  cutting  through  lens  1  and  turning  the  bottom  around. 
Lens  4  is  made  by  cutting  a  central  circle  out  of  lens  1  and  recessing 
it. 

(a)  A  parallel  beam  of  light  enters  lens  1  from  the  left,  parallel 
to  its  axis.  Reasoning  based  on  Snell’s  law,  will  the  beam  emerging 
from  the  lens  be  bent  inward,  or  outward,  or  will  it  remain  parallel 
to  the  axis?  Explain  your  reasoning.  As  part  of  your  answer,  make 
a  huge  drawing  of  one  small  part  of  the  lens,  and  apply  Snell’s  law 
at  both  interfaces.  Recall  that  rays  are  bent  more  if  they  come  to 
the  interface  at  a  larger  angle  with  respect  to  the  normal. 

(b)  What  will  happen  with  lenses  2,3,  and  4?  Explain.  Drawings 

are  not  necessary.  ■ 

34  The  drawing  shows  the  anatomy  of  the  human  eye,  at  twice 

life  size.  Find  the  radius  of  curvature  of  the  outer  surface  of  the 
cornea  by  measurements  on  the  figure,  and  then  derive  the  focal 
length  of  the  air-cornea  interface,  where  almost  all  the  focusing  of 
light  occurs.  You  will  need  to  use  physical  reasoning  to  modify 
the  lensmaker’s  equation  for  the  case  where  there  is  only  a  single 
refracting  surface.  Assume  that  the  index  of  refraction  of  the  cornea 
is  essentially  that  of  water.  M 

35  An  object  is  less  than  one  focal  length  from  a  converging  lens, 

(a)  Draw  a  ray  diagram,  (b)  Using  reasoning  like  that  developed 
in  section  12.3,  determine  the  positive  and  negative  signs  in  the 
equation  1//  =  ±l/d*  ±  l/d0.  (c)  The  images  of  the  rose  in  section 
4.2  were  made  using  a  lens  with  a  focal  length  of  23  cm.  If  the  lens 
is  placed  10  cm  from  the  rose,  locate  the  image.  v  ^ 

36  Nearsighted  people  wear  glasses  whose  lenses  are  diverging, 

(a)  Draw  a  ray  diagram.  For  simplicity  pretend  that  there  is  no 
eye  behind  the  glasses,  (b)  Using  reasoning  like  that  developed 
in  section  12.3,  determine  the  positive  and  negative  signs  in  the 
equation  1//  =  ±l/di  ±  l/d0.  (c)  If  the  focal  length  of  the  lens  is 
50.0  cm,  and  the  person  is  looking  at  an  object  at  a  distance  of  80.0 
cm,  locate  the  image.  V  ■ 

37  (a)  Light  is  being  reflected  diffusely  from  an  object  1.000  m 
underwater.  The  light  that  comes  up  to  the  surface  is  refracted  at 
the  water-air  interface.  If  the  refracted  rays  all  appear  to  come  from 
the  same  point,  then  there  will  be  a  virtual  image  of  the  object  in 
the  water,  above  the  object’s  actual  position,  which  will  be  visible 
to  an  observer  above  the  water.  Consider  three  rays,  A,  B  and  C, 
whose  angles  in  the  water  with  respect  to  the  normal  are  9{  =  0.000°, 
1.000°  and  20.000°  respectively.  Find  the  depth  of  the  point  at  which 
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Problem  34. 
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Problem  39. 


the  refracted  parts  of  A  and  B  appear  to  have  intersected,  and  do 
the  same  for  A  and  C.  Show  that  the  intersections  are  at  nearly  the 
same  depth,  but  not  quite.  [Check:  The  difference  in  depth  should 
be  about  4  cm.] 

(b)  Since  all  the  refracted  rays  do  not  quite  appear  to  have  come 
from  the  same  point,  this  is  technically  not  a  virtual  image.  In 
practical  terms,  what  effect  would  this  have  on  what  you  see? 

(c)  In  the  case  where  the  angles  are  all  small,  use  algebra  and 

trig  to  show  that  the  refracted  rays  do  appear  to  come  from  the 
same  point,  and  find  an  equation  for  the  depth  of  the  virtual  im¬ 
age.  Do  not  put  in  any  numerical  values  for  the  angles  or  for  the 
indices  of  refraction  —  just  keep  them  as  symbols.  You  will  need 
the  approximation  sin  9  ~  tan  9  ~  9,  which  is  valid  for  small  angles 
measured  in  radians.  ■ 

38  Prove  that  the  principle  of  least  time  leads  to  Snell’s  law. 

39  Two  standard  focal  lengths  for  camera  lenses  are  50  mm 
(standard)  and  28  mm  (wide-angle).  To  see  how  the  focal  lengths 
relate  to  the  angular  size  of  the  held  of  view,  it  is  helpful  to  visualize 
things  as  represented  in  the  figure.  Instead  of  showing  many  rays 
coming  from  the  same  point  on  the  same  object,  as  we  normally  do, 
the  figure  shows  two  rays  from  two  different  objects.  Although  the 
lens  will  intercept  infinitely  many  rays  from  each  of  these  points,  we 
have  shown  only  the  ones  that  pass  through  the  center  of  the  lens, 
so  that  they  suffer  no  angular  deflection.  (Any  angular  deflection  at 
the  front  surface  of  the  lens  is  canceled  by  an  opposite  deflection  at 
the  back,  since  the  front  and  back  surfaces  are  parallel  at  the  lens’s 
center.)  What  is  special  about  these  two  rays  is  that  they  are  aimed 
at  the  edges  of  one  35-mm-wide  frame  of  film;  that  is,  they  show 
the  limits  of  the  field  of  view.  Throughout  this  problem,  we  assume 
that  da  is  much  greater  than  dt .  (a)  Compute  the  angular  width 
of  the  camera’s  held  of  view  when  these  two  lenses  are  used,  (b) 
Use  small-angle  approximations  to  find  a  simplified  equation  for  the 
angular  width  of  the  held  of  view,  9 ,  in  terms  of  the  focal  length, 
/,  and  the  width  of  the  him,  w.  Your  equation  should  not  have 
any  trig  functions  in  it.  Compare  the  results  of  this  approximation 
with  your  answers  from  part  a.  (c)  Suppose  that  we  are  holding 
constant  the  aperture  (amount  of  surface  area  of  the  lens  being 
used  to  collect  light).  When  switching  from  a  50-mm  lens  to  a  28- 
mm  lens,  how  many  times  longer  or  shorter  must  the  exposure  be 
in  order  to  make  a  properly  developed  picture,  i.e.,  one  that  is  not 
under-  or  overexposed?  [Based  on  a  problem  by  Arnold  Arons.] 

>  Solution,  p.  982  - 
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40  A  nearsighted  person  is  one  whose  eyes  focus  light  too 
strongly,  and  who  is  therefore  unable  to  relax  the  lens  inside  her 
eye  sufficiently  to  form  an  image  on  her  retina  of  an  object  that  is 
too  far  away. 

(a)  Draw  a  ray  diagram  showing  what  happens  when  the  person 
tries,  with  uncorrected  vision,  to  focus  at  infinity. 

(b)  What  type  of  lenses  do  her  glasses  have?  Explain. 

(c)  Draw  a  ray  diagram  showing  what  happens  when  she  wears 
glasses.  Locate  both  the  image  formed  by  the  glasses  and  the  final 
image. 

(d)  Suppose  she  sometimes  uses  contact  lenses  instead  of  her 
glasses.  Does  the  focal  length  of  her  contacts  have  to  be  less  than, 
equal  to,  or  greater  than  that  of  her  glasses?  Explain. 


41  Fred’s  eyes  are  able  to  focus  on  things  as  close  as  5.0  cm. 
Fred  holds  a  magnifying  glass  with  a  focal  length  of  3.0  cm  at  a 
height  of  2.0  cm  above  a  flatworm.  (a)  Locate  the  image,  and  find 
the  magnification,  (b)  Without  the  magnifying  glass,  from  what 
distance  would  Fred  want  to  view  the  flatworm  to  see  its  details 
as  well  as  possible?  With  the  magnifying  glass?  (c)  Compute  the 
angular  magnification.  ^ 
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42  Panel  1  of  the  figure  shows  the  optics  inside  a  pair  of  binoc¬ 
ulars.  They  are  essentially  a  pair  of  telescopes,  one  for  each  eye. 
But  to  make  them  more  compact,  and  allow  the  eyepieces  to  be  the 
right  distance  apart  for  a  human  face,  they  incorporate  a  set  of  eight 
prisms,  which  fold  the  light  path.  In  addition,  the  prisms  make  the 
image  upright.  Panel  2  shows  one  of  these  prisms,  known  as  a  Porro 
prism.  The  light  enters  along  a  normal,  undergoes  two  total  internal 
reflections  at  angles  of  45  degrees  with  respect  to  the  back  surfaces, 
and  exits  along  a  normal.  The  image  of  the  letter  R  has  been  flipped 
across  the  horizontal.  Panel  3  shows  a  pair  of  these  prisms  glued 
together.  The  image  will  be  flipped  across  both  the  horizontal  and 
the  vertical,  which  makes  it  oriented  the  right  way  for  the  user  of 
the  binoculars. 

(a)  Find  the  minimum  possible  index  of  refraction  for  the  glass  used 
in  the  prisms. 

(b)  For  a  material  of  this  minimal  index  of  refraction,  find  the  frac¬ 
tion  of  the  incoming  light  that  will  be  lost  to  reflection  in  the  four 
Porro  prisms  on  a  each  side  of  a  pair  of  binoculars.  (See  section  6.2.) 
In  real,  high-quality  binoculars,  the  optical  surfaces  of  the  prisms 
have  antireflective  coatings,  but  carry  out  your  calculation  for  the 
case  where  there  is  no  such  coating. 

(c)  Discuss  the  reasons  why  a  designer  of  binoculars  might  or  might 

not  want  to  use  a  material  with  exactly  the  index  of  refraction  found 
in  part  a.  H 


43  It  would  be  annoying  if  your  eyeglasses  produced  a  magnified 
or  reduced  image.  Prove  that  when  the  eye  is  very  close  to  a  lens, 
and  the  lens  produces  a  virtual  image,  the  angular  magnification  is 
always  approximately  equal  to  1  (regardless  of  whether  the  lens  is 
diverging  or  converging).  ■ 
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44  The  figure  shows  a  diffraction  pattern  made  by  a  double 
slit,  along  with  an  image  of  a  meter  stick  to  show  the  scale.  Sketch 
the  diffraction  pattern  from  the  figure  on  your  paper.  Now  consider 
the  four  variables  in  the  equation  X/d  =  sin  6/m.  Which  of  these 
are  the  same  for  all  five  fringes,  and  which  are  different  for  each 
fringe?  Which  variable  would  you  naturally  use  in  order  to  label 
which  fringe  was  which?  Label  the  fringes  on  your  sketch  using  the 
values  of  that  variable.  ^ 


Problems  44  and  47. 


45  Match  gratings  A-C  with  the  diffraction  patterns  1-3  that 
they  produce.  Explain. 


Problems 


813 


46  The  figure  below  shows  two  diffraction  patterns.  The  top  one 
was  made  with  yellow  light,  and  the  bottom  one  with  red.  Could 
the  slits  used  to  make  the  two  patterns  have  been  the  same? 


47  The  figure  on  p.  813  shows  a  diffraction  pattern  made  by  a 

double  slit,  along  with  an  image  of  a  meter  stick  to  show  the  scale. 
The  slits  were  146  cm  away  from  the  screen  on  which  the  diffraction 
pattern  was  projected.  The  spacing  of  the  slits  was  0.050  mm.  What 
was  the  wavelength  of  the  light?  ■ 

48  Why  would  blue  or  violet  light  be  the  best  for  microscopy? 

■ 

49  The  figure  below  shows  two  diffraction  patterns,  both  made 
with  the  same  wavelength  of  red  light,  (a)  What  type  of  slits  made 
the  patterns?  Is  it  a  single  slit,  double  slits,  or  something  else? 
Explain,  (b)  Compare  the  dimensions  of  the  slits  used  to  make  the 
top  and  bottom  pattern.  Give  a  numerical  ratio,  and  state  which 
way  the  ratio  is,  i.e. ,  which  slit  pattern  was  the  larger  one.  Explain. 


50  When  white  light  passes  through  a  diffraction  grating,  what 
is  the  smallest  value  of  m  for  which  the  visible  spectrum  of  order  nn 
overlaps  the  next  one,  of  order  m  +  1?  (The  visible  spectrum  runs 
from  about  400  nm  to  about  700  nrn.)  - 
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Problem  51.  This  image  of 
the  Pleiades  star  cluster  shows 
haloes  around  the  stars  due  to  the 
wave  nature  of  light. 


51  For  star  images  such  as  the  ones  in  figure  y,  estimate  the 
angular  width  of  the  diffraction  spot  due  to  diffraction  at  the  mouth 
of  the  telescope.  Assume  a  telescope  with  a  diameter  of  10  meters 
(the  largest  currently  in  existence),  and  light  with  a  wavelength  in 
the  middle  of  the  visible  range.  Compare  with  the  actual  angular 
size  of  a  star  of  diameter  109  m  seen  from  a  distance  of  101'  nr. 
What  does  this  tell  you? 

52  The  figure  below  shows  three  diffraction  patterns.  All  were 
made  under  identical  conditions,  except  that  a  different  set  of  double 
slits  was  used  for  each  one.  The  slits  used  to  make  the  top  pattern 
had  a  center-to-center  separation  d  =  0.50  mm,  and  each  slit  was 
w  =  0.04  mm  wide,  (a)  Determine  d  and  w  for  the  slits  used  to 
make  the  pattern  in  the  middle,  (b)  Do  the  same  for  the  slits  used 
to  make  the  bottom  pattern. 
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53  The  beam  of  a  laser  passes  through  a  diffraction  grating, 

fans  out,  and  illuminates  a  wall  that  is  perpendicular  to  the  original 
beam,  lying  at  a  distance  of  2.0  m  from  the  grating.  The  beam 
is  produced  by  a  helium-neon  laser,  and  has  a  wavelength  of  694.3 
nrn.  The  grating  has  2000  lines  per  centimeter,  (a)  What  is  the 
distance  on  the  wall  between  the  central  maximum  and  the  maxima 
immediately  to  its  right  and  left?  (b)  How  much  does  your  answer 
change  when  you  use  the  small-angle  approximations  #  ~  sin#  ~ 
tan#?  V  ■ 

54  Ultrasound,  i.e.,  sound  waves  with  frequencies  too  high  to  be 

audible,  can  be  used  for  imaging  fetuses  in  the  womb  or  for  break¬ 
ing  up  kidney  stones  so  that  they  can  be  eliminated  by  the  body. 
Consider  the  latter  application.  Lenses  can  be  built  to  focus  sound 
waves,  but  because  the  wavelength  of  the  sound  is  not  all  that  small 
compared  to  the  diameter  of  the  lens,  the  sound  will  not  be  concen¬ 
trated  exactly  at  the  geometrical  focal  point.  Instead,  a  diffraction 
pattern  will  be  created  with  an  intense  central  spot  surrounded  by 
fainter  rings.  About  85%  of  the  power  is  concentrated  within  the 
central  spot.  The  angle  of  the  first  minimum  (surrounding  the  cen¬ 
tral  spot)  is  given  by  sin#  =  A/6,  where  6  is  the  diameter  of  the  lens. 
This  is  similar  to  the  corresponding  equation  for  a  single  slit,  but 
with  a  factor  of  1.22  in  front  which  arises  from  the  circular  shape  of 
the  aperture.  Let  the  distance  from  the  lens  to  the  patient’s  kidney 
stone  be  L  =  20  cm.  You  will  want  /  >  20  kHz,  so  that  the  sound 
is  inaudible.  Find  values  of  6  and  /  that  would  result  in  a  usable 
design,  where  the  central  spot  is  small  enough  to  lie  within  a  kidney 
stone  1  cm  in  diameter.  ■ 

55  Under  what  circumstances  could  one  get  a  mathematically 
undefined  result  by  solving  the  double-slit  diffraction  equation  for  #? 
Give  a  physical  interpretation  of  what  would  actually  be  observed. 

■ 

56  When  ultrasound  is  used  for  medical  imaging,  the  frequency 
may  be  as  high  as  5-20  MHz.  Another  medical  application  of  ul¬ 
trasound  is  for  therapeutic  heating  of  tissues  inside  the  body;  here, 
the  frequency  is  typically  1-3  MHz.  What  fundamental  physical  rea¬ 
sons  could  you  suggest  for  the  use  of  higher  frequencies  for  imaging? 
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57  Suppose  we  have  a  polygonal  room  whose  walls  are  mirrors, 
and  there  a  pointlike  light  source  in  the  room.  In  most  such  exam¬ 
ples,  every  point  in  the  room  ends  up  being  illuminated  by  the  light 
source  after  some  finite  number  of  reflections.  A  difficult  mathemat¬ 
ical  question,  first  posed  in  the  middle  of  the  last  century,  is  whether 
it  is  ever  possible  to  have  an  example  in  which  the  whole  room  is 
not  illuminated.  (Rays  are  assumed  to  be  absorbed  if  they  strike 
exactly  at  a  vertex  of  the  polygon,  or  if  they  pass  exactly  through 
the  plane  of  a  mirror.) 

The  problem  was  finally  solved  in  1995  by  G.W.  Tokarsky,  who 
found  an  example  of  a  room  that  was  not  illuminable  from  a  cer¬ 
tain  point.  Figure  57  shows  a  slightly  simpler  example  found  two 
years  later  by  D.  Castro.  If  a  light  source  is  placed  at  either  of  the 
locations  shown  with  dots,  the  other  dot  remains  unilluminated,  al¬ 
though  every  other  point  is  lit  up.  It  is  not  straightforward  to  prove 
rigorously  that  Castro’s  solution  has  this  property.  However,  the 
plausibility  of  the  solution  can  be  demonstrated  as  follows. 

Suppose  the  light  source  is  placed  at  the  right-hand  dot.  Locate 
all  the  images  formed  by  single  reflections.  Note  that  they  form  a 
regular  pattern.  Convince  yourself  that  none  of  these  images  illumi¬ 
nates  the  left-hand  dot.  Because  of  the  regular  pattern,  it  becomes 
plausible  that  even  if  we  form  images  of  images,  images  of  images 
of  images,  etc.,  none  of  them  will  ever  illuminate  the  other  dot. 

There  are  various  other  versions  of  the  problem,  some  of  which 
remain  unsolved.  The  book  by  Klee  and  Wagon  gives  a  good  in¬ 
troduction  to  the  topic,  although  it  predates  Tokarsky  and  Castro’s 
work. 


Problem  57. 
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58  A  mechanical  linkage  is  a  device  that  changes  one  type  of 
motion  into  another.  The  most  familiar  example  occurs  in  a  gasoline 
car’s  engine,  where  a  connecting  rod  changes  the  linear  motion  of  the 
piston  into  circular  motion  of  the  crankshaft.  The  top  panel  of  the 
hgure  shows  a  mechanical  linkage  invented  by  Peaucellier  in  1864, 
and  independently  by  Lipkin  around  the  same  time.  It  consists  of 
six  rods  joined  by  hinges,  the  four  short  ones  forming  a  rhombus. 
Point  O  is  fixed  in  space,  but  the  apparatus  is  free  to  rotate  about 
O.  Motion  at  P  is  transformed  into  a  different  motion  at  P'  (or  vice 
versa) . 

Geometrically,  the  linkage  is  a  mechanical  implementation  of 


Problem  58. 
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Problem  59. 


the  ancient  problem  of  inversion  in  a  circle.  Considering  the  case  in 
which  the  rhombus  is  folded  flat,  let  the  k  be  the  distance  from  O 
to  the  point  where  P  and  P7  coincide.  Form  the  circle  of  radius  k 
with  its  center  at  O.  As  P  and  P7  move  in  and  out,  points  on  the 
inside  of  the  circle  are  always  mapped  to  points  on  its  outside,  such 
that  rr'  =  k2.  That  is,  the  linkage  is  a  type  of  analog  computer 
that  exactly  solves  the  problem  of  finding  the  inverse  of  a  number 
r.  Inversion  in  a  circle  has  many  remarkable  geometrical  properties, 
discussed  in  H.S.M.  Coxeter,  Introduction  to  Geometry ,  Wiley,  1961. 
If  a  pen  is  inserted  through  a  hole  at  P,  and  P7  is  traced  over  a 
geometrical  figure,  the  Peaucellier  linkage  can  be  used  to  draw  a 
kind  of  image  of  the  figure. 

A  related  problem  is  the  construction  of  pictures,  like  the  one  in 
the  bottom  panel  of  the  figure,  called  anamorphs.  The  drawing  of 
the  column  on  the  paper  is  highly  distorted,  but  when  the  reflecting 
cylinder  is  placed  in  the  correct  spot  on  top  of  the  page,  an  undis¬ 
torted  image  is  produced  inside  the  cylinder.  (Wide-format  movie 
technologies  such  as  Cinemascope  are  based  on  similar  principles.) 

Show  that  the  Peaucellier  linkage  does  not  convert  correctly  be¬ 
tween  an  image  and  its  anamorph,  and  design  a  modified  version  of 
the  linkage  that  does.  Some  knowledge  of  analytic  geometry  will  be 
helpful.  fi£ 

59  The  figure  shows  a  lens  with  surfaces  that  are  curved,  but 

whose  thickness  is  constant  along  any  horizontal  line.  Use  the  lens- 
maker’s  equation  to  prove  that  this  “lens”  is  not  really  a  lens  at 
all.  >  Solution,  p.  982  ^ 

60  Under  ordinary  conditions,  gases  have  indices  of  refraction 

only  a  little  greater  than  that  of  vacuum,  i.e.,  n  =  1  +  e,  where  e  is 
some  small  number.  Suppose  that  a  ray  crosses  a  boundary  between 
a  region  of  vacuum  and  a  region  in  which  the  index  of  refraction  is 
1  +  e.  Find  the  maximum  angle  by  which  such  a  ray  can  ever  be 
deflected,  in  the  limit  of  small  e.  >  Hint,  p.  960 

61  A  converging  mirror  has  focal  length  /.  An  object  is  located 
at  a  distance  (1  +  e)f  from  the  mirror,  where  e  is  small.  Find  the 
distance  of  the  image  from  the  mirror,  simplifying  your  result  as 
much  as  possible  by  using  the  assumption  that  e  is  small. 

>  Answer,  p.  970  - 


Key  to  symbols: 

easy  -  typical  fi  challenging  ■  difficult  B  very  difficult 
V  An  answer  check  is  available  at  www.lightandmatter.com. 
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Exercises 

Exercise  12A:  Exploring  Images  With  a  Curved  Mirror 

Equipment: 

concave  mirrors  with  deep  curvature 
concave  mirrors  with  gentle  curvature 
convex  mirrors 

1.  Obtain  a  curved  mirror  from  your  instructor.  If  it  is  silvered  on  both  sides,  make  sure  you’re 
working  with  the  concave  side,  which  bends  light  rays  inward.  Look  at  your  own  face  in  the 
mirror.  Now  change  the  distance  between  your  face  and  the  mirror,  and  see  what  happens. 
Explore  the  full  range  of  possible  distances  between  your  face  and  the  mirror. 

In  these  observations  you’ve  been  changing  two  variables  at  once:  the  distance  between  the 
object  (your  face)  and  the  mirror,  and  the  distance  from  the  mirror  to  your  eye.  In  general, 
scientific  experiments  become  easier  to  interpret  if  we  practice  isolation  of  variables,  i.e.,  only 
change  one  variable  while  keeping  all  the  others  constant.  In  parts  2  and  3  you’ll  form  an  image 
of  an  object  that’s  not  your  face,  so  that  you  can  have  independent  control  of  the  object  distance 
and  the  point  of  view. 

2.  With  the  mirror  held  far  away  from  you,  observe  the  image  of  something  behind  you,  over 
your  shoulder.  Now  bring  your  eye  closer  and  closer  to  the  mirror.  Can  you  see  the  image  with 
your  eye  very  close  to  the  mirror?  See  if  you  can  explain  your  observation  by  drawing  a  ray 
diagram. 


>  turn  page 
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3.  Now  imagine  the  following  new  situation,  but  don’t  actually  do  it  yet.  Suppose  you  lay  the 
mirror  face-up  on  a  piece  of  tissue  paper,  put  your  finger  a  few  cm  above  the  mirror,  and  look 
at  the  image  of  your  finger.  As  in  part  2,  you  can  bring  your  eye  closer  and  closer  to  the  mirror. 

Will  you  be  able  to  see  the  image  with  your  eye  very  close  to  the  mirror?  Draw  a  ray  diagram 
to  help  you  predict  what  you  will  observe. 


Prediction: _ 

Now  test  your  prediction.  If  your  prediction  was  incorrect,  see  if  you  can  figure  out  what  went 
wrong,  or  ask  your  instructor  for  help. 

4.  For  parts  4  and  5,  it’s  more  convenient  to  use  concave  mirrors  that  are  more  gently  curved; 
obtain  one  from  your  instructor.  Lay  the  mirror  on  the  tissue  paper,  and  use  it  to  create  an 
image  of  the  overhead  lights  on  a  piece  of  paper  above  it  and  a  little  off  to  the  side.  What  do 
you  have  to  do  in  order  to  make  the  image  clear?  Can  you  explain  this  observation  using  a  ray 
diagram? 


>  turn  page 
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5.  Now  imagine  the  following  experiment,  but  don’t  do  it  yet.  What  will  happen  to  the  image 
on  the  paper  if  you  cover  half  of  the  mirror  with  your  hand? 

Prediction: _ 

Test  your  prediction.  If  your  prediction  was  incorrect,  can  you  explain  what  happened? 

6.  Now  imagine  forming  an  image  with  a  convex  mirror  (one  that  bulges  outward),  and  that 
therefore  bends  light  rays  away  from  the  central  axis  (i.e.,  is  diverging).  Draw  a  typical  ray 
diagram. 


Is  the  image  real,  or  virtual?  Will  there  be  more  than  one  type  of  image? 

Prediction: _ 

Test  your  prediction. 
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Exercise  12B:  Object  and  Image  Distances 

Equipment: 

optical  benches 
converging  mirrors 
illuminated  objects 

1.  Set  up  the  optical  bench  with  the  mirror  at  zero  on  the  centimeter  scale.  Set  up  the 
illuminated  object  on  the  bench  as  well. 

2.  Each  group  will  locate  the  image  for  their  own  value  of  the  object  distance,  by  finding  where 
a  piece  of  paper  has  to  be  placed  in  order  to  see  the  image  on  it.  (The  instructor  will  do  one 
point  as  well.)  Note  that  you  will  have  to  tilt  the  mirror  a  little  so  that  the  paper  on  which  you 
project  the  image  doesn’t  block  the  light  from  the  illuminated  object. 

Is  the  image  real  or  virtual?  How  do  you  know?  Is  it  inverted,  or  uninverted? 

Draw  a  ray  diagram. 


3.  Measure  the  image  distance  and  write  your  result  in  the  table  on  the  board.  Do  the  same  for 
the  magnification. 

4.  What  do  you  notice  about  the  trend  of  the  data  on  the  board?  Draw  a  second  ray  diagram 
with  a  different  object  distance,  and  show  why  this  makes  sense.  Some  tips  for  doing  this 
correctly:  (1)  For  simplicity,  use  the  point  on  the  object  that  is  on  the  mirror’s  axis.  (2)  You 
need  to  trace  two  rays  to  locate  the  image.  To  save  work,  don’t  just  do  two  rays  at  random 
angles.  You  can  either  use  the  on-axis  ray  as  one  ray,  or  do  two  rays  that  come  off  at  the  same 
angle,  one  above  and  one  below  the  axis.  (3)  Where  each  ray  hits  the  mirror,  draw  the  normal 
line,  and  make  sure  the  ray  is  at  equal  angles  on  both  sides  of  the  normal. 

5.  We  will  find  the  mirror’s  focal  length  from  the  instructor’s  data-point.  Then,  using  this  focal 
length,  calculate  a  theoretical  prediction  of  the  image  distance,  and  write  it  on  the  board  next 
to  the  experimentally  determined  image  distance. 
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Exercise  12C:  How  strong  are  your  glasses? 

This  exercise  was  created  by  Dan  Maclsaac. 

Equipment: 

eyeglasses 

diverging  lenses  for  students  who  don’t  wear  glasses,  or  who  use  glasses  with  converging 
lenses 

rulers  and  metersticks 
scratch  paper 
marking  pens 

Most  people  who  wear  glasses  have  glasses  whose  lenses  are  outbending,  which  allows  them  to 
focus  on  objects  far  away.  Such  a  lens  cannot  form  a  real  image,  so  its  focal  length  cannot  be 
measured  as  easily  as  that  of  a  converging  lens.  In  this  exercise  you  will  determine  the  focal 
length  of  your  own  glasses  by  taking  them  off,  holding  them  at  a  distance  from  your  face,  and 
looking  through  them  at  a  set  of  parallel  lines  on  a  piece  of  paper.  The  lines  will  be  reduced 
(the  lens’s  magnification  is  less  than  one),  and  by  adjusting  the  distance  between  the  lens  and 
the  paper,  you  can  make  the  magnification  equal  1/2  exactly,  so  that  two  spaces  between  lines 
as  seen  through  the  lens  fit  into  one  space  as  seen  simultaneously  to  the  side  of  the  lens.  This 
object  distance  can  be  used  in  order  to  find  the  focal  length  of  the  lens. 

1.  Use  a  marker  to  draw  three  evenly  spaced  parallel  lines  on  the  paper.  (A  spacing  of  a  few 
cm  works  well.) 

2.  Does  this  technique  really  measure  magnification  or  does  it  measure  angular  magnification? 
What  can  you  do  in  your  experiment  in  order  to  make  these  two  quantities  nearly  the  same,  so 
the  math  is  simpler? 

3.  Before  taking  any  numerical  data,  use  algebra  to  find  the  focal  length  of  the  lens  in  terms  of 
d0,  the  object  distance  that  results  in  a  magnification  of  1/2. 

4.  Measure  the  object  distance  that  results  in  a  magnification  of  1/2,  and  determine  the  focal 
length  of  your  lens. 
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Exercise  12D:  Double-Source  Interference 
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1.  Two  sources  separated  by  a  distance  d  =  2  cm  make  circular  ripples  with  a  wavelength  of 
A  =  1  cm.  On  a  piece  of  paper,  make  a  life-size  drawing  of  the  two  sources  in  the  default  setup, 
and  locate  the  following  points: 

A.  The  point  that  is  10  wavelengths  from  source  #1  and  10  wavelengths  from  source  #2. 

B.  The  point  that  is  10.5  wavelengths  from  #1  and  10.5  from  #2. 

C.  The  point  that  is  11  wavelengths  from  $T  and  11  from  #2. 

D.  The  point  that  is  10  wavelengths  from  $T  and  10.5  from  #2. 

E.  The  point  that  is  11  wavelengths  from  #1  and  11.5  from  #2. 

F.  The  point  that  is  10  wavelengths  from  #1  and  11  from  #2. 

G.  The  point  that  is  11  wavelengths  from  #1  and  12  from  #2. 

You  can  do  this  either  using  a  compass  or  by  putting  the  next  page  under  your  paper  and 
tracing.  It  is  not  necessary  to  trace  all  the  arcs  completely,  and  doing  so  is  unnecessarily  time- 
consuming;  you  can  fairly  easily  estimate  where  these  points  would  lie,  and  just  trace  arcs  long 
enough  to  find  the  relevant  intersections. 

What  do  these  points  correspond  to  in  the  real  wave  pattern? 

2.  Make  a  fresh  copy  of  your  drawing,  showing  only  point  F  and  the  two  sources,  which  form  a 
long,  skinny  triangle.  Now  suppose  you  were  to  change  the  setup  by  doubling  d.  while  leaving  A 
the  same.  It’s  easiest  to  understand  what’s  happening  on  the  drawing  if  you  move  both  sources 
outward,  keeping  the  center  fixed.  Based  on  your  drawing,  what  will  happen  to  the  position  of 
point  F  when  you  double  dl  How  has  the  angle  of  point  F  changed? 

3.  What  would  happen  if  you  doubled  both  A  and  d  compared  to  the  standard  setup? _ 

4.  Combining  the  ideas  from  parts  2  and  3,  what  do  you  think  would  happen  to  your  angles  if, 

starting  from  the  standard  setup,  you  doubled  A  while  leaving  d  the  same? _ 

5.  Suppose  A  was  a  millionth  of  a  centimeter,  while  d  was  still  as  in  the  standard  setup.  What 
would  happen  to  the  angles?  What  does  this  tell  you  about  observing  diffraction  of  light? 
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Exercise  12E:  Single-slit  diffraction 

Equipment: 

rulers 

computer  with  web  browser 

The  following  page  is  a  diagram  of  a  single  slit  and  a  screen  onto  which  its  diffraction  pattern 
is  projected.  The  class  will  make  a  numerical  prediction  of  the  intensity  of  the  pattern  at  the 
different  points  on  the  screen.  Each  group  will  be  responsible  for  calculating  the  intensity  at 
one  of  the  points.  (Either  11  groups  or  six  will  work  nicely  -  in  the  latter  case,  only  points  a, 
c,  e,  g,  i,  and  k  are  used.)  The  idea  is  to  break  up  the  wavefront  in  the  mouth  of  the  slit  into 
nine  parts,  each  of  which  is  assumed  to  radiate  semicircular  ripples  as  in  Huygens’  principle. 
The  wavelength  of  the  wave  is  1  cm,  and  we  assume  for  simplicity  that  each  set  of  ripples  has 
an  amplitude  of  1  unit  when  it  reaches  the  screen. 

1.  For  simplicity,  let’s  imagine  that  we  were  only  to  use  two  sets  of  ripples  rather  than  nine. 
You  could  measure  the  distance  from  each  of  the  two  points  inside  the  slit  to  your  point  on 
the  screen.  Suppose  the  distances  were  both  25.0  cm.  What  would  be  the  amplitude  of  the 
superimposed  waves  at  this  point  on  the  screen? 

Suppose  one  distance  was  24.0  cm  and  the  other  was  25.0  cm.  What  would  happen? 

What  if  one  was  24.0  cm  and  the  other  was  26.0  cm? 

What  if  one  was  24.5  cm  and  the  other  was  25.0  cm? 

In  general,  what  combinations  of  distances  will  lead  to  completely  destructive  and  completely 
constructive  interference? 

Can  you  estimate  the  answer  in  the  case  where  the  distances  are  24.7  and  25.0  cm? 

2.  Although  it  is  possible  to  calculate  mathematically  the  amplitude  of  the  sine  wave  that  results 
from  superimposing  two  sine  waves  with  an  arbitrary  phase  difference  between  them,  the  algebra 
is  rather  laborious,  and  it  become  even  more  tedious  when  we  have  more  than  two  waves  to  super¬ 
impose.  Instead,  one  can  simply  use  a  computer  spreadsheet  or  some  other  computer  program  to 
add  up  the  sine  waves  numerically  at  a  series  of  points  covering  one  complete  cycle.  This  is  what 
we  will  actually  do.  You  just  need  to  enter  the  relevant  data  into  the  computer,  then  examine  the 
results  and  pick  off  the  amplitude  from  the  resulting  list  of  numbers.  You  can  run  the  software 
through  a  web  interface  at  http://lightandmatter.com/cgi-bin/diffractionl.cgi. 

3.  Measure  all  nine  distances  to  your  group’s  point  on  the  screen,  and  write  them  on  the  board 
-  that  way  everyone  can  see  everyone  else’s  data,  and  the  class  can  try  to  make  sense  of  why  the 
results  came  out  the  way  they  did.  Determine  the  amplitude  of  the  combined  wave,  and  write 
it  on  the  board  as  well. 

The  class  will  discuss  why  the  results  came  out  the  way  they  did. 
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Exercise  12F:  Diffraction  of  Light 

Equipment: 

slit  patterns,  lasers,  straight-filament  bulbs 

station  1 

You  have  a  mask  with  a  bunch  of  different  double  slits  cut  out  of  it.  The  values  of  w  and  d  are 
as  follows: 

pattern  A  w=0.04  mm  d=.250  mm 

pattern  B  w=0.04  mm  d=.500  mm 

pattern  C  w=0.08  mm  d=.250  mm 

pattern  D  w=0.08  mm  d=.500  mm 

Predict  how  the  patterns  will  look  different,  and  test  your  prediction.  The  easiest  way  to  get 
the  laser  to  point  at  different  sets  of  slits  is  to  stick  folded  up  pieces  of  paper  in  one  side  or  the 
other  of  the  holders. 

station  2 

This  is  just  like  station  1,  but  with  single  slits: 

pattern  A  w=0.02  mm 

pattern  B  w=0.04  mm 

pattern  C  w=0.08  mm 

pattern  D  w=0.16  mm 

Predict  what  will  happen,  and  test  your  predictions.  If  you  have  time,  check  the  actual  numerical 
ratios  of  the  w  values  against  the  ratios  of  the  sizes  of  the  diffraction  patterns 

station  3 

This  is  like  station  1,  but  the  only  difference  among  the  sets  of  slits  is  how  many  slits  there  are: 

pattern  A  double  slit 
pattern  B  3  slits 
pattern  C  4  slits 
pattern  D  5  slits 

station  4 

Hold  the  diffraction  grating  up  to  your  eye,  and  look  through  it  at  the  straight-filament  light 
bulb.  If  you  orient  the  grating  correctly,  you  should  be  able  to  see  the  m  =  1  and  m  =  —  1 
diffraction  patterns  off  the  left  and  right.  If  you  have  it  oriented  the  wrong  way,  they’ll  be  above 
and  below  the  bulb  instead,  which  is  inconvenient  because  the  bulb’s  filament  is  vertical.  Where 
is  the  m  =  0  fringe?  Can  you  see  m  =  2,  etc.? 

Station  5  has  the  same  equipment  as  station  4 ■  If  you’re  assigned  to  station  5  first,  you  should 
actually  do  activity  4  first,  because  it’s  easier. 

station  5 

Use  the  transformer  to  increase  and  decrease  the  voltage  across  the  bulb.  This  allows  you  to 
control  the  filament’s  temperature.  Sketch  graphs  of  intensity  as  a  function  of  wavelength  for 
various  temperatures.  The  inability  of  the  wave  model  of  light  to  explain  the  mathematical 
shapes  of  these  curves  was  historically  one  of  the  reasons  for  creating  a  new  model,  in  which 
light  is  both  a  particle  and  a  wave. 
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Chapter  13 

Quantum  Physics 


13.1  Rules  of  Randomness 

Given  for  one  instant  an  intelligence  which  could  comprehend  all 
the  forces  by  which  nature  is  animated  and  the  respective  positions 
of  the  things  which  compose  it. ..nothing  would  be  uncertain,  and 
the  future  as  the  past  would  be  laid  out  before  its  eyes. 

Pierre  Simon  de  Laplace,  1776 

The  energy  produced  by  the  atom  is  a  very  poor  kind  of  thing. 
Anyone  who  expects  a  source  of  power  from  the  transformation  of 
these  atoms  is  talking  moonshine. 

Ernest  Rutherford,  1933 

The  Quantum  Mechanics  is  very  imposing.  But  an  inner  voice  tells 
me  that  it  is  still  not  the  final  truth.  The  theory  yields  much,  but 
it  hardly  brings  us  nearer  to  the  secret  of  the  Old  One.  In  any  case, 
I  am  convinced  that  He  does  not  play  dice. 

Albert  Einstein 

However  radical  Newton’s  clockwork  universe  seemed  to  his  con¬ 
temporaries,  by  the  early  twentieth  century  it  had  become  a  sort  of 
smugly  accepted  dogma.  Luckily  for  us,  this  deterministic  picture  of 
the  universe  breaks  down  at  the  atomic  level.  The  clearest  demon¬ 
stration  that  the  laws  of  physics  contain  elements  of  randomness 
is  in  the  behavior  of  radioactive  atoms.  Pick  two  identical  atoms 
of  a  radioactive  isotope,  say  the  naturally  occurring  uranium  238, 
and  watch  them  carefully.  They  will  decay  at  different  times,  even 
though  there  was  no  difference  in  their  initial  behavior. 

We  would  be  in  big  trouble  if  these  atoms’  behavior  was  as  pre¬ 
dictable  as  expected  in  the  Newtonian  world-view,  because  radioac¬ 
tivity  is  an  important  source  of  heat  for  our  planet.  In  reality,  each 
atom  chooses  a  random  moment  at  which  to  release  its  energy,  re¬ 
sulting  in  a  nice  steady  heating  effect.  The  earth  would  be  a  much 
colder  planet  if  only  sunlight  heated  it  and  not  radioactivity.  Prob¬ 
ably  there  would  be  no  volcanoes,  and  the  oceans  would  never  have 
been  liquid.  The  deep-sea  geothermal  vents  in  which  life  first  evolved 
would  never  have  existed.  But  there  would  be  an  even  worse  conse¬ 
quence  if  radioactivity  was  deterministic:  after  a  few  billion  years  of 
peace,  all  the  uranium  238  atoms  in  our  planet  would  presumably 
pick  the  same  moment  to  decay.  The  huge  amount  of  stored  nuclear 


a /In  1980,  the  continental 
U.S.  got  its  first  taste  of  active 
volcanism  in  recent  memory  with 
the  eruption  of  Mount  St.  Helens. 
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energy,  instead  of  being  spread  out  over  eons,  would  all  be  released 
at  one  instant,  blowing  our  whole  planet  to  Kingdom  Come.1 

The  new  version  of  physics,  incorporating  certain  kinds  of  ran¬ 
domness,  is  called  quantum  physics  (for  reasons  that  will  become 
clear  later).  It  represented  such  a  dramatic  break  with  the  pre¬ 
vious,  deterministic  tradition  that  everything  that  came  before  is 
considered  “classical,”  even  the  theory  of  relativity.  This  chapter  is 
a  basic  introduction  to  quantum  physics. 

Discussion  Question 

A  I  said  “Pick  two  identical  atoms  of  a  radioactive  isotope.”  Are  two 
atoms  really  identical?  If  their  electrons  are  orbiting  the  nucleus,  can  we 
distinguish  each  atom  by  the  particular  arrangement  of  its  electrons  at 
some  instant  in  time? 

13.1.1  Randomness  isn’t  random. 

Einstein’s  distaste  for  randomness,  and  his  association  of  deter¬ 
minism  with  divinity,  goes  back  to  the  Enlightenment  conception  of 
the  universe  as  a  gigantic  piece  of  clockwork  that  only  had  to  be  set 
in  motion  initially  by  the  Builder.  Many  of  the  founders  of  quan¬ 
tum  mechanics  were  interested  in  possible  links  between  physics  and 
Eastern  and  Western  religious  and  philosophical  thought,  but  every 
educated  person  has  a  different  concept  of  religion  and  philosophy. 
Bertrand  Russell  remarked,  “Sir  Arthur  Eddington  deduces  religion 
from  the  fact  that  atoms  do  not  obey  the  laws  of  mathematics.  Sir 
James  Jeans  deduces  it  from  the  fact  that  they  do.” 

Russell’s  witticism,  which  implies  incorrectly  that  mathemat¬ 
ics  cannot  describe  randomness,  remind  us  how  important  it  is  not 
to  oversimplify  this  question  of  randomness.  You  should  not  sim¬ 
ply  surmise,  “Well,  it’s  all  random,  anything  can  happen.”  For 
one  thing,  certain  things  simply  cannot  happen,  either  in  classical 
physics  or  quantum  physics.  The  conservation  laws  of  mass,  energy, 
momentum,  and  angular  momentum  are  still  valid,  so  for  instance 
processes  that  create  energy  out  of  nothing  are  not  just  unlikely 
according  to  quantum  physics,  they  are  impossible. 

A  useful  analogy  can  be  made  with  the  role  of  randomness  in 
evolution.  Darwin  was  not  the  first  biologist  to  suggest  that  species 
changed  over  long  periods  of  time.  His  two  new  fundamental  ideas 
were  that  (1)  the  changes  arose  through  random  genetic  variation, 
and  (2)  changes  that  enhanced  the  organism’s  ability  to  survive  and 
reproduce  would  be  preserved,  while  maladaptive  changes  would  be 

1This  is  under  the  assumption  that  all  the  uranium  atoms  were  created  at  the 
same  time.  In  reality,  we  have  only  a  general  idea  of  the  processes  that  might 
have  created  the  heavy  elements  in  the  nebula  from  which  our  solar  system 
condensed.  Some  portion  of  them  may  have  come  from  nuclear  reactions  in 
supernova  explosions  in  that  particular  nebula,  but  some  may  have  come  from 
previous  supernova  explosions  throughout  our  galaxy,  or  from  exotic  events  like 
collisions  of  white  dwarf  stars. 
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eliminated  by  natural  selection.  Doubters  of  evolution  often  consider 
only  the  first  point,  about  the  randomness  of  natural  variation,  but 
not  the  second  point,  about  the  systematic  action  of  natural  selec¬ 
tion.  They  make  statements  such  as,  “the  development  of  a  complex 
organism  like  Homo  sapiens  via  random  chance  would  be  like  a  whirl¬ 
wind  blowing  through  a  junkyard  and  spontaneously  assembling  a 
jumbo  jet  out  of  the  scrap  metal.”  The  flaw  in  this  type  of  reason¬ 
ing  is  that  it  ignores  the  deterministic  constraints  on  the  results  of 
random  processes.  For  an  atom  to  violate  conservation  of  energy  is 
no  more  likely  than  the  conquest  of  the  world  by  chimpanzees  next 
year. 

Discussion  Question 

A  Economists  often  behave  like  wannabe  physicists,  probably  because 
it  seems  prestigious  to  make  numerical  calculations  instead  of  talking 
about  human  relationships  and  organizations  like  other  social  scientists. 
Their  striving  to  make  economics  work  like  Newtonian  physics  extends 
to  a  parallel  use  of  mechanical  metaphors,  as  in  the  concept  of  a  mar¬ 
ket’s  supply  and  demand  acting  like  a  self-adjusting  machine,  and  the 
idealization  of  people  as  economic  automatons  who  consistently  strive  to 
maximize  their  own  wealth.  What  evidence  is  there  for  randomness  rather 
than  mechanical  determinism  in  economics? 

13.1.2  Calculating  randomness 

You  should  also  realize  that  even  if  something  is  random,  we 
can  still  understand  it,  and  we  can  still  calculate  probabilities  nu¬ 
merically.  In  other  words,  physicists  are  good  bookmakers.  A  good 
bookmaker  can  calculate  the  odds  that  a  horse  will  win  a  race  much 
more  accurately  that  an  inexperienced  one,  but  nevertheless  cannot 
predict  what  will  happen  in  any  particular  race. 

Statistical  independence 

As  an  illustration  of  a  general  technique  for  calculating  odds, 
suppose  you  are  playing  a  25-cent  slot  machine.  Each  of  the  three 
wheels  has  one  chance  in  ten  of  coming  up  with  a  cherry.  If  all 
three  wheels  come  up  cherries,  you  win  $100.  Even  though  the 
results  of  any  particular  trial  are  random,  you  can  make  certain 
quantitative  predictions.  First,  you  can  calculate  that  your  odds 
of  winning  on  any  given  trial  are  1/10  X  1/10  X  1/10  =  1/1000  = 
0.001.  Here,  I  am  representing  the  probabilities  as  numbers  from 
0  to  1,  which  is  clearer  than  statements  like  “The  odds  are  999  to 
1,”  and  makes  the  calculations  easier.  A  probability  of  0  represents 
something  impossible,  and  a  probability  of  1  represents  something 
that  will  definitely  happen. 

Also,  you  can  say  that  any  given  trial  is  equally  likely  to  result  in 
a  win,  and  it  doesn’t  matter  whether  you  have  won  or  lost  in  prior 
games.  Mathematically,  we  say  that  each  trial  is  statistically  inde¬ 
pendent,  or  that  separate  games  are  uncorrelated.  Most  gamblers 
are  mistakenly  convinced  that,  to  the  contrary,  games  of  chance  are 
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correlated.  If  they  have  been  playing  a  slot  machine  all  day,  they 
are  convinced  that  it  is  “getting  ready  to  pay,”  and  they  do  not 
want  anyone  else  playing  the  machine  and  “using  up”  the  jackpot 
that  they  “have  coming.”  In  other  words,  they  are  claiming  that 
a  series  of  trials  at  the  slot  machine  is  negatively  correlated,  that 
losing  now  makes  you  more  likely  to  win  later.  Craps  players  claim 
that  you  should  go  to  a  table  where  the  person  rolling  the  dice  is 
“hot,”  because  she  is  likely  to  keep  on  rolling  good  numbers.  Craps 
players,  then,  believe  that  rolls  of  the  dice  are  positively  correlated, 
that  winning  now  makes  you  more  likely  to  win  later. 

My  method  of  calculating  the  probability  of  winning  on  the  slot 
machine  was  an  example  of  the  following  important  rule  for  calcu¬ 
lations  based  on  independent  probabilities: 

the  law  of  independent  probabilities 

If  the  probability  of  one  event  happening  is  Pa,  and  the  prob¬ 
ability  of  a  second  statistically  independent  event  happening 
is  Pb,  then  the  probability  that  they  will  both  occur  is  the 
product  of  the  probabilities,  PaPb •  If  there  are  more  than 
two  events  involved,  you  simply  keep  on  multiplying. 

This  can  be  taken  as  the  definition  of  statistical  independence. 

Note  that  this  only  applies  to  independent  probabilities.  For 
instance,  if  you  have  a  nickel  and  a  dime  in  your  pocket,  and  you 
randomly  pull  one  out,  there  is  a  probability  of  0.5  that  it  will  be 
the  nickel.  If  you  then  replace  the  coin  and  again  pull  one  out 
randomly,  there  is  again  a  probability  of  0.5  of  coming  up  with  the 
nickel,  because  the  probabilities  are  independent.  Thus,  there  is  a 
probability  of  0.25  that  you  will  get  the  nickel  both  times. 

Suppose  instead  that  you  do  not  replace  the  first  coin  before 
pulling  out  the  second  one.  Then  you  are  bound  to  pull  out  the 
other  coin  the  second  time,  and  there  is  no  way  you  could  pull  the 
nickel  out  twice.  In  this  situation,  the  two  trials  are  not  indepen¬ 
dent,  because  the  result  of  the  first  trial  has  an  effect  on  the  second 
trial.  The  law  of  independent  probabilities  does  not  apply,  and  the 
probability  of  getting  the  nickel  twice  is  zero,  not  0.25. 

Experiments  have  shown  that  in  the  case  of  radioactive  decay, 
the  probability  that  any  nucleus  will  decay  during  a  given  time  in¬ 
terval  is  unaffected  by  what  is  happening  to  the  other  nuclei,  and 
is  also  unrelated  to  how  long  it  has  gone  without  decaying.  The 
first  observation  makes  sense,  because  nuclei  are  isolated  from  each 
other  at  the  centers  of  their  respective  atoms,  and  therefore  have  no 
physical  way  of  influencing  each  other.  The  second  fact  is  also  rea¬ 
sonable,  since  all  atoms  are  identical.  Suppose  we  wanted  to  believe 
that  certain  atoms  were  “extra  tough,”  as  demonstrated  by  their 
history  of  going  an  unusually  long  time  without  decaying.  Those 
atoms  would  have  to  be  different  in  some  physical  way,  but  nobody 
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has  ever  succeeded  in  detecting  differences  among  atoms.  There  is 
no  way  for  an  atom  to  be  changed  by  the  experiences  it  has  in  its 
lifetime. 


Addition  of  probabilities 

The  law  of  independent  probabilities  tells  us  to  use  multiplica¬ 
tion  to  calculate  the  probability  that  both  A  and  B  will  happen, 
assuming  the  probabilities  are  independent.  What  about  the  prob¬ 
ability  of  an  “or”  rather  than  an  “and”?  If  two  events  A  and  B 
are  mutually  exclusive,  then  the  probability  of  one  or  the  other  oc¬ 
curring  is  the  sum  Pa  +  Pb ■  For  instance,  a  bowler  might  have  a 
30%  chance  of  getting  a  strike  (knocking  down  all  ten  pins)  and  a 
20%  chance  of  knocking  down  nine  of  them.  The  bowler’s  chance  of 
knocking  down  either  nine  pins  or  ten  pins  is  therefore  50%. 

It  does  not  make  sense  to  add  probabilities  of  things  that  are 
not  mutually  exclusive,  i.e.,  that  could  both  happen.  Say  I  have  a 
90%  chance  of  eating  lunch  on  any  given  day,  and  a  90%  chance  of 
eating  dinner.  The  probability  that  I  will  eat  either  lunch  or  dinner 
is  not  180%. 


Normalization 

If  I  spin  a  globe  and  randomly  pick  a  point  on  it,  I  have  about  a 
70%  chance  of  picking  a  point  that’s  in  an  ocean  and  a  30%  chance 
of  picking  a  point  on  land.  The  probability  of  picking  either  wa¬ 
ter  or  land  is  70%  +  30%  =  100%.  Water  and  land  are  mutually 
exclusive,  and  there  are  no  other  possibilities,  so  the  probabilities 
had  to  add  up  to  100%.  It  works  the  same  if  there  are  more  than 
two  possibilities  —  if  you  can  classify  all  possible  outcomes  into  a 
list  of  mutually  exclusive  results,  then  all  the  probabilities  have  to 
add  up  to  1,  or  100%.  This  property  of  probabilities  is  known  as 
normalization. 

Averages 

Another  way  of  dealing  with  randomness  is  to  take  averages. 
The  casino  knows  that  in  the  long  run,  the  number  of  times  you  win 
will  approximately  equal  the  number  of  times  you  play  multiplied 
by  the  probability  of  winning.  In  the  slot-machine  game  described 
on  page  831,  where  the  probability  of  winning  is  0.001,  if  you  spend 
a  week  playing,  and  pay  $2500  to  play  10,000  times,  you  are  likely 
to  win  about  10  times  (10,  000  x  0.001  =  10),  and  collect  $1000.  On 
the  average,  the  casino  will  make  a  profit  of  $1500  from  you.  This 
is  an  example  of  the  following  rule. 

Rule  for  Calculating  Averages 

If  you  conduct  N  identical,  statistically  independent  trials, 
and  the  probability  of  success  in  each  trial  is  P,  then  on  the 
average,  the  total  number  of  successful  trials  will  be  NP.  If  N 
is  large  enough,  the  relative  error  in  this  estimate  will  become 


b  /  Normalization:  the  proba¬ 
bility  of  picking  land  plus  the 
probability  of  picking  water  adds 
up  to  1. 
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c  /  Why  are  dice  random? 


The  statement  that  the  rule  for  calculating  averages  gets  more 
and  more  accurate  for  larger  and  larger  fV(known  popularly  as  the 
“law  of  averages”)  often  provides  a  correspondence  principle  that 
connects  classical  and  quantum  physics.  For  instance,  the  amount 
of  power  produced  by  a  nuclear  power  plant  is  not  random  at  any 
detectable  level,  because  the  number  of  atoms  in  the  reactor  is  so 
large.  In  general,  random  behavior  at  the  atomic  level  tends  to 
average  out  when  we  consider  large  numbers  of  atoms,  which  is  why 
physics  seemed  deterministic  before  physicists  learned  techniques  for 
studying  atoms  individually. 

We  can  achieve  great  precision  with  averages  in  quantum  physics 
because  we  can  use  identical  atoms  to  reproduce  exactly  the  same 
situation  many  times.  If  we  were  betting  on  horses  or  dice,  we  would 
be  much  more  limited  in  our  precision.  After  a  thousand  races,  the 
horse  would  be  ready  to  retire.  After  a  million  rolls,  the  dice  would 
be  worn  out. 

self-check  A 

Which  of  the  following  things  must  be  independent,  which  could  be  in¬ 
dependent,  and  which  definitely  are  not  independent?  (1)  the  probabil¬ 
ity  of  successfully  making  two  free-throws  in  a  row  in  basketball;  (2)  the 
probability  that  it  will  rain  in  London  tomorrow  and  the  probability  that  it 
will  rain  on  the  same  day  in  a  certain  city  in  a  distant  galaxy;  (3)  your 
probability  of  dying  today  and  of  dying  tomorrow.  >  Answer,  p.  968 

Discussion  Questions 

A  Newtonian  physics  is  an  essentially  perfect  approximation  for  de¬ 
scribing  the  motion  of  a  pair  of  dice.  If  Newtonian  physics  is  deterministic, 
why  do  we  consider  the  result  of  rolling  dice  to  be  random? 

B  Why  isn’t  it  valid  to  define  randomness  by  saying  that  randomness 
is  when  all  the  outcomes  are  equally  likely? 

C  The  sequence  of  digits  1 21 21 21 21 21 21 21 21 2  seems  clearly  nonran¬ 
dom,  and  41592653589793  seems  random.  The  latter  sequence,  how¬ 
ever,  is  the  decimal  form  of  pi,  starting  with  the  third  digit.  There  is  a  story 
about  the  Indian  mathematician  Ramanujan,  a  self-taught  prodigy,  that  a 
friend  came  to  visit  him  in  a  cab,  and  remarked  that  the  number  of  the 
cab,  1729,  seemed  relatively  uninteresting.  Ramanujan  replied  that  on 
the  contrary,  it  was  very  interesting  because  it  was  the  smallest  number 
that  could  be  represented  in  two  different  ways  as  the  sum  of  two  cubes. 
The  Argentine  author  Jorge  Luis  Borges  wrote  a  short  story  called  “The 
Library  of  Babel,”  in  which  he  imagined  a  library  containing  every  book 
that  could  possibly  be  written  using  the  letters  of  the  alphabet.  It  would  in¬ 
clude  a  book  containing  only  the  repeated  letter  “a;”  all  the  ancient  Greek 
tragedies  known  today,  all  the  lost  Greek  tragedies,  and  millions  of  Greek 
tragedies  that  were  never  actually  written;  your  own  life  story,  and  various 
incorrect  versions  of  your  own  life  story;  and  countless  anthologies  con¬ 
taining  a  short  story  called  “The  Library  of  Babel.”  Of  course,  if  you  picked 
a  book  from  the  shelves  of  the  library,  it  would  almost  certainly  look  like  a 
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nonsensical  sequence  of  letters  and  punctuation,  but  it’s  always  possible 
that  the  seemingly  meaningless  book  would  be  a  science-fiction  screen¬ 
play  written  in  the  language  of  a  Neanderthal  tribe,  or  the  lyrics  to  a  set 
of  incomparably  beautiful  love  songs  written  in  a  language  that  never  ex¬ 
isted.  In  view  of  these  examples,  what  does  it  really  mean  to  say  that 
something  is  random? 

13.1.3  Probability  distributions 

So  far  we’ve  discussed  random  processes  having  only  two  possible 
outcomes:  yes  or  no,  win  or  lose,  on  or  off.  More  generally,  a  random 
process  could  have  a  result  that  is  a  number.  Some  processes  yield 
integers,  as  when  you  roll  a  die  and  get  a  result  from  one  to  six,  but 
some  are  not  restricted  to  whole  numbers,  for  example  the  number 
of  seconds  that  a  uranium-238  atom  will  exist  before  undergoing 
radioactive  decay. 

Consider  a  throw  of  a  die.  If  the  die  is  “honest,”  then  we  expect 
all  six  values  to  be  equally  likely.  Since  all  six  probabilities  must 
add  up  to  1,  then  probability  of  any  particular  value  coming  up 
must  be  1/6.  We  can  summarize  this  in  a  graph,  d.  Areas  under 
the  curve  can  be  interpreted  as  total  probabilities.  For  instance, 
the  area  under  the  curve  from  1  to  3  is  1/6  + 1/6  + 1/6  =  1/2,  so 
the  probability  of  getting  a  result  from  1  to  3  is  1/2.  The  function 
shown  on  the  graph  is  called  the  probability  distribution. 

Figure  e  shows  the  probabilities  of  various  results  obtained  by 
rolling  two  dice  and  adding  them  together,  as  in  the  game  of  craps. 
The  probabilities  are  not  all  the  same.  There  is  a  small  probability 
of  getting  a  two,  for  example,  because  there  is  only  one  way  to  do 
it,  by  rolling  a  one  and  then  another  one.  The  probability  of  rolling 
a  seven  is  high  because  there  are  six  different  ways  to  do  it:  1+6, 
2+5,  etc. 

If  the  number  of  possible  outcomes  is  large  but  finite,  for  example 
the  number  of  hairs  on  a  dog,  the  graph  would  start  to  look  like  a 
smooth  curve  rather  than  a  ziggurat. 

What  about  probability  distributions  for  random  numbers  that 
are  not  integers?  We  can  no  longer  make  a  graph  with  probability 
on  the  y  axis,  because  the  probability  of  getting  a  given  exact  num¬ 
ber  is  typically  zero.  For  instance,  there  is  zero  probability  that  a 
radioactive  atom  will  last  for  exactly  3  seconds,  since  there  is  are 
infinitely  many  possible  results  that  are  close  to  3  but  not  exactly 
three:  2.999999999999999996876876587658465436,  for  example.  It 
doesn’t  usually  make  sense,  therefore,  to  talk  about  the  probability 
of  a  single  numerical  result,  but  it  does  make  sense  to  talk  about 
the  probability  of  a  certain  range  of  results.  For  instance,  the  prob¬ 
ability  that  an  atom  will  last  more  than  3  and  less  than  4  seconds  is 
a  perfectly  reasonable  thing  to  discuss.  We  can  still  summarize  the 
probability  information  on  a  graph,  and  we  can  still  interpret  areas 
under  the  curve  as  probabilities. 


result 

d  /  Probability  distribution  for 
the  result  of  rolling  a  single  die. 


e  /  Rolling  two  dice  and  adding 
them  up. 
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f  /  A  probability  distribution 
for  height  of  human  adults  (not 
real  data). 


height  (cm) 

g  /  Example  1. 


h  /  The  average  of  a  proba¬ 
bility  distribution. 


But  the  y  axis  can  no  longer  be  a  unitless  probability  scale.  In 
radioactive  decay,  for  example,  we  want  the  x  axis  to  have  units  of 
time,  and  we  want  areas  under  the  curve  to  be  unitless  probabilities. 
The  area  of  a  single  square  on  the  graph  paper  is  then 

(unitless  area  of  a  square) 

=  (width  of  square  with  time  units) 
x  (height  of  square) . 

If  the  units  are  to  cancel  out,  then  the  height  of  the  square  must 
evidently  be  a  quantity  with  units  of  inverse  time.  In  other  words, 
the  y  axis  of  the  graph  is  to  be  interpreted  as  probability  per  unit 
time,  not  probability. 

Figure  f  shows  another  example,  a  probability  distribution  for 
people’s  height.  This  kind  of  bell-shaped  curve  is  quite  common. 

self-check  B 

Compare  the  number  of  people  with  heights  in  the  range  of  1 30-1 35  cm 
to  the  number  in  the  range  135-140.  >  Answer,  p.  969 

Looking  for  tall  basketball  players  example  1 

>  A  certain  country  with  a  large  population  wants  to  find  very  tall 
people  to  be  on  its  Olympic  basketball  team  and  strike  a  blow 
against  western  imperialism.  Out  of  a  pool  of  1 08  people  who  are 
the  right  age  and  gender,  how  many  are  they  likely  to  find  who  are 
over  225  cm  (7  feet  4  inches)  in  height?  Figure  g  gives  a  close-up 
of  the  “tail”  of  the  distribution  shown  previously  in  figure  f. 

>  The  shaded  area  under  the  curve  represents  the  probability  that 
a  given  person  is  tall  enough.  Each  rectangle  represents  a  prob¬ 
ability  of  0.2  x  10-7  cm-1  x  1  cm  =  2  x  10~8.  There  are  about  35 
rectangles  covered  by  the  shaded  area,  so  the  probability  of  hav¬ 
ing  a  height  greater  than  225  cm  is  7  x  1 0-7  ,  or  just  under  one  in 
a  million.  Using  the  rule  for  calculating  averages,  the  average,  or 
expected  number  of  people  this  tall  is  (1 08)  x  (7  x  1 0  7)  =  70. 

Average  and  width  of  a  probability  distribution 

If  the  next  Martian  you  meet  asks  you,  “How  tall  is  an  adult  hu¬ 
man?,”  you  will  probably  reply  with  a  statement  about  the  average 
human  height,  such  as  “Oh,  about  5  feet  6  inches.”  If  you  wanted 
to  explain  a  little  more,  you  could  say,  “But  that’s  only  an  average. 
Most  people  are  somewhere  between  5  feet  and  6  feet  tall.”  Without 
bothering  to  draw  the  relevant  bell  curve  for  your  new  extraterres¬ 
trial  acquaintance,  you’ve  summarized  the  relevant  information  by 
giving  an  average  and  a  typical  range  of  variation. 

The  average  of  a  probability  distribution  can  be  defined  geomet¬ 
rically  as  the  horizontal  position  at  which  it  could  be  balanced  if 
it  was  constructed  out  of  cardboard.  A  convenient  numerical  mea- 
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sure  of  the  amount  of  variation  about  the  average,  or  amount  of 
uncertainty,  is  the  full  width  at  half  maximum,  or  FWHM,  shown 
in  figure  i. 

A  great  deal  more  could  be  said  about  this  topic,  and  indeed 
an  introductory  statistics  course  could  spend  months  on  ways  of 
defining  the  center  and  width  of  a  distribution.  Rather  than  force- 
feeding  you  on  mathematical  detail  or  techniques  for  calculating 
these  things,  it  is  perhaps  more  relevant  to  point  out  simply  that 
there  are  various  ways  of  defining  them,  and  to  inoculate  you  against 
the  misuse  of  certain  definitions. 

The  average  is  not  the  only  possible  way  to  say  what  is  a  typical 
value  for  a  quantity  that  can  vary  randomly;  another  possible  defi¬ 
nition  is  the  median,  defined  as  the  value  that  is  exceeded  with  50% 
probability.  When  discussing  incomes  of  people  living  in  a  certain 
town,  the  average  could  be  very  misleading,  since  it  can  be  affected 
massively  if  a  single  resident  of  the  town  is  Bill  Gates.  Nor  is  the 
FWHM  the  only  possible  way  of  stating  the  amount  of  random  vari¬ 
ation;  another  possible  way  of  measuring  it  is  the  standard  deviation 
(defined  as  the  square  root  of  the  average  squared  deviation  from 
the  average  value). 

13.1.4  Exponential  decay  and  half-life 

Half-life 

Most  people  know  that  radioactivity  “lasts  a  certain  amount  of 
time,”  but  that  simple  statement  leaves  out  a  lot.  As  an  example, 
consider  the  following  medical  procedure  used  to  diagnose  thyroid 
function.  A  very  small  quantity  of  the  isotope  131 1,  produced  in  a 
nuclear  reactor,  is  fed  to  or  injected  into  the  patient.  The  body’s 
biochemical  systems  treat  this  artificial,  radioactive  isotope  exactly 
the  same  as  127I,  which  is  the  only  naturally  occurring  type.  (Nu¬ 
tritionally,  iodine  is  a  necessary  trace  element.  Iodine  taken  into 
the  body  is  partly  excreted,  but  the  rest  becomes  concentrated  in 
the  thyroid  gland.  Iodized  salt  has  had  iodine  added  to  it  to  pre¬ 
vent  the  nutritional  deficiency  known  as  goiters,  in  which  the  iodine- 
starved  thyroid  becomes  swollen.)  As  the  131I  undergoes  beta  decay, 
it  emits  electrons,  neutrinos,  and  gamma  rays.  The  gamma  rays  can 
be  measured  by  a  detector  passed  over  the  patient’s  body.  As  the 
radioactive  iodine  becomes  concentrated  in  the  thyroid,  the  amount 
of  gamma  radiation  coming  from  the  thyroid  becomes  greater,  and 
that  emitted  by  the  rest  of  the  body  is  reduced.  The  rate  at  which 
the  iodine  concentrates  in  the  thyroid  tells  the  doctor  about  the 
health  of  the  thyroid. 

If  you  ever  undergo  this  procedure,  someone  will  presumably 
explain  a  little  about  radioactivity  to  you,  to  allay  your  fears  that 
you  will  turn  into  the  Incredible  Hulk,  or  that  your  next  child  will 
have  an  unusual  number  of  limbs.  Since  iodine  stays  in  your  thyroid 


full  width  at  half 
maximum  (FWHM) 


i/The  full  width  at  half  maxi¬ 
mum  (FWHM)  of  a  probability 
distribution. 
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for  a  long  time  once  it  gets  there,  one  thing  you’ll  want  to  know  is 
whether  your  thyroid  is  going  to  become  radioactive  forever.  They 
may  just  tell  you  that  the  radioactivity  “only  lasts  a  certain  amount 
of  time,”  but  we  can  now  carry  out  a  quantitative  derivation  of  how 
the  radioactivity  really  will  die  out. 

Let  P.surv(t)  be  the  probability  that  an  iodine  atom  will  survive 
without  decaying  for  a  period  of  at  least  t.  It  has  been  experimen¬ 
tally  measured  that  half  all  131I  atoms  decay  in  8  hours,  so  we  have 

Psurv (  8  hr)  =  0.5. 

Now  using  the  law  of  independent  probabilities,  the  probability 
of  surviving  for  16  hours  equals  the  probability  of  surviving  for  the 
first  8  hours  multiplied  by  the  probability  of  surviving  for  the  second 
8  hours, 


Psurvi  16  hr)  =  0.50  x  0.50 
=  0.25. 


Similarly  we  have 


Psurv{ 24  hr)  =  0.50  x  0.5  x  0.5 
=  0.125. 

Generalizing  from  this  pattern,  the  probability  of  surviving  for  any 
time  t  that  is  a  multiple  of  8  hours  is 

Psurvit)  =  0.5^  hL 

We  now  know  how  to  find  the  probability  of  survival  at  intervals 
of  8  hours,  but  what  about  the  points  in  time  in  between?  What 
would  be  the  probability  of  surviving  for  4  hours?  Well,  using  the 
law  of  independent  probabilities  again,  we  have 

Psurv ( 8  hr)  —  Psurv 1 I  hr)  X  Psurv (I  hr), 

which  can  be  rearranged  to  give 

Psurv  (4  hr)  —  Psurv  ( 8  hr) 

=  \/o)5 

=  0.707. 

This  is  exactly  what  we  would  have  found  simply  by  plugging  in 
Psurv (t)  =  0.5t//8  hr  and  ignoring  the  restriction  to  multiples  of  8 
hours.  Since  8  hours  is  the  amount  of  time  required  for  half  of  the 
atoms  to  decay,  it  is  known  as  the  half-life,  written  y2  •  The  general 
rule  is  as  follows: 

Exponential  Decay  Equation 

Psurv(t)  =  0.5^  V2 
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Using  the  rule  for  calculating  averages,  we  can  also  find  the  num¬ 
ber  of  atoms,  N(t),  remaining  in  a  sample  at  time  t: 

N(t)  =  N( 0)  x  0.5t/fl/2 

Both  of  these  equations  have  graphs  that  look  like  dying-out  expo¬ 
nentials,  as  in  the  example  below. 

Radioactive  contamination  at  Chernobyl  example  2 

>  One  of  the  most  dangerous  radioactive  isotopes  released  by  the 
Chernobyl  disaster  in  1986  was  90Sr,  whose  half-life  is  28  years, 
(a)  How  long  will  it  be  before  the  contamination  is  reduced  to  one 
tenth  of  its  original  level?  (b)  If  a  total  of  1 027  atoms  was  released, 
about  how  long  would  it  be  before  not  a  single  atom  was  left? 

>  (a)  We  want  to  know  the  amount  of  time  that  a  90Sr  nucleus 
has  a  probability  of  0.1  of  surviving.  Starting  with  the  exponential 
decay  formula, 

Psurv  =  0.5f/fV2, 

we  want  to  solve  for  t.  Taking  natural  logarithms  of  both  sides, 

In  P  =  — —  In  0.5, 

h/2 

so 


Plugging  in  P  =  0.1  and  /2  =  28  years,  we  get  t  =  93  years. 

(b)  This  is  just  like  the  first  part,  but  P  =  10~27  .  The  result  is 
about  2500  years. 


14C  Dating  example  3 

Almost  all  the  carbon  on  Earth  is  12C,  but  not  quite.  The  isotope 
14C,  with  a  half-life  of  5600  years,  is  produced  by  cosmic  rays  in 
the  atmosphere.  It  decays  naturally,  but  is  replenished  at  such  a 
rate  that  the  fraction  of  14C  in  the  atmosphere  remains  constant, 
at  1.3  x  10~12  .  Living  plants  and  animals  take  in  both  12C  and 
14C  from  the  atmosphere  and  incorporate  both  into  their  bodies. 
Once  the  living  organism  dies,  it  no  longer  takes  in  C  atoms  from 
the  atmosphere,  and  the  proportion  of  14C  gradually  falls  off  as  it 
undergoes  radioactive  decay.  This  effect  can  be  used  to  find  the 
age  of  dead  organisms,  or  human  artifacts  made  from  plants  or 
animals.  Figure  j  on  page  840  shows  the  exponential  decay  curve 
of  14C  in  various  objects.  Similar  methods,  using  longer-lived  iso¬ 
topes,  provided  the  first  firm  proof  that  the  earth  was  billions  of 
years  old,  not  a  few  thousand  as  some  had  claimed  on  religious 
grounds. 
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j  /  Calibration  of  the  14C  dating  method  using  tree  rings  and  arti¬ 
facts  whose  ages  were  known  from  other  methods.  Redrawn  from  Emilio 
Segre,  Nuclei  and  Particles,  1965. 

Rate  of  decay 

If  you  want  to  find  how  many  radioactive  decays  occur  within  a 
time  interval  lasting  from  time  t  to  time  t  +  At,  the  most  straight¬ 
forward  approach  is  to  calculate  it  like  this: 

(number  of  decays  between  t  and  t  +  At) 

=  N(t)  -  N(t  +  At) 

Usually  we’re  interested  in  the  case  where  At  is  small  compared  to 
ti/2,  and  in  this  limiting  case  the  calculation  starts  to  look  exactly 
like  the  limit  that  goes  into  the  definition  of  the  derivative  d N /  dt.  It 
is  therefore  more  convenient  to  talk  about  the  rate  of  decay  —  dN/  dt 
rather  than  the  number  of  decays  in  some  finite  time  interval.  Doing 
calculus  on  the  function  ex  is  also  easier  than  with  0.5X,  so  we  rewrite 
the  function  N(t )  as 

N  =  N(0)e~t/T, 

where  r  =  fi/2/ hr  2  is  shown  in  example  6  on  p.  843  to  be  the 
average  time  of  survival.  The  rate  of  decay  is  then 

dIV  N(0)  t/T 
dt  T 
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Mathematically,  differentating  an  exponential  just  gives  back  an¬ 
other  exponential.  Physically,  this  is  telling  us  that  as  N  falls  off 
exponentially,  the  rate  of  decay  falls  off  at  the  same  exponential 
rate,  because  a  lower  N  means  fewer  atoms  that  remain  available  to 
decay. 

self-check  C 

Check  that  both  sides  of  the  equation  for  the  rate  of  decay  have  units  of 
s-1 ,  i.e.,  decays  per  unit  time.  >  Answer,  p.  969 

The  hot  potato  example  4 

>  A  nuclear  physicist  with  a  demented  sense  of  humor  tosses 
you  a  cigar  box,  yelling  “hot  potato.”  The  label  on  the  box  says 
“contains  1020  atoms  of  17F,  half-life  of  66  s,  produced  today  in 
our  reactor  at  1  p.m.”  It  takes  you  two  seconds  to  read  the  label, 
after  which  you  toss  it  behind  some  lead  bricks  and  run  away.  The 
time  is  1 :40  p.m.  Will  you  die? 

o  The  time  elapsed  since  the  radioactive  fluorine  was  produced 
in  the  reactor  was  40  minutes,  or  2400  s.  The  number  of  elapsed 
half-lives  is  therefore  t/ty2  =  36.  The  initial  number  of  atoms 
was  A/(0)  =  1020  .  The  number  of  decays  per  second  is  now 
about  1 07  s^1 ,  so  it  produced  about  2  x  1 07  high-energy  electrons 
while  you  held  it  in  your  hands.  Although  twenty  million  electrons 
sounds  like  a  lot,  it  is  not  really  enough  to  be  dangerous. 

By  the  way,  none  of  the  equations  we’ve  derived  so  far  was  the 
actual  probability  distribution  for  the  time  at  which  a  particular 
radioactive  atom  will  decay.  That  probability  distribution  would  be 
found  by  substituting  N( 0)  =  1  into  the  equation  for  the  rate  of 
decay. 

Discussion  Questions 

A  In  the  medical  procedure  involving  131 1,  why  is  it  the  gamma  rays 
that  are  detected,  not  the  electrons  or  neutrinos  that  are  also  emitted? 

B  For  1  s,  Fred  holds  in  his  hands  1  kg  of  radioactive  stuff  with  a 
half-life  of  1000  years.  Ginger  holds  1  kg  of  a  different  substance,  with  a 
half-life  of  1  min,  for  the  same  amount  of  time.  Did  they  place  themselves 
in  equal  danger,  or  not? 

C  How  would  you  interpret  it  if  you  calculated  N(t),  and  found  it  was 
less  than  one? 

D  Does  the  half-life  depend  on  how  much  of  the  substance  you  have? 
Does  the  expected  time  until  the  sample  decays  completely  depend  on 
how  much  of  the  substance  you  have? 

13.1.5  Applications  of  calculus 

The  area  under  the  probability  distribution  is  of  course  an  in¬ 
tegral.  If  we  call  the  random  number  x  and  the  probability  distri¬ 
bution  D(x),  then  the  probability  that  x  lies  in  a  certain  range  is 
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given  by 

[b 

(probability  of  a  <  x  <  b)  =  /  D{x)  d.x. 

J  a 

What  about  averages?  If  x  had  a  finite  number  of  equally  probable 
values,  we  would  simply  add  them  up  and  divide  by  how  many  we 
had.  If  they  weren’t  equally  likely,  we’d  make  the  weighted  average 
x\P\  +  X2P-2+- ••  But  we  need  to  generalize  this  to  a  variable  x  that 
can  take  on  any  of  a  continuum  of  values.  The  continuous  version 
of  a  sum  is  an  integral,  so  the  average  is 

(average  value  of  x)  =  J  xD{x)  dx. 

where  the  integral  is  over  all  possible  values  of  x. 

Probability  distribution  for  radioactive  decay  example  5 

Here  is  a  rigorous  justification  for  the  statement  in  subsection 
1 3.1 .4  that  the  probability  distribution  for  radioactive  decay  is  found 
by  substituting  /V(0)  =  1  into  the  equation  for  the  rate  of  decay.  We 
know  that  the  probability  distribution  must  be  of  the  form 

D{t)  =  /c0.5f/fV2, 


where  k  is  a  constant  that  we  need  to  determine.  The  atom  is 
guaranteed  to  decay  eventually,  so  normalization  gives  us 


(probability  of  0  <  t  <  oo)  =  1 

/»oo 

=  /  D(t)6t. 

Jo 

The  integral  is  most  easily  evaluated  by  converting  the  function 
into  an  exponential  with  e  as  the  base 


D(t)  =  kex  p 


In 


=  kex  p 


=  kexp 


■J—  In  0.5 
.  *1/2 


which  gives  an  integral  of  the  familiar  form  f  ecxdx  =  (1  /c)ecx. 
We  thus  have 


kt-\/2 
In  2 


exp 


which  gives  the  desired  result: 


k  = 


In  2 

U/2 
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Average  lifetime  example  6 

You  might  think  that  the  half-life  would  also  be  the  average  life¬ 
time  of  an  atom,  since  half  the  atoms’  lives  are  shorter  and  half 
longer.  But  the  half  whose  lives  are  longer  include  some  that  sur¬ 
vive  for  many  half-lives,  and  these  rare  long-lived  atoms  skew  the 
average.  We  can  calculate  the  average  lifetime  as  follows: 

roo 

(average  lifetime)  =  /  t  D(t)dt 
Jo 

Using  the  convenient  base-e  form  again,  we  have 

In  2  f°°  (  In  2  \ 

(average  lifetime)  =  - —  /  fexp  -- — t  d t. 

'1/2  Jo  V  '1/2  / 

This  integral  is  of  a  form  that  can  either  be  attacked  with  in¬ 
tegration  by  parts  or  by  looking  it  up  in  a  table.  The  result  is 
f  xecxdx  =  £ecx  -  ^ ecx ,  and  the  first  term  can  be  ignored  for 
our  purposes  because  it  equals  zero  at  both  limits  of  integration. 
We  end  up  with 

/  ■  x  In  2 

(average  lifetime)  =  - — 

'1/2 

f1/2 

"  TrT2 

=  1.443  tyz, 

which  is,  as  expected,  longer  than  one  half-life. 


Section  13.1  Rules  of  Randomness 


843 


k/ln  recent  decades,  a  huge  hole  in  the  ozone  layer  has  spread 
out  from  Antarctica.  Left:  November  1978.  Right:  November  1992 


13.2  Light  As  a  Particle 

The  only  thing  that  interferes  with  my  learning  is  my  education. 

Albert  Einstein 

Radioactivity  is  random,  but  do  the  laws  of  physics  exhibit  ran¬ 
domness  in  other  contexts  besides  radioactivity?  Yes.  Radioactive 
decay  was  just  a  good  playpen  to  get  us  started  with  concepts  of 
randomness,  because  all  atoms  of  a  given  isotope  are  identical.  By 
stocking  the  playpen  with  an  unlimited  supply  of  identical  atom- 
toys,  nature  helped  us  to  realize  that  their  future  behavior  could  be 
different  regardless  of  their  original  identicality.  We  are  now  ready 
to  leave  the  playpen,  and  see  how  randomness  fits  into  the  structure 
of  physics  at  the  most  fundamental  level. 

The  laws  of  physics  describe  light  and  matter,  and  the  quantum 
revolution  rewrote  both  descriptions.  Radioactivity  was  a  good  ex¬ 
ample  of  matter’s  behaving  in  a  way  that  was  inconsistent  with 
classical  physics,  but  if  we  want  to  get  under  the  hood  and  under¬ 
stand  how  nonclassical  things  happen,  it  will  be  easier  to  focus  on 
light  rather  than  matter.  A  radioactive  atom  such  as  uranium-235 
is  after  all  an  extremely  complex  system,  consisting  of  92  protons, 
143  neutrons,  and  92  electrons.  Light,  however,  can  be  a  simple  sine 
wave. 

However  successful  the  classical  wave  theory  of  light  had  been 
-  allowing  the  creation  of  radio  and  radar,  for  example  —  it  still 
failed  to  describe  many  important  phenomena.  An  example  that 
is  currently  of  great  interest  is  the  way  the  ozone  layer  protects  us 
from  the  dangerous  short-wavelength  ultraviolet  part  of  the  sun’s 
spectrum.  In  the  classical  description,  light  is  a  wave.  When  a  wave 
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passes  into  and  back  out  of  a  medium,  its  frequency  is  unchanged, 
and  although  its  wavelength  is  altered  while  it  is  in  the  medium, 
it  returns  to  its  original  value  when  the  wave  reemerges.  Luckily 
for  us,  this  is  not  at  all  what  ultraviolet  light  does  when  it  passes 
through  the  ozone  layer,  or  the  layer  would  offer  no  protection  at 
all! 

13.2.1  Evidence  for  light  as  a  particle 

For  a  long  time,  physicists  tried  to  explain  away  the  problems 
with  the  classical  theory  of  light  as  arising  from  an  imperfect  under¬ 
standing  of  atoms  and  the  interaction  of  light  with  individual  atoms 
and  molecules.  The  ozone  paradox,  for  example,  could  have  been 
attributed  to  the  incorrect  assumption  that  one  could  think  of  the 
ozone  layer  as  a  smooth,  continuous  substance,  when  in  reality  it 
was  made  of  individual  ozone  molecules.  It  wasn’t  until  1905  that 
Albert  Einstein  threw  down  the  gauntlet,  proposing  that  the  prob¬ 
lem  had  nothing  to  do  with  the  details  of  light’s  interaction  with 
atoms  and  everything  to  do  with  the  fundamental  nature  of  light 
itself. 


a  /  Digital  camera  images  of  dimmer  and  dimmer  sources  of  light. 
The  dots  are  records  of  individual  photons. 


In  those  days  the  data  were  sketchy,  the  ideas  vague,  and  the 
experiments  difficult  to  interpret;  it  took  a  genius  like  Einstein  to  cut 
through  the  thicket  of  confusion  and  find  a  simple  solution.  Today, 
however,  we  can  get  right  to  the  heart  of  the  matter  with  a  piece  of 
ordinary  consumer  electronics,  the  digital  camera.  Instead  of  film,  a 
digital  camera  has  a  computer  chip  with  its  surface  divided  up  into  a 
grid  of  light-sensitive  squares,  called  “pixels.”  Compared  to  a  grain 
of  the  silver  compound  used  to  make  regular  photographic  film,  a 
digital  camera  pixel  is  activated  by  an  amount  of  light  energy  orders 
of  magnitude  smaller.  We  can  learn  something  new  about  light  by 
using  a  digital  camera  to  detect  smaller  and  smaller  amounts  of 
light,  as  shown  in  figure  a.  Figure  a/1  is  fake,  but  a/2  and  a/3  are 
real  digital-camera  images  made  by  Prof.  Lyman  Page  of  Princeton 
University  as  a  classroom  demonstration.  Figure  a/1  is  what  we 
would  see  if  we  used  the  digital  camera  to  take  a  picture  of  a  fairly 
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b  /  A  wave  is  partially  absorbed. 


c  /  A  stream  of  particles  is 
partially  absorbed. 


d  /  Einstein  and  Seurat:  twins 
separated  at  birth?  Seine  Grande 
Jatte  by  Georges  Seurat  (19th 
century). 


dim  source  of  light.  In  figures  a/2  and  a/3,  the  intensity  of  the  light 
was  drastically  reduced  by  inserting  semitransparent  absorbers  like 
the  tinted  plastic  used  in  sunglasses.  Going  from  a/1  to  a/2  to  a/3, 
more  and  more  light  energy  is  being  thrown  away  by  the  absorbers. 

The  results  are  drastically  different  from  what  we  would  expect 
based  on  the  wave  theory  of  light.  If  light  was  a  wave  and  nothing 
but  a  wave,  b,  then  the  absorbers  would  simply  cut  down  the  wave’s 
amplitude  across  the  whole  wavefront.  The  digital  camera’s  entire 
chip  would  be  illuminated  uniformly,  and  weakening  the  wave  with 
an  absorber  would  just  mean  that  every  pixel  would  take  a  long  time 
to  soak  up  enough  energy  to  register  a  signal. 

But  figures  a/2  and  a/3  show  that  some  pixels  take  strong  hits 
while  others  pick  up  no  energy  at  all.  Instead  of  the  wave  picture, 
the  image  that  is  naturally  evoked  by  the  data  is  something  more 
like  a  hail  of  bullets  from  a  machine  gun,  c.  Each  “bullet”  of  light 
apparently  carries  only  a  tiny  amount  of  energy,  which  is  why  de¬ 
tecting  them  individually  requires  a  sensitive  digital  camera  rather 
than  an  eye  or  a  piece  of  him. 

Although  Einstein  was  interpreting  different  observations,  this 
is  the  conclusion  he  reached  in  his  1905  paper:  that  the  pure  wave 
theory  of  light  is  an  oversimplification,  and  that  the  energy  of  a  beam 
of  light  comes  in  finite  chunks  rather  than  being  spread  smoothly 
throughout  a  region  of  space. 

We  now  think  of  these  chunks  as  particles  of  light,  and  call  them 
“photons,”  although  Einstein  avoided  the  word  “particle,”  and  the 
word  “photon”  was  invented  later.  Regardless  of  words,  the  trou¬ 
ble  was  that  waves  and  particles  seemed  like  inconsistent  categories. 
The  reaction  to  Einstein’s  paper  could  be  kindly  described  as  vig¬ 
orously  skeptical.  Even  twenty  years  later,  Einstein  wrote,  “There 
are  therefore  now  two  theories  of  light,  both  indispensable,  and  — 
as  one  must  admit  today  despite  twenty  years  of  tremendous  effort 
on  the  part  of  theoretical  physicists  —  without  any  logical  connec¬ 
tion.”  In  the  remainder  of  this  chapter  we  will  learn  how  the  seeming 
paradox  was  eventually  resolved. 

Discussion  Questions 

A  Suppose  someone  rebuts  the  digital  camera  data  in  figure  a,  claim¬ 
ing  that  the  random  pattern  of  dots  occurs  not  because  of  anything  fun¬ 
damental  about  the  nature  of  light  but  simply  because  the  camera’s  pixels 
are  not  all  exactly  the  same  —  some  are  just  more  sensitive  than  others. 
How  could  we  test  this  interpretation? 

B  Discuss  how  the  correspondence  principle  applies  to  the  observa¬ 
tions  and  concepts  discussed  in  this  section. 

13.2.2  How  much  light  is  one  photon? 
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The  photoelectric  effect 

We  have  seen  evidence  that  light  energy  comes  in  little  chunks, 
so  the  next  question  to  be  asked  is  naturally  how  much  energy  is 
in  one  chunk.  The  most  straightforward  experimental  avenue  for 
addressing  this  question  is  a  phenomenon  known  as  the  photoelec¬ 
tric  effect.  The  photoelectric  effect  occurs  when  a  photon  strikes 
the  surface  of  a  solid  object  and  knocks  out  an  electron.  It  occurs 
continually  all  around  you.  It  is  happening  right  now  at  the  surface 
of  your  skin  and  on  the  paper  or  computer  screen  from  which  you 
are  reading  these  words.  It  does  not  ordinarily  lead  to  any  observ¬ 
able  electrical  effect,  however,  because  on  the  average  free  electrons 
are  wandering  back  in  just  as  frequently  as  they  are  being  ejected. 
(If  an  object  did  somehow  lose  a  significant  number  of  electrons, 
its  growing  net  positive  charge  would  begin  attracting  the  electrons 
back  more  and  more  strongly.) 

Figure  e  shows  a  practical  method  for  detecting  the  photoelec¬ 
tric  effect.  Two  very  clean  parallel  metal  plates  (the  electrodes  of  a 
capacitor)  are  sealed  inside  a  vacuum  tube,  and  only  one  plate  is  ex¬ 
posed  to  light.  Because  there  is  a  good  vacuum  between  the  plates, 
any  ejected  electron  that  happens  to  be  headed  in  the  right  direc¬ 
tion  will  almost  certainly  reach  the  other  capacitor  plate  without 
colliding  with  any  air  molecules. 

The  illuminated  (bottom)  plate  is  left  with  a  net  positive  charge, 
and  the  unilluminated  (top)  plate  acquires  a  negative  charge  from 
the  electrons  deposited  on  it.  There  is  thus  an  electric  field  between 
the  plates,  and  it  is  because  of  this  field  that  the  electrons’  paths  are 
curved,  as  shown  in  the  diagram.  However,  since  vacuum  is  a  good 
insulator,  any  electrons  that  reach  the  top  plate  are  prevented  from 
responding  to  the  electrical  attraction  by  jumping  back  across  the 
gap.  Instead  they  are  forced  to  make  their  way  around  the  circuit, 
passing  through  an  ammeter.  The  ammeter  allows  a  measurement 
of  the  strength  of  the  photoelectric  effect. 

An  unexpected  dependence  on  frequency 

The  photoelectric  effect  was  discovered  serendipitously  by  Hein¬ 
rich  Hertz  in  1887,  as  he  was  experimenting  with  radio  waves.  He 
was  not  particularly  interested  in  the  phenomenon,  but  he  did  notice 
that  the  effect  was  produced  strongly  by  ultraviolet  light  and  more 
weakly  by  lower  frequencies.  Light  whose  frequency  was  lower  than 
a  certain  critical  value  did  not  eject  any  electrons  at  all.  (In  fact 
this  was  all  prior  to  Thomson’s  discovery  of  the  electron,  so  Hertz 
would  not  have  described  the  effect  in  terms  of  electrons  —  we  are 
discussing  everything  with  the  benefit  of  hindsight.)  This  depen¬ 
dence  on  frequency  didn’t  make  any  sense  in  terms  of  the  classical 
wave  theory  of  light.  A  light  wave  consists  of  electric  and  magnetic 
fields.  The  stronger  the  fields,  i.e. ,  the  greater  the  wave’s  ampli¬ 
tude,  the  greater  the  forces  that  would  be  exerted  on  electrons  that 


e  /  Apparatus  for  observing 
the  photoelectric  effect.  A  beam 
of  light  strikes  a  capacitor  plate 
inside  a  vacuum  tube,  and  elec¬ 
trons  are  ejected  (black  arrows). 
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f  /  The  hamster  in  her  hamster 
ball  is  like  an  electron  emerging 
from  the  metal  (tiled  kitchen  floor) 
into  the  surrounding  vacuum 
(wood  floor).  The  wood  floor  is 
higher  than  the  tiled  floor,  so  as 
she  rolls  up  the  step,  the  hamster 
will  lose  a  certain  amount  of 
kinetic  energy,  analogous  to  Es. 
If  her  kinetic  energy  is  too  small, 
she  won’t  even  make  it  up  the 
step. 


found  themselves  bathed  in  the  light.  It  should  have  been  amplitude 
(brightness)  that  was  relevant,  not  frequency.  The  dependence  on 
frequency  not  only  proves  that  the  wave  model  of  light  needs  mod¬ 
ifying,  but  with  the  proper  interpretation  it  allows  us  to  determine 
how  much  energy  is  in  one  photon,  and  it  also  leads  to  a  connec¬ 
tion  between  the  wave  and  particle  models  that  we  need  in  order  to 
reconcile  them. 

To  make  any  progress,  we  need  to  consider  the  physical  process 
by  which  a  photon  would  eject  an  electron  from  the  metal  electrode. 
A  metal  contains  electrons  that  are  free  to  move  around.  Ordinarily, 
in  the  interior  of  the  metal,  such  an  electron  feels  attractive  forces 
from  atoms  in  every  direction  around  it.  The  forces  cancel  out.  But 
if  the  electron  happens  to  find  itself  at  the  surface  of  the  metal, 
the  attraction  from  the  interior  side  is  not  balanced  out  by  any 
attraction  from  outside.  In  popping  out  through  the  surface  the 
electron  therefore  loses  some  amount  of  energy  Es,  which  depends 
on  the  type  of  metal  used. 

Suppose  a  photon  strikes  an  electron,  annihilating  itself  and  giv¬ 
ing  up  all  its  energy  to  the  electron.  (We  now  know  that  this  is  what 
always  happens  in  the  photoelectric  effect,  although  it  had  not  yet 
been  established  in  1905  whether  or  not  the  photon  was  completely 
annihilated.)  The  electron  will  (1)  lose  kinetic  energy  through  colli¬ 
sions  with  other  electrons  as  it  plows  through  the  metal  on  its  way 
to  the  surface;  (2)  lose  an  amount  of  kinetic  energy  equal  to  Es  as 
it  emerges  through  the  surface;  and  (3)  lose  more  energy  on  its  way 
across  the  gap  between  the  plates,  due  to  the  electric  field  between 
the  plates.  Even  if  the  electron  happens  to  be  right  at  the  surface  of 
the  metal  when  it  absorbs  the  photon,  and  even  if  the  electric  field 
between  the  plates  has  not  yet  built  up  very  much,  Es  is  the  bare 
minimum  amount  of  energy  that  it  must  receive  from  the  photon 
if  it  is  to  contribute  to  a  measurable  current.  The  reason  for  using 
very  clean  electrodes  is  to  minimize  Es  and  make  it  have  a  definite 
value  characteristic  of  the  metal  surface,  not  a  mixture  of  values 
due  to  the  various  types  of  dirt  and  crud  that  are  present  in  tiny 
amounts  on  all  surfaces  in  everyday  life. 

We  can  now  interpret  the  frequency  dependence  of  the  photo¬ 
electric  effect  in  a  simple  way:  apparently  the  amount  of  energy 
possessed  by  a  photon  is  related  to  its  frequency.  A  low-frequency 
red  or  infrared  photon  has  an  energy  less  than  Es,  so  a  beam  of 
them  will  not  produce  any  current.  A  high-frequency  blue  or  violet 
photon,  on  the  other  hand,  packs  enough  of  a  punch  to  allow  an 
electron  to  make  it  to  the  other  plate.  At  frequencies  higher  than 
the  minimum,  the  photoelectric  current  continues  to  increase  with 
the  frequency  of  the  light  because  of  effects  (1)  and  (3). 
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Numerical  relationship  between  energy  and  frequency 

Prompted  by  Einstein’s  photon  paper,  Robert  Millikan  (whom 
we  first  encountered  in  chapter  8)  figured  out  how  to  use  the  pho¬ 
toelectric  effect  to  probe  precisely  the  link  between  frequency  and 
photon  energy.  Rather  than  going  into  the  historical  details  of  Mil¬ 
likan’s  actual  experiments  (a  lengthy  experimental  program  that 
occupied  a  large  part  of  his  professional  career)  we  will  describe  a 
simple  version,  shown  in  figure  g,  that  is  used  sometimes  in  college 
laboratory  courses.2  The  idea  is  simply  to  illuminate  one  plate  of 
the  vacuum  tube  with  light  of  a  single  wavelength  and  monitor  the 
voltage  difference  between  the  two  plates  as  they  charge  up.  Since 
the  resistance  of  a  voltmeter  is  very  high  (much  higher  than  the 
resistance  of  an  ammeter),  we  can  assume  to  a  good  approximation 
that  electrons  reaching  the  top  plate  are  stuck  there  permanently, 
so  the  voltage  will  keep  on  increasing  for  as  long  as  electrons  are 
making  it  across  the  vacuum  tube. 

At  a  moment  when  the  voltage  difference  has  a  reached  a  value 
AV,  the  minimum  energy  required  by  an  electron  to  make  it  out  of 
the  bottom  plate  and  across  the  gap  to  the  other  plate  is  Es  +  eAV. 
As  AV  increases,  we  eventually  reach  a  point  at  which  Es  +  eAV 
equals  the  energy  of  one  photon.  No  more  electrons  can  cross  the 
gap,  and  the  reading  on  the  voltmeter  stops  rising.  The  quantity 
Es+e  AV  now  tells  us  the  energy  of  one  photon.  If  we  determine  this 
energy  for  a  variety  of  wavelengths,  h,  we  find  the  following  simple 
relationship  between  the  energy  of  a  photon  and  the  frequency  of 
the  light: 


E  =  hf , 

where  h  is  a  constant  with  the  value  6.63  x  10-34  J  •  s.  Note  how 
the  equation  brings  the  wave  and  particle  models  of  light  under  the 
same  roof:  the  left  side  is  the  energy  of  one  particle  of  light,  while 
the  right  side  is  the  frequency  of  the  same  light,  interpreted  as  a 
wave.  The  constant  h  is  known  as  Planck’s  constant,  for  historical 
reasons  explained  in  the  footnote  beginning  on  the  preceding  page. 


2What  I’m  presenting  in  this  chapter  is  a  simplified  explanation  of  how  the 
photon  could  have  been  discovered.  The  actual  history  is  more  complex.  Max 
Planck  (1858-1947)  began  the  photon  saga  with  a  theoretical  investigation  of 
the  spectrum  of  light  emitted  by  a  hot,  glowing  object.  He  introduced  quanti¬ 
zation  of  the  energy  of  light  waves,  in  multiples  of  hf,  purely  as  a  mathematical 
trick  that  happened  to  produce  the  right  results.  Planck  did  not  believe  that 
his  procedure  could  have  any  physical  significance.  In  his  1905  paper  Einstein 
took  Planck’s  quantization  as  a  description  of  reality,  and  applied  it  to  various 
theoretical  and  experimental  puzzles,  including  the  photoelectric  effect.  Mil¬ 
likan  then  subjected  Einstein’s  ideas  to  a  series  of  rigorous  experimental  tests. 
Although  his  results  matched  Einstein’s  predictions  perfectly,  Millikan  was  skep¬ 
tical  about  photons,  and  his  papers  conspicuously  omit  any  reference  to  them. 
Only  in  his  autobiography  did  Millikan  rewrite  history  and  claim  that  he  had 
given  experimental  proof  for  photons. 


g  /  A  different  way  of  study¬ 
ing  the  photoelectric  effect. 


Es+e  AV 


f 

h/The  quantity  Es  +  eAV  in¬ 
dicates  the  energy  of  one  photon. 
It  is  found  to  be  proportional  to 
the  frequency  of  the  light. 
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self-check  D 


How  would  you  extract  h  from  the  graph  in  figure  h?  What  if  you  didn’t 
even  know  Es  in  advance,  and  could  only  graph  eAV  versus  f?  > 
Answer,  p.  969 

Since  the  energy  of  a  photon  is  hf,  a  beam  of  light  can  only  have 
energies  of  hf,  2 hf,  3hf,  etc.  Its  energy  is  quantized  —  there  is  no 
such  thing  as  a  fraction  of  a  photon.  Quantum  physics  gets  its  name 
from  the  fact  that  it  quantizes  quantities  like  energy,  momentum, 
and  angular  momentum  that  had  previously  been  thought  to  be 
smooth,  continuous  and  infinitely  divisible. 

Number  of  photons  emitted  by  a  lightbulb  per  second  example  7 

>  Roughly  how  many  photons  are  emitted  by  a  100-W  lightbulb  in 
1  second? 

>  People  tend  to  remember  wavelengths  rather  than  frequencies 
for  visible  light.  The  bulb  emits  photons  with  a  range  of  frequen¬ 
cies  and  wavelengths,  but  let’s  take  600  nm  as  a  typical  wave¬ 
length  for  purposes  of  estimation.  The  energy  of  a  single  photon 
is 


Ephoton  —  hf 


=  hc/k 


A  power  of  100  W  means  100  joules  per  second,  so  the  number 
of  photons  is 


(100  J)/ Ephoton  =  (100  J)/(/jc/A) 

«3x  1020 

This  hugeness  of  this  number  is  consistent  with  the  correspon¬ 
dence  principle.  The  experiments  that  established  the  classical 
theory  of  optics  weren’t  wrong.  They  were  right,  within  their  do¬ 
main  of  applicability,  in  which  the  number  of  photons  was  so  large 
as  to  be  indistinguishable  from  a  continuous  beam. 

Measuring  the  wave  example  8 

When  surfers  are  out  on  the  water  waiting  for  their  chance  to 
catch  a  wave,  they’re  interested  in  both  the  height  of  the  waves 
and  when  the  waves  are  going  to  arrive.  In  other  words,  they  ob¬ 
serve  both  the  amplitude  and  phase  of  the  waves,  and  it  doesn’t 
matter  to  them  that  the  water  is  granular  at  the  molecular  level. 
The  correspondence  principle  requires  that  we  be  able  to  do  the 
same  thing  for  electromagnetic  waves,  since  the  classical  theory 
of  electricity  and  magnetism  was  all  stated  and  verified  experi¬ 
mentally  in  terms  of  the  fields  E  and  B,  which  are  the  amplitude 
of  an  electromagnetic  wave.  The  phase  is  also  necessary,  since 
the  induction  effects  predicted  by  Maxwell’s  equation  would  flip 
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their  signs  depending  on  whether  an  oscillating  field  is  on  its  way 
up  or  on  its  way  back  down. 

This  is  a  more  demanding  application  of  the  correspondence  prin¬ 
ciple  than  the  one  in  example  7,  since  amplitudes  and  phases 
constitute  more  detailed  information  than  the  over-all  intensity  of 
a  beam  of  light.  Eyeball  measurements  can’t  detect  this  type  of  in¬ 
formation,  since  the  eye  is  much  bigger  than  a  wavelength,  but  for 
example  an  AM  radio  receiver  can  do  it  with  radio  waves,  since 
the  wavelength  for  a  station  at  1000  kHz  is  about  300  meters, 
which  is  much  larger  than  the  antenna.  The  correspondence  prin¬ 
ciple  demands  that  we  be  able  to  explain  this  in  terms  of  the  pho¬ 
ton  theory,  and  this  requires  not  just  that  we  have  a  large  number 
of  photons  emitted  by  the  transmitter  per  second,  as  in  example  7, 
but  that  even  by  the  time  they  spread  out  and  reach  the  receiving 
antenna,  there  should  be  many  photons  overlapping  each  other 
within  a  space  of  one  cubic  wavelength.  Problem  47  on  p.  942 
verifies  that  the  number  is  in  fact  extremely  large. 

Momentum  of  a  photon  example  9 

>  According  to  the  theory  of  relativity,  the  momentum  of  a  beam 
of  light  is  given  by  p  =  E/c.  Apply  this  to  find  the  momentum 
of  a  single  photon  in  terms  of  its  frequency,  and  in  terms  of  its 
wavelength. 

>  Combining  the  equations  p  =  E/c  and  E  =  hf,  we  find 

P  =  E/c 


To  reexpress  this  in  terms  of  wavelength,  we  use  c  =  fX\ 

h  c 

P=c- A 
_  h 

~  A 

The  second  form  turns  out  to  be  simpler. 

Discussion  Questions 

A  The  photoelectric  effect  only  ever  ejects  a  very  tiny  percentage  of 
the  electrons  available  near  the  surface  of  an  object.  How  well  does  this 
agree  with  the  wave  model  of  light,  and  how  well  with  the  particle  model? 
Consider  the  two  different  distance  scales  involved:  the  wavelength  of  the 
light,  and  the  size  of  an  atom,  which  is  on  the  order  of  1 CT10  or  1 0-9  m. 

B  What  is  the  significance  of  the  fact  that  Planck’s  constant  is  numeri¬ 
cally  very  small?  How  would  our  everyday  experience  of  light  be  different 
if  it  was  not  so  small? 

C  How  would  the  experiments  described  above  be  affected  if  a  single 
electron  was  likely  to  get  hit  by  more  than  one  photon? 

D  Draw  some  representative  trajectories  of  electrons  for  AV  =  0,  A\/ 
less  than  the  maximum  value,  and  A  V  greater  than  the  maximum  value. 


Section  13.2  Light  As  a  Particle 


851 


E  Explain  based  on  the  photon  theory  of  light  why  ultraviolet  light  would 
be  more  likely  than  visible  or  infrared  light  to  cause  cancer  by  damaging 
DNA  molecules.  How  does  this  relate  to  discussion  question  C? 

F  Does  E  =  hf  imply  that  a  photon  changes  its  energy  when  it  passes 
from  one  transparent  material  into  another  substance  with  a  different  in¬ 
dex  of  refraction? 

13.2.3  Wave-particle  duality 

How  can  light  be  both  a  particle  and  a  wave?  We  are  now 
ready  to  resolve  this  seeming  contradiction.  Often  in  science  when 
something  seems  paradoxical,  it’s  because  we  (1)  don’t  define  our 
terms  carefully,  or  (2)  don’t  test  our  ideas  against  any  specific  real- 
world  situation.  Let’s  define  particles  and  waves  as  follows: 

•  Waves  exhibit  superposition,  and  specifically  interference  phe¬ 
nomena. 

•  Particles  can  only  exist  in  whole  numbers,  not  fractions. 

As  a  real-world  check  on  our  philosophizing,  there  is  one  partic¬ 
ular  experiment  that  works  perfectly.  We  set  up  a  double-slit  inter¬ 
ference  experiment  that  we  know  will  produce  a  diffraction  pattern 
if  light  is  an  honest-to- goodness  wave,  but  we  detect  the  light  with 
a  detector  that  is  capable  of  sensing  individual  photons,  e.g.,  a  dig¬ 
ital  camera.  To  make  it  possible  to  pick  out  individual  dots  due  to 
individual  photons,  we  must  use  Liters  to  cut  down  the  intensity  of 
the  light  to  a  very  low  level,  just  as  in  the  photos  by  Prof.  Page 
on  p.  845.  The  whole  thing  is  sealed  inside  a  light-tight  box.  The 
results  are  shown  in  figure  i.  (In  fact,  the  similar  figures  in  on  page 
845  are  simply  cutouts  from  these  figures.) 


i  /  Wave  interference  patterns 
photographed  by  Prof.  Lyman 
Page  with  a  digital  camera.  Laser 
light  with  a  single  well-defined 
wavelength  passed  through  a 
series  of  absorbers  to  cut  down 
its  intensity,  then  through  a  set  of 
slits  to  produce  interference,  and 
finally  into  a  digital  camera  chip. 
(A  triple  slit  was  actually  used, 
but  for  conceptual  simplicity  we 
discuss  the  results  in  the  main 
text  as  if  it  was  a  double  slit.)  In 
panel  2  the  intensity  has  been 
reduced  relative  to  1,  and  even 
more  so  for  panel  3. 


Neither  the  pure  wave  theory  nor  the  pure  particle  theory  can 
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explain  the  results.  If  light  was  only  a  particle  and  not  a  wave,  there 
would  be  no  interference  effect.  The  result  of  the  experiment  would 
be  like  firing  a  hail  of  bullets  through  a  double  slit,  j.  Only  two 
spots  directly  behind  the  slits  would  be  hit. 

If,  on  the  other  hand,  light  was  only  a  wave  and  not  a  particle, 
we  would  get  the  same  kind  of  diffraction  pattern  that  would  happen 
with  a  water  wave,  k.  There  would  be  no  discrete  dots  in  the  photo, 
only  a  diffraction  pattern  that  shaded  smoothly  between  light  and 
dark. 

Applying  the  definitions  to  this  experiment,  light  must  be  both 
a  particle  and  a  wave.  It  is  a  wave  because  it  exhibits  interference 
effects.  At  the  same  time,  the  fact  that  the  photographs  contain 
discrete  dots  is  a  direct  demonstration  that  light  refuses  to  be  split 
into  units  of  less  than  a  single  photon.  There  can  only  be  whole 
numbers  of  photons:  four  photons  in  figure  i/3,  for  example. 

A  wrong  interpretation:  photons  interfering  with  each  other 

One  possible  interpretation  of  wave-particle  duality  that  occurred 
to  physicists  early  in  the  game  was  that  perhaps  the  interference  ef¬ 
fects  came  from  photons  interacting  with  each  other.  By  analogy,  a 
water  wave  consists  of  moving  water  molecules,  and  interference  of 
water  waves  results  ultimately  from  all  the  mutual  pushes  and  pulls 
of  the  molecules.  This  interpretation  was  conclusively  disproved  by 
G.I.  Taylor,  a  student  at  Cambridge.  The  demonstration  by  Prof. 
Page  that  we’ve  just  been  discussing  is  essentially  a  modernized 
version  of  Taylor’s  work.  Taylor  reasoned  that  if  interference  effects 
came  from  photons  interacting  with  each  other,  a  bare  minimum  of 
two  photons  would  have  to  be  present  at  the  same  time  to  produce 
interference.  By  making  the  light  source  extremely  dim,  we  can  be 
virtually  certain  that  there  are  never  two  photons  in  the  box  at  the 
same  time.  In  figure  i/3,  however,  the  intensity  of  the  light  has 
been  cut  down  so  much  by  the  absorbers  that  if  it  was  in  the  open, 
the  average  separation  between  photons  would  be  on  the  order  of 
a  kilometer!  At  any  given  moment,  the  number  of  photons  in  the 
box  is  most  likely  to  be  zero.  It  is  virtually  certain  that  there  were 
never  two  photons  in  the  box  at  once. 

The  concept  of  a  photon ’s  path  is  undefined. 

If  a  single  photon  can  demonstrate  double-slit  interference,  then 
which  slit  did  it  pass  through?  The  unavoidable  answer  must  be  that 
it  passes  through  both!  This  might  not  seem  so  strange  if  we  think 
of  the  photon  as  a  wave,  but  it  is  highly  counterintuitive  if  we  try 
to  visualize  it  as  a  particle.  The  moral  is  that  we  should  not  think 
in  terms  of  the  path  of  a  photon.  Like  the  fully  human  and  fully 
divine  Jesus  of  Christian  theology,  a  photon  is  supposed  to  be  100% 
wave  and  100%  particle.  If  a  photon  had  a  well  defined  path,  then  it 
would  not  demonstrate  wave  superposition  and  interference  effects, 
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A,  I 


j  /  Bullets  pass  through  a  double 
slit. 


k  /  A  water  wave  passes  through 
a  double  slit. 


I/A  single  photon  can  go 
through  both  slits. 


contradicting  its  wave  nature.  (In  subsection  13.3.4  we  will  discuss 
the  Heisenberg  uncertainty  principle,  which  gives  a  numerical  way 
of  approaching  this  issue.) 

Another  wrong  interpretation:  the  pilot  wave  hypothesis 

A  second  possible  explanation  of  wave-particle  duality  was  taken 
seriously  in  the  early  history  of  quantum  mechanics.  What  if  the 
photon  particle  is  like  a  surfer  riding  on  top  of  its  accompanying 
wave ?  As  the  wave  travels  along,  the  particle  is  pushed,  or  “piloted” 
by  it.  Imagining  the  particle  and  the  wave  as  two  separate  entities 
allows  us  to  avoid  the  seemingly  paradoxical  idea  that  a  photon  is 
both  at  once.  The  wave  happily  does  its  wave  tricks,  like  super¬ 
position  and  interference,  and  the  particle  acts  like  a  respectable 
particle,  resolutely  refusing  to  be  in  two  different  places  at  once.  If 
the  wave,  for  instance,  undergoes  destructive  interference,  becoming 
nearly  zero  in  a  particular  region  of  space,  then  the  particle  simply 
is  not  guided  into  that  region. 

The  problem  with  the  pilot  wave  interpretation  is  that  the  only 
way  it  can  be  experimentally  tested  or  verified  is  if  someone  manages 
to  detach  the  particle  from  the  wave,  and  show  that  there  really  are 
two  entities  involved,  not  just  one.  Part  of  the  scientific  method  is 
that  hypotheses  are  supposed  to  be  experimentally  testable.  Since 
nobody  has  ever  managed  to  separate  the  wavelike  part  of  a  photon 
from  the  particle  part,  the  interpretation  is  not  useful  or  meaningful 
in  a  scientific  sense. 

The  probability  interpretation 

The  correct  interpretation  of  wave-particle  duality  is  suggested 
by  the  random  nature  of  the  experiment  we’ve  been  discussing:  even 
though  every  photon  wave/particle  is  prepared  and  released  in  the 
same  way,  the  location  at  which  it  is  eventually  detected  by  the 
digital  camera  is  different  every  time.  The  idea  of  the  probability 
interpretation  of  wave-particle  duality  is  that  the  location  of  the 
photon-particle  is  random,  but  the  probability  that  it  is  in  a  certain 
location  is  higher  where  the  photon-wave’s  amplitude  is  greater. 

More  specifically,  the  probability  distribution  of  the  particle  must 
be  proportional  to  the  square  of  the  wave’s  amplitude, 

(probability  distribution)  oc  (amplitude)2. 

This  follows  from  the  correspondence  principle  and  from  the  fact 
that  a  wave’s  energy  density  is  proportional  to  the  square  of  its  am¬ 
plitude.  If  we  run  the  double-slit  experiment  for  a  long  enough  time, 
the  pattern  of  dots  fills  in  and  becomes  very  smooth  as  would  have 
been  expected  in  classical  physics.  To  preserve  the  correspondence 
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between  classical  and  quantum  physics,  the  amount  of  energy  de¬ 
posited  in  a  given  region  of  the  picture  over  the  long  run  must  be 
proportional  to  the  square  of  the  wave’s  amplitude.  The  amount  of 
energy  deposited  in  a  certain  area  depends  on  the  number  of  pho¬ 
tons  picked  up,  which  is  proportional  to  the  probability  of  finding 
any  given  photon  there. 

A  microwave  oven  example  10 

>  The  figure  shows  two-dimensional  (top)  and  one-dimensional 
(bottom)  representations  of  the  standing  wave  inside  a  microwave 
oven.  Gray  represents  zero  field,  and  white  and  black  signify  the 
strongest  fields,  with  white  being  a  field  that  is  in  the  opposite  di¬ 
rection  compared  to  black.  Compare  the  probabilities  of  detecting 
a  microwave  photon  at  points  A,  B,  and  C. 

>  A  and  C  are  both  extremes  of  the  wave,  so  the  probabilities  of 
detecting  a  photon  at  A  and  C  are  equal.  It  doesn’t  matter  that  we 
have  represented  C  as  negative  and  A  as  positive,  because  it  is 
the  square  of  the  amplitude  that  is  relevant.  The  amplitude  at  B  is 
about  1/2  as  much  as  the  others,  so  the  probability  of  detecting  a 
photon  there  is  about  1/4  as  much. 

The  probability  interpretation  was  disturbing  to  physicists  who 
had  spent  their  previous  careers  working  in  the  deterministic  world 
of  classical  physics,  and  ironically  the  most  strenuous  objections 
against  it  were  raised  by  Einstein,  who  had  invented  the  photon 
concept  in  the  first  place.  The  probability  interpretation  has  never¬ 
theless  passed  every  experimental  test,  and  is  now  as  well  established 
as  any  part  of  physics. 

An  aspect  of  the  probability  interpretation  that  has  made  many 
people  uneasy  is  that  the  process  of  detecting  and  recording  the 
photon’s  position  seems  to  have  a  magical  ability  to  get  rid  of  the 
wavelike  side  of  the  photon’s  personality  and  force  it  to  decide  for 
once  and  for  all  where  it  really  wants  to  be.  But  detection  or  mea¬ 
surement  is  after  all  only  a  physical  process  like  any  other,  governed 
by  the  same  laws  of  physics.  We  will  postpone  a  detailed  discus¬ 
sion  of  this  issue  until  p.  872,  since  a  measuring  device  like  a  digital 
camera  is  made  of  matter,  but  we  have  so  far  only  discussed  how 
quantum  mechanics  relates  to  light. 

What  is  the  proportionality  constant?  example  1 1 

>  What  is  the  proportionality  constant  that  would  make  an  actual 
equation  out  of  (probability  distribution)  oc  (amplitude)2? 

>  The  probability  that  the  photon  is  in  a  certain  small  region  of 
volume  v  should  equal  the  fraction  of  the  wave’s  energy  that  is 
within  that  volume.  For  a  sinusoidal  wave,  which  has  a  single, 
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well-defined  frequency  f,  this  gives 


energy  in  volume  v 
energy  of  photon 
energy  in  volume  v 
=  hf 


We  assume  v  is  small  enough  so  that  the  electric  and  magnetic 
fields  are  nearly  constant  throughout  it.  We  then  have 


P  = 


v 


We  can  simplify  this  formidable  looking  expression  by  recognizing 
that  in  a  plane  wave,  |E|  and  |Bj  are  related  by  |E|  =  c|B|.  This 
implies  (problem  40,  p.  732),  that  the  electric  and  magnetic  fields 
each  contribute  half  the  total  energy,  so  we  can  simplify  the  result 
to 


P  =  2 


'  i  | 
k  8nk  I 


Ankhf 


hf 

'El2. 


The  probability  is  proportional  to  the  square  of  the  wave’s  ampli¬ 
tude,  as  advertised.3 


3 But  note  that  along  the  way,  we  had  to  make  two  crucial  assumptions:  that 
the  wave  was  sinusoidal,  and  that  it  was  a  plane  wave.  These  assumptions  will 
not  prevent  us  from  describing  examples  such  as  double-slit  diffraction,  in  which 
the  wave  is  approximately  sinusoidal  within  some  sufficiently  small  region  such 
as  one  pixel  of  a  camera’s  imaging  chip.  Nevertheless,  these  issues  turn  out  to 
be  symptoms  of  deeper  problems,  beyond  the  scope  of  this  book,  involving  the 
way  in  which  relativity  and  quantum  mechanics  should  be  combined.  As  a  taste 
of  the  ideas  involved,  consider  what  happens  when  a  photon  is  reflected  from 
a  conducting  surface,  as  in  example  23  on  p.  705,  so  that  the  electric  field  at 
the  surface  is  zero,  but  the  magnetic  field  isn’t.  The  superposition  is  a  standing 
wave,  not  a  plane  wave,  so  |Ej  =  c|B|  need  not  hold,  and  doesn’t.  A  detector’s 
probability  of  detecting  a  photon  near  the  surface  could  be  zero  if  the  detector 
sensed  electric  fields,  but  nonzero  if  it  sensed  magnetism.  It  doesn’t  make  sense 
to  say  that  either  of  these  is  the  probability  that  the  photon  “was  really  there.” 
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Discussion  Questions 


A  Referring  back  to  the  example  of  the  carrot  in  the  microwave  oven, 
show  that  it  would  be  nonsensical  to  have  probability  be  proportional  to 
the  field  itself,  rather  than  the  square  of  the  field. 

B  Einstein  did  not  try  to  reconcile  the  wave  and  particle  theories  of 
light,  and  did  not  say  much  about  their  apparent  inconsistency.  Einstein 
basically  visualized  a  beam  of  light  as  a  stream  of  bullets  coming  from 
a  machine  gun.  In  the  photoelectric  effect,  a  photon  “bullet”  would  only 
hit  one  atom,  just  as  a  real  bullet  would  only  hit  one  person.  Suppose 
someone  reading  his  1905  paper  wanted  to  interpret  it  by  saying  that 
Einstein’s  so-called  particles  of  light  are  simply  short  wave-trains  that  only 
occupy  a  small  region  of  space.  Comparing  the  wavelength  of  visible  light 
(a  few  hundred  nm)  to  the  size  of  an  atom  (on  the  order  of  0.1  nm),  explain 
why  this  poses  a  difficulty  for  reconciling  the  particle  and  wave  theories. 

C  Can  a  white  photon  exist? 

D  In  double-slit  diffraction  of  photons,  would  you  get  the  same  pattern 
of  dots  on  the  digital  camera  image  if  you  covered  one  slit?  Why  should  it 
matter  whether  you  give  the  photon  two  choices  or  only  one? 

13.2.4  Photons  in  three  dimensions 

Up  until  now  I’ve  been  sneaky  and  avoided  a  full  discussion  of 
the  three-dimensional  aspects  of  the  probability  interpretation.  The 
example  of  the  carrot  in  the  microwave  oven,  for  example,  reduced 
to  a  one-dimensional  situation  because  we  were  considering  three 
points  along  the  same  line  and  because  we  were  only  comparing 
ratios  of  probabilities.  The  purpose  of  bringing  it  up  now  is  to  head 
off  any  feeling  that  you’ve  been  cheated  conceptually  rather  than  to 
prepare  you  for  mathematical  problem  solving  in  three  dimensions, 
which  would  not  be  appropriate  for  the  level  of  this  course. 

A  typical  example  of  a  probability  distribution  in  section  13.1 
was  the  distribution  of  heights  of  human  beings.  The  thing  that 
varied  randomly,  height,  h.  had  units  of  meters,  and  the  probabil¬ 
ity  distribution  was  a  graph  of  a  function  D{h).  The  units  of  the 
probability  distribution  had  to  be  m_1  (inverse  meters)  so  that  ar¬ 
eas  under  the  curve,  interpreted  as  probabilities,  would  be  unitless: 
(area)  =  (height)  (width)  =  m-1  •  m. 

Now  suppose  we  have  a  two-dimensional  problem,  e.g.,  the  prob¬ 
ability  distribution  for  the  place  on  the  surface  of  a  digital  camera 
chip  where  a  photon  will  be  detected.  The  point  where  it  is  detected 
would  be  described  with  two  variables,  x  and  y,  each  having  units 
of  meters.  The  probability  distribution  will  be  a  function  of  both 
variables,  D(x,y).  A  probability  is  now  visualized  as  the  volume 
under  the  surface  described  by  the  function  D(x,y),  as  shown  in 
figure  n.  The  units  of  D  must  be  m-2  so  that  probabilities  will  be 
unitless:  (probability)  =  (depth)  (length)  (width)  =  m-2  •  m  •  m.  In 
terms  of  calculus,  we  have  P  =  J  Ddxdy. 

Generalizing  finally  to  three  dimensions,  we  find  by  analogy  that 


volume 
under 'surface 


n  /  Probability  is  the  volume 
under  a  surface  defined  by 
D(x,y). 
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the  probability  distribution  will  be  a  function  of  all  three  coordi¬ 
nates,  D(x,y,z),  and  will  have  units  of  m“3.  It  is  unfortunately 
impossible  to  visualize  the  graph  unless  you  are  a  mutant  with  a 
natural  feel  for  life  in  four  dimensions.  If  the  probability  distri¬ 
bution  is  nearly  constant  within  a  certain  volume  of  space  v,  the 
probability  that  the  photon  is  in  that  volume  is  simply  vD.  If  not, 
then  we  can  use  an  integral,  P  =  f  D  dx  dy  dz. 
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13.3  Matter  As  a  Wave 

[In]  a  few  minutes  I  shall  be  all  melted...  I  have  been  wicked  in  my 
day,  but  I  never  thought  a  little  girl  like  you  would  ever  be  able  to 
melt  me  and  end  my  wicked  deeds.  Look  out  —  here  I  go! 

The  Wicked  Witch  of  the  West 

As  the  Wicked  Witch  learned  the  hard  way,  losing  molecular 
cohesion  can  be  unpleasant.  That’s  why  we  should  be  very  grate¬ 
ful  that  the  concepts  of  quantum  physics  apply  to  matter  as  well 
as  light.  If  matter  obeyed  the  laws  of  classical  physics,  molecules 
wouldn’t  exist. 

Consider,  for  example,  the  simplest  atom,  hydrogen.  Why  does 
one  hydrogen  atom  form  a  chemical  bond  with  another  hydrogen 
atom?  Roughly  speaking,  we’d  expect  a  neighboring  pair  of  hy¬ 
drogen  atoms,  A  and  B,  to  exert  no  force  on  each  other  at  all, 
attractive  or  repulsive:  there  are  two  repulsive  interactions  (proton 
A  with  proton  B  and  electron  A  with  electron  B)  and  two  attractive 
interactions  (proton  A  with  electron  B  and  electron  A  with  proton 
B).  Thinking  a  little  more  precisely,  we  should  even  expect  that  once 
the  two  atoms  got  close  enough,  the  interaction  would  be  repulsive. 
For  instance,  if  you  squeezed  them  so  close  together  that  the  two 
protons  were  almost  on  top  of  each  other,  there  would  be  a  tremen¬ 
dously  strong  repulsion  between  them  due  to  the  1  / r 2  nature  of  the 
electrical  force.  The  repulsion  between  the  electrons  would  not  be 
as  strong,  because  each  electron  ranges  over  a  large  area,  and  is  not 
likely  to  be  found  right  on  top  of  the  other  electron.  Thus  hydrogen 
molecules  should  not  exist  according  to  classical  physics. 

Quantum  physics  to  the  rescue!  As  we’ll  see  shortly,  the  whole 
problem  is  solved  by  applying  the  same  quantum  concepts  to  elec- 


Section  1 3.3  Matter  As  a  Wave 


859 


trons  that  we  have  already  used  for  photons. 

13.3.1  Electrons  as  waves 

We  started  our  journey  into  quantum  physics  by  studying  the 
random  behavior  of  matter  in  radioactive  decay,  and  then  asked  how 
randomness  could  be  linked  to  the  basic  laws  of  nature  governing 
light.  The  probability  interpretation  of  wave-particle  duality  was 
strange  and  hard  to  accept,  but  it  provided  such  a  link.  It  is  now 
natural  to  ask  whether  the  same  explanation  could  be  applied  to 
matter.  If  the  fundamental  building  block  of  light,  the  photon,  is 
a  particle  as  well  as  a  wave,  is  it  possible  that  the  basic  units  of 
matter,  such  as  electrons,  are  waves  as  well  as  particles? 

A  young  French  aristocrat  studying  physics,  Louis  de  Broglie 
(pronounced  “broylee”),  made  exactly  this  suggestion  in  his  1923 
Ph.D.  thesis.  His  idea  had  seemed  so  farfetched  that  there  was 
serious  doubt  about  whether  to  grant  him  the  degree.  Einstein  was 
asked  for  his  opinion,  and  with  his  strong  support,  de  Broglie  got 
his  degree. 

Only  two  years  later,  American  physicists  C.J.  Davisson  and  L. 
Gernrer  confirmed  de  Broglie’s  idea  by  accident.  They  had  been 
studying  the  scattering  of  electrons  from  the  surface  of  a  sample 
of  nickel,  made  of  many  small  crystals.  (One  can  often  see  such  a 
crystalline  pattern  on  a  brass  doorknob  that  has  been  polished  by 
repeated  handling.)  An  accidental  explosion  occurred,  and  when 
they  put  their  apparatus  back  together  they  observed  something 
entirely  different:  the  scattered  electrons  were  now  creating  an  in¬ 
terference  pattern!  This  dramatic  proof  of  the  wave  nature  of  matter 
came  about  because  the  nickel  sample  had  been  melted  by  the  ex¬ 
plosion  and  then  resolidified  as  a  single  crystal.  The  nickel  atoms, 
now  nicely  arranged  in  the  regular  rows  and  columns  of  a  crystalline 
lattice,  were  acting  as  the  lines  of  a  diffraction  grating.  The  new 
crystal  was  analogous  to  the  type  of  ordinary  diffraction  grating  in 
which  the  lines  are  etched  on  the  surface  of  a  mirror  (a  reflection 
grating)  rather  than  the  kind  in  which  the  light  passes  through  the 
transparent  gaps  between  the  lines  (a  transmission  grating). 


Although  we  will  concentrate  on  the  wave-particle  duality  of  elec¬ 
trons  because  it  is  important  in  chemistry  and  the  physics  of  atoms, 
all  the  other  “particles”  of  matter  you’ve  learned  about  show  wave 
properties  as  well.  Figure  a,  for  instance,  shows  a  wave  interference 
pattern  of  neutrons. 

It  might  seem  as  though  all  our  work  was  already  done  for  us, 
and  there  would  be  nothing  new  to  understand  about  electrons: 
they  have  the  same  kind  of  funny  wave-particle  duality  as  photons. 
That’s  almost  true,  but  not  quite.  There  are  some  important  ways 
in  which  electrons  differ  significantly  from  photons: 
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a/ A  double-slit  interference  pattern  made  with  neutrons.  (A.  Zeilinger, 
R.  Gahler,  C.G.  Shull,  W.  Treimer,  and  W.  Mampe,  Reviews  of  Modern 
Physics ,  Vol.  60,  1988.) 


1.  Electrons  have  mass,  and  photons  don’t. 

2.  Photons  always  move  at  the  speed  of  light,  but  electrons  can 
move  at  any  speed  less  than  c. 

3.  Photons  don’t  have  electric  charge,  but  electrons  do,  so  electric 
forces  can  act  on  them.  The  most  important  example  is  the 
atom,  in  which  the  electrons  are  held  by  the  electric  force  of 
the  nucleus. 

4.  Electrons  cannot  be  absorbed  or  emitted  as  photons  are.  De¬ 
stroying  an  electron  or  creating  one  out  of  nothing  would  vio¬ 
late  conservation  of  charge. 


(In  section  13.4  we  will  learn  of  one  more  fundamental  way  in  which 
electrons  differ  from  photons,  for  a  total  of  five.) 

Because  electrons  are  different  from  photons,  it  is  not  immedi¬ 
ately  obvious  which  of  the  photon  equations  from  chapter  1 1  can  be 
applied  to  electrons  as  well.  A  particle  property,  the  energy  of  one 
photon,  is  related  to  its  wave  properties  via  E  =  hf  or,  equivalently, 
E  =  hc/X.  The  momentum  of  a  photon  was  given  by  p  =  hf  /c  or 
p  =  h/X.  Ultimately  it  was  a  matter  of  experiment  to  determine 
which  of  these  equations,  if  any,  would  work  for  electrons,  but  we 
can  make  a  quick  and  dirty  guess  simply  by  noting  that  some  of 
the  equations  involve  c,  the  speed  of  light,  and  some  do  not.  Since 
c  is  irrelevant  in  the  case  of  an  electron,  we  might  guess  that  the 
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equations  of  general  validity  are  those  that  do  not  have  c  in  them: 

E  =  hf 
p  =  h/X 

This  is  essentially  the  reasoning  that  de  Broglie  went  through, 
and  experiments  have  confirmed  these  two  equations  for  all  the  fun¬ 
damental  building  blocks  of  light  and  matter,  not  just  for  photons 
and  electrons. 

The  second  equation,  which  I  soft-pedaled  in  the  previous  chap¬ 
ter,  takes  on  a  greater  important  for  electrons.  This  is  first  of  all 
because  the  momentum  of  matter  is  more  likely  to  be  significant 
than  the  momentum  of  light  under  ordinary  conditions,  and  also 
because  force  is  the  transfer  of  momentum,  and  electrons  are  af¬ 
fected  by  electrical  forces. 

The  wavelength  of  an  elephant  example  12 

t>  What  is  the  wavelength  of  a  trotting  elephant? 

>  One  may  doubt  whether  the  equation  should  be  applied  to  an 
elephant,  which  is  not  just  a  single  particle  but  a  rather  large  col¬ 
lection  of  them.  Throwing  caution  to  the  wind,  however,  we  esti¬ 
mate  the  elephant’s  mass  at  103  kg  and  its  trotting  speed  at  10 
m/s.  Its  wavelength  is  therefore  roughly 


P 

h 

mv 


_  6.63  x  10~34  J  s 
-  (103  kg ) ( 1 0  m/s) 

1n-37  (kg  m2/s2)-s 
~10  kg-m/s 
=  10~37  m 


The  wavelength  found  in  this  example  is  so  fantastically  small 
that  we  can  be  sure  we  will  never  observe  any  measurable  wave 
phenomena  with  elephants  or  any  other  human-scale  objects.  The 
result  is  numerically  small  because  Planck’s  constant  is  so  small, 
and  as  in  some  examples  encountered  previously,  this  smallness  is 
in  accord  with  the  correspondence  principle. 

Although  a  smaller  mass  in  the  equation  A  =  li/mv  does  re¬ 
sult  in  a  longer  wavelength,  the  wavelength  is  still  quite  short  even 
for  individual  electrons  under  typical  conditions,  as  shown  in  the 
following  example. 

The  typical  wavelength  of  an  electron  example  13 

>  Electrons  in  circuits  and  in  atoms  are  typically  moving  through 
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voltage  differences  on  the  order  of  1  V,  so  that  a  typical  energy  is 
(e)(1  V),  which  is  on  the  order  of  10~19  J.  What  is  the  wavelength 
of  an  electron  with  this  amount  of  kinetic  energy? 

>  This  energy  is  nonrelativistic,  since  it  is  much  less  than  me2. 
Momentum  and  energy  are  therefore  related  by  the  nonrelativistic 
equation  K  =  p2/2m.  Solving  for  p  and  substituting  in  to  the 
equation  for  the  wavelength,  we  find 


V2mK 

=  1.6  x  1(T9  m. 

This  is  on  the  same  order  of  magnitude  as  the  size  of  an  atom, 
which  is  no  accident:  as  we  will  discuss  in  the  next  chapter  in 
more  detail,  an  electron  in  an  atom  can  be  interpreted  as  a  stand¬ 
ing  wave.  The  smallness  of  the  wavelength  of  a  typical  electron 
also  helps  to  explain  why  the  wave  nature  of  electrons  wasn’t  dis¬ 
covered  until  a  hundred  years  after  the  wave  nature  of  light.  To 
scale  the  usual  wave-optics  devices  such  as  diffraction  gratings 
down  to  the  size  needed  to  work  with  electrons  at  ordinary  ener¬ 
gies,  we  need  to  make  them  so  small  that  their  parts  are  compa¬ 
rable  in  size  to  individual  atoms.  This  is  essentially  what  Davisson 
and  Germer  did  with  their  nickel  crystal. 

self-check  E 


These  remarks  about  the  inconvenient  smallness  of  electron  wavelengths 
apply  only  under  the  assumption  that  the  electrons  have  typical  ener¬ 
gies.  What  kind  of  energy  would  an  electron  have  to  have  in  order  to 
have  a  longer  wavelength  that  might  be  more  convenient  to  work  with? 

>  Answer,  p.  969 


What  kind  of  wave  is  it? 

If  a  sound  wave  is  a  vibration  of  matter,  and  a  photon  is  a 
vibration  of  electric  and  magnetic  fields,  what  kind  of  a  wave  is 
an  electron  made  of?  The  disconcerting  answer  is  that  there  is 
no  experimental  “observable,”  i.e. ,  directly  measurable  quantity,  to 
correspond  to  the  electron  wave  itself.  In  other  words,  there  are 
devices  like  microphones  that  detect  the  oscillations  of  air  pressure 
in  a  sound  wave,  and  devices  such  as  radio  receivers  that  measure 
the  oscillation  of  the  electric  and  magnetic  fields  in  a  light  wave, 
but  nobody  has  ever  found  any  way  to  measure  the  electron  wave 
directly. 

We  can  of  course  detect  the  energy  (or  momentum)  possessed  by 
an  electron  just  as  we  could  detect  the  energy  of  a  photon  using  a 
digital  camera.  (In  fact  I’d  imagine  that  an  unmodified  digital  cam¬ 
era  chip  placed  in  a  vacuum  chamber  would  detect  electrons  just  as 
handily  as  photons.)  But  this  only  allows  us  to  determine  where  the 


b  /  These  two  electron  waves 
are  not  distinguishable  by  any 
measuring  device. 
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wave  carries  high  probability  and  where  it  carries  low  probability. 
Probability  is  proportional  to  the  square  of  the  wave’s  amplitude, 
but  measuring  its  square  is  not  the  same  as  measuring  the  wave 
itself.  In  particular,  we  get  the  same  result  by  squaring  either  a 
positive  number  or  its  negative,  so  there  is  no  way  to  determine  the 
positive  or  negative  sign  of  an  electron  wave.  This  unobservability  of 
the  phase  of  the  wavefunction  is  discussed  in  more  detail  on  p.  885. 

Most  physicists  tend  toward  the  school  of  philosophy  known  as 
operationalism,  which  says  that  a  concept  is  only  meaningful  if  we 
can  define  some  set  of  operations  for  observing,  measuring,  or  test¬ 
ing  it.  According  to  a  strict  operationalist,  then,  the  electron  wave 
itself  is  a  meaningless  concept.  Nevertheless,  it  turns  out  to  be  one 
of  those  concepts  like  love  or  humor  that  is  impossible  to  measure 
and  yet  very  useful  to  have  around.  We  therefore  give  it  a  symbol, 
'k  (the  capital  Greek  letter  psi),  and  a  special  name,  the  electron 
wavefunction  (because  it  is  a  function  of  the  coordinates  x ,  y,  and 
z  that  specify  where  you  are  in  space).  It  would  be  impossible,  for 
example,  to  calculate  the  shape  of  the  electron  wave  in  a  hydro¬ 
gen  atom  without  having  some  symbol  for  the  wave.  But  when  the 
calculation  produces  a  result  that  can  be  compared  directly  to  ex¬ 
periment,  the  final  algebraic  result  will  turn  out  to  involve  only  T2, 
which  is  what  is  observable,  not  T  itself. 

Since  T,  unlike  E  and  B,  is  not  directly  measurable,  we  are  free 
to  make  the  probability  equations  have  a  simple  form:  instead  of 
having  the  probability  density  equal  to  some  funny  constant  multi¬ 
plied  by  T2,  we  simply  define  \k  so  that  the  constant  of  proportion¬ 
ality  is  one: 

(probability  distribution)  =  |  \k  | 2 . 

Since  the  probability  distribution  has  units  of  m~:!,  the  units  of  \k 
must  be  m -3/2.  The  square  of  a  negative  number  is  still  positive,  so 
the  absolute  value  signs  may  seem  unnecessary,  but  as  we’ll  see  on 
p.  881  in  sec.  13.3.6,  the  wavefunction  may  in  general  be  a  complex 
number.  In  fact,  only  standing  waves,  not  traveling  waves,  can  really 
be  represented  by  real  numbers,  although  we  will  often  cheat  and 
draw  pictures  of  traveling  waves  as  if  they  were  real- valued  functions. 

Discussion  Question 

A  Frequency  is  oscillations  per  second,  whereas  wavelength  is  meters 
per  oscillation.  How  could  the  equations  E  =  hf  and  p  =  h/ A  be  made 
to  look  more  alike  by  using  quantities  that  were  more  closely  analogous? 
(This  more  symmetric  treatment  makes  it  easier  to  incorporate  relativity 
into  quantum  mechanics,  since  relativity  says  that  space  and  time  are  not 
entirely  separate.) 

13.3.2  Dispersive  waves 

A  colleague  of  mine  who  teaches  chemistry  loves  to  tell  the  story 
about  an  exceptionally  bright  student  who,  when  told  of  the  equa¬ 
tion  p  =  h/X,  protested,  “But  when  I  derived  it,  it  had  a  factor  of 
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2!”  The  issue  that’s  involved  is  a  real  one,  albeit  one  that  could  be 
glossed  over  (and  is,  in  most  textbooks)  without  raising  any  alarms 
in  the  mind  of  the  average  student.  The  present  optional  section 
addresses  this  point;  it  is  intended  for  the  student  who  wishes  to 
delve  a  little  deeper. 

Here’s  how  the  now-legendary  student  was  presumably  reason¬ 
ing.  We  start  with  the  equation  v  =  /A,  which  is  valid  for  any  sine 
wave,  whether  it’s  quantum  or  classical.  Let’s  assume  we  already 
know  E  =  hf,  and  are  trying  to  derive  the  relationship  between 
wavelength  and  momentum: 


vh 
~  /  •; 
vh 

1  9 

2  rmr 
2  h 

mv 
_  2  h 

P  ' 

The  reasoning  seems  valid,  but  the  result  does  contradict  the 
accepted  one,  which  is  after  all  solidly  based  on  experiment. 

The  mistaken  assumption  is  that  we  can  figure  everything  out  in 
terms  of  pure  sine  waves.  Mathematically,  the  only  wave  that  has 
a  perfectly  well  defined  wavelength  and  frequency  is  a  sine  wave, 
and  not  just  any  sine  wave  but  an  infinitely  long  sine  wave,  c.  The 
unphysical  thing  about  such  a  wave  is  that  it  has  no  leading  or 
trailing  edge,  so  it  can  never  be  said  to  enter  or  leave  any  particular 
region  of  space.  Our  derivation  made  use  of  the  velocity,  v,  and  if 
velocity  is  to  be  a  meaningful  concept,  it  must  tell  us  how  quickly 
stuff  (mass,  energy,  momentum,  . . . )  is  transported  from  one  region 
of  space  to  another.  Since  an  infinitely  long  sine  wave  doesn’t  remove 
any  stuff  from  one  region  and  take  it  to  another,  the  “velocity  of  its 
stuff”  is  not  a  well  defined  concept. 

Of  course  the  individual  wave  peaks  do  travel  through  space,  and 
one  might  think  that  it  would  make  sense  to  associate  their  speed 
with  the  “speed  of  stuff,”  but  as  we  will  see,  the  two  velocities  are 
in  general  unequal  when  a  wave’s  velocity  depends  on  wavelength. 
Such  a  wave  is  called  a  dispersive  wave,  because  a  wave  pulse  consist¬ 
ing  of  a  superposition  of  waves  of  different  wavelengths  will  separate 
(disperse)  into  its  separate  wavelengths  as  the  waves  move  through 
space  at  different  speeds.  Nearly  all  the  waves  we  have  encountered 
have  been  nondispersive.  For  instance,  sound  waves  and  light  waves 
(in  a  vacuum)  have  speeds  independent  of  wavelength.  A  water  wave 
is  one  good  example  of  a  dispersive  wave.  Long-wavelength  water 


c  /  Part  of  an  infinite  sine  wave. 


d/A  finite-length  sine  wave. 


e  /  A  beat  pattern  created  by 
superimposing  two  sine  waves 
with  slightly  different  wave¬ 
lengths. 
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waves  travel  faster,  so  a  ship  at  sea  that  encounters  a  storm  typi¬ 
cally  sees  the  long- wavelength  parts  of  the  wave  first.  When  dealing 
with  dispersive  waves,  we  need  symbols  and  words  to  distinguish 
the  two  speeds.  The  speed  at  which  wave  peaks  move  is  called  the 
phase  velocity,  vp,  and  the  speed  at  which  “stuff”  moves  is  called 
the  group  velocity,  vg. 

An  infinite  sine  wave  can  only  tell  us  about  the  phase  velocity, 
not  the  group  velocity,  which  is  really  what  we  would  be  talking 
about  when  we  refer  to  the  speed  of  an  electron.  If  an  infinite 
sine  wave  is  the  simplest  possible  wave,  what’s  the  next  best  thing? 
We  might  think  the  runner  up  in  simplicity  would  be  a  wave  train 
consisting  of  a  chopped-off  segment  of  a  sine  wave,  d.  However,  this 
kind  of  wave  has  kinks  in  it  at  the  end.  A  simple  wave  should  be 
one  that  we  can  build  by  superposing  a  small  number  of  infinite 
sine  waves,  but  a  kink  can  never  be  produced  by  superposing  any 
number  of  infinitely  long  sine  waves. 

Actually  the  simplest  wave  that  transports  stuff  from  place  to 
place  is  the  pattern  shown  in  figure  e.  Called  a  beat  pattern,  it  is 
formed  by  superposing  two  sine  waves  whose  wavelengths  are  similar 
but  not  quite  the  same.  If  you  have  ever  heard  the  pulsating  howling 
sound  of  musicians  in  the  process  of  tuning  their  instruments  to  each 
other,  you  have  heard  a  beat  pattern.  The  beat  pattern  gets  stronger 
and  weaker  as  the  two  sine  waves  go  in  and  out  of  phase  with  each 
other.  The  beat  pattern  has  more  “stuff”  (energy,  for  example) 
in  the  areas  where  constructive  interference  occurs,  and  less  in  the 
regions  of  cancellation.  As  the  whole  pattern  moves  through  space, 
stuff  is  transported  from  some  regions  and  into  other  ones. 

If  the  frequency  of  the  two  sine  waves  differs  by  10%,  for  in¬ 
stance,  then  ten  periods  will  be  occur  between  times  when  they  are 
in  phase.  Another  way  of  saying  it  is  that  the  sinusoidal  “envelope” 
(the  dashed  lines  in  figure  e)  has  a  frequency  equal  to  the  difference 
in  frequency  between  the  two  waves.  For  instance,  if  the  waves  had 
frequencies  of  100  Hz  and  110  Hz,  the  frequency  of  the  envelope 
would  be  10  Hz. 

To  apply  similar  reasoning  to  the  wavelength,  we  must  define  a 
quantity  z  =  1/A  that  relates  to  wavelength  in  the  same  way  that 
frequency  relates  to  period.  In  terms  of  this  new  variable,  the  z  of 
the  envelope  equals  the  difference  between  the  z's  of  the  two  sine 
waves. 

The  group  velocity  is  the  speed  at  which  the  envelope  moves 
through  space.  Let  A /  and  Az  be  the  differences  between  the 
frequencies  and  z's  of  the  two  sine  waves,  which  means  that  they 
equal  the  frequency  and  z  of  the  envelope.  The  group  velocity  is 
Vg  =  fenveiopeKnveiope  =  A//Az.  If  A /  and  A^  are  sufficiently 
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small,  we  can  approximate  this  expression  as  a  derivative, 


This  expression  is  usually  taken  as  the  definition  of  the  group  veloc¬ 
ity  for  wave  patterns  that  consist  of  a  superposition  of  sine  waves 
having  a  narrow  range  of  frequencies  and  wavelengths.  In  quan¬ 
tum  mechanics,  with  f  =  E/h  and  z  =  p/h,  we  have  vg  =  d E /  dp. 
In  the  case  of  a  nonrelativistic  electron  the  relationship  between 
energy  and  momentum  is  E  =  p2 /2m,  so  the  group  velocity  is 
d E /  dp  =  p/m  =  v,  exactly  what  it  should  be.  It  is  only  the  phase 
velocity  that  differs  by  a  factor  of  two  from  what  we  would  have 
expected,  but  the  phase  velocity  is  not  the  physically  important 
thing. 


13.3.3  Bound  states 


Electrons  are  at  their  most  interesting  when  they’re  in  atoms, 
that  is,  when  they  are  bound  within  a  small  region  of  space.  We 
can  understand  a  great  deal  about  atoms  and  molecules  based  on 
simple  arguments  about  such  bound  states,  without  going  into  any 
of  the  realistic  details  of  atom.  The  simplest  model  of  a  bound  state 
is  known  as  the  particle  in  a  box:  like  a  ball  on  a  pool  table,  the 
electron  feels  zero  force  while  in  the  interior,  but  when  it  reaches 
an  edge  it  encounters  a  wall  that  pushes  back  inward  on  it  with 
a  large  force.  In  particle  language,  we  would  describe  the  electron 
as  bouncing  off  of  the  wall,  but  this  incorrectly  assumes  that  the 
electron  has  a  certain  path  through  space.  It  is  more  correct  to 
describe  the  electron  as  a  wave  that  undergoes  100%  reflection  at 
the  boundaries  of  the  box. 

Like  a  generation  of  physics  students  before  me,  I  rolled  my 
eyes  when  initially  introduced  to  the  unrealistic  idea  of  putting  a 
particle  in  a  box.  It  seemed  completely  impractical,  an  artificial 
textbook  invention.  Today,  however,  it  has  become  routine  to  study 
electrons  in  rectangular  boxes  in  actual  laboratory  experiments.  The 
“box”  is  actually  just  an  empty  cavity  within  a  solid  piece  of  silicon, 
amounting  in  volume  to  a  few  hundred  atoms.  The  methods  for 
creating  these  electron-in-a-box  setups  (known  as  “quantum  dots”) 
were  a  by-product  of  the  development  of  technologies  for  fabricating 
computer  chips. 

For  simplicity  let’s  imagine  a  one-dimensional  electron  in  a  box, 
i.e.,  we  assume  that  the  electron  is  only  free  to  move  along  a  line. 
The  resulting  standing  wave  patterns,  of  which  the  first  three  are 
shown  in  the  figure,  are  just  like  some  of  the  patterns  we  encoun¬ 
tered  with  sound  waves  in  musical  instruments.  The  wave  patterns 
must  be  zero  at  the  ends  of  the  box,  because  we  are  assuming  the 
walls  are  impenetrable,  and  there  should  therefore  be  zero  proba¬ 
bility  of  finding  the  electron  outside  the  box.  Each  wave  pattern  is 
labeled  according  to  n,  the  number  of  peaks  and  valleys  it  has.  In 


f  /  Three  possible  standing- 
wave  patterns  for  a  particle  in  a 
box. 
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quantum  physics,  these  wave  patterns  are  referred  to  as  “states”  of 
the  particle-in-the-box  system. 

The  following  seemingly  innocuous  observations  about  the  par¬ 
ticle  in  the  box  lead  us  directly  to  the  solutions  to  some  of  the  most 
vexing  failures  of  classical  physics: 

The  particle’s  energy  is  quantized  (can  only  have  certain  values). 
Each  wavelength  corresponds  to  a  certain  momentum,  and  a  given 
momentum  implies  a  definite  kinetic  energy,  E  =  p2 /2m.  (This  is 
the  second  type  of  energy  quantization  we  have  encountered.  The 
type  we  studied  previously  had  to  do  with  restricting  the  number 
of  particles  to  a  whole  number,  while  assuming  some  specific  wave¬ 
length  and  energy  for  each  particle.  This  type  of  quantization  refers 
to  the  energies  that  a  single  particle  can  have.  Both  photons  and 
matter  particles  demonstrate  both  types  of  quantization  under  the 
appropriate  circumstances.) 

The  particle  has  a  minimum  kinetic  energy.  Long  wavelengths  cor¬ 
respond  to  low  momenta  and  low  energies.  There  can  be  no  state 
with  an  energy  lower  than  that  of  the  n  =  1  state,  called  the  ground 
state. 

The  smaller  the  space  in  which  the  particle  is  confined,  the  higher 
its  kinetic  energy  must  be.  Again,  this  is  because  long  wavelengths 
give  lower  energies. 

Spectra  of  thin  gases  example  14 

A  fact  that  was  inexplicable  by  classical  physics  was  that  thin 
gases  absorb  and  emit  light  only  at  certain  wavelengths.  This 
was  observed  both  in  earthbound  laboratories  and  in  the  spectra 
of  stars.  The  figure  on  the  left  shows  the  example  of  the  spec¬ 
trum  of  the  star  Sirius,  in  which  there  are  “gap  teeth”  at  certain 
wavelengths.  Taking  this  spectrum  as  an  example,  we  can  give  a 
straightforward  explanation  using  quantum  physics. 

Energy  is  released  in  the  dense  interior  of  the  star,  but  the  outer 
layers  of  the  star  are  thin,  so  the  atoms  are  far  apart  and  electrons 
are  confined  within  individual  atoms.  Although  their  standing- 
wave  patterns  are  not  as  simple  as  those  of  the  particle  in  the 
box,  their  energies  are  quantized. 

When  a  photon  is  on  its  way  out  through  the  outer  layers,  it  can  be 
absorbed  by  an  electron  in  an  atom,  but  only  if  the  amount  of  en¬ 
ergy  it  carries  happens  to  be  the  right  amount  to  kick  the  electron 
from  one  of  the  allowed  energy  levels  to  one  of  the  higher  lev¬ 
els.  The  photon  energies  that  are  missing  from  the  spectrum  are 
the  ones  that  equal  the  difference  in  energy  between  two  elec¬ 
tron  energy  levels.  (The  most  prominent  of  the  absorption  lines  in 
Sirius’s  spectrum  are  absorption  lines  of  the  hydrogen  atom.) 

The  stability  of  atoms  example  15 


blue 


red 


g/The  spectrum  of  the  light 
from  the  star  Sirius. 
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In  many  Star  Trek  episodes  the  Enterprise,  in  orbit  around  a  planet, 
suddenly  lost  engine  power  and  began  spiraling  down  toward  the 
planet’s  surface.  This  was  utter  nonsense,  of  course,  due  to  con¬ 
servation  of  energy:  the  ship  had  no  way  of  getting  rid  of  energy, 
so  it  did  not  need  the  engines  to  replenish  it. 

Consider,  however,  the  electron  in  an  atom  as  it  orbits  the  nu¬ 
cleus.  The  electron  does  have  a  way  to  release  energy:  it  has  an 
acceleration  due  to  its  continuously  changing  direction  of  motion, 
and  according  to  classical  physics,  any  accelerating  charged  par¬ 
ticle  emits  electromagnetic  waves.  According  to  classical  physics, 
atoms  should  collapse! 

The  solution  lies  in  the  observation  that  a  bound  state  has  a  min¬ 
imum  energy.  An  electron  in  one  of  the  higher-energy  atomic 
states  can  and  does  emit  photons  and  hop  down  step  by  step  in 
energy.  But  once  it  is  in  the  ground  state,  it  cannot  emit  a  photon 
because  there  is  no  lower-energy  state  for  it  to  go  to. 

Chemical  bonds  example  16 

I  began  this  section  with  a  classical  argument  that  chemical 
bonds,  as  in  an  H2  molecule,  should  not  exist.  Quantum  physics 
explains  why  this  type  of  bonding  does  in  fact  occur.  When  the 
atoms  are  next  to  each  other,  the  electrons  are  shared  between 
them.  The  “box”  is  about  twice  as  wide,  and  a  larger  box  allows 
a  smaller  energy.  Energy  is  required  in  order  to  separate  the 
atoms.  (A  qualitatively  different  type  of  bonding  is  discussed  on 
page  904.  Example  25  on  page  899  revisits  the  H2  bond  in  more 
detail.) 

Discussion  Questions 

A  Neutrons  attract  each  other  via  the  strong  nuclear  force,  so  according 
to  classical  physics  it  should  be  possible  to  form  nuclei  out  of  clusters  of 
two  or  more  neutrons,  with  no  protons  at  all.  Experimental  searches, 
however,  have  failed  to  turn  up  evidence  of  a  stable  two-neutron  system 
(dineutron)  or  larger  stable  clusters.  These  systems  are  apparently  not 
just  unstable  in  the  sense  of  being  able  to  beta  decay  but  unstable  in 
the  sense  that  they  don’t  hold  together  at  all.  Explain  based  on  quantum 
physics  why  a  dineutron  might  spontaneously  fly  apart. 

B  The  following  table  shows  the  energy  gap  between  the  ground  state 
and  the  first  excited  state  for  four  nuclei,  in  units  of  picojoules.  (The  nuclei 
were  chosen  to  be  ones  that  have  similar  structures,  e.g.,  they  are  all 
spherical  in  shape.) 


nucleus 

energy  gap  (picojoules) 

4He 

3.234 

160 

0.968 

40Ca 

0.536 

208  Pb 

0.418 

Explain  the  trend  in  the  data. 
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h  /  Two  hydrogen  atoms  bond 
to  form  an  HE  molecule.  In  the 
molecule,  the  two  electrons’  wave 
patterns  overlap  ,  and  are  about 
twice  as  wide. 


i/ Werner  Heisenberg  (1901- 
1976).  Heisenberg  helped  to 
develop  the  foundations  of  quan¬ 
tum  mechanics,  including  the 
Heisenberg  uncertainty  principle. 
He  was  the  scientific  leader  of 
the  Nazi  atomic-bomb  program 
up  until  its  cancellation  in  1942, 
when  the  military  decided  that  it 
was  too  ambitious  a  project  to 
undertake  in  wartime,  and  too 
unlikely  to  produce  results. 


13.3.4  The  uncertainty  principle  and  measurement 

Eliminating  randomness  through  measurement  ? 

A  common  reaction  to  quantum  physics,  among  both  early- 
twentieth-century  physicists  and  modern  students,  is  that  we  should 
be  able  to  get  rid  of  randomness  through  accurate  measurement.  If 
I  say,  for  example,  that  it  is  meaningless  to  discuss  the  path  of  a 
photon  or  an  electron,  one  might  suggest  that  we  simply  measure 
the  particle’s  position  and  velocity  many  times  in  a  row.  This  series 
of  snapshots  would  amount  to  a  description  of  its  path. 

A  practical  objection  to  this  plan  is  that  the  process  of  measure¬ 
ment  will  have  an  effect  on  the  thing  we  are  trying  to  measure.  This 
may  not  be  of  much  concern,  for  example,  when  a  traffic  cop  mea¬ 
sure’s  your  car’s  motion  with  a  radar  gun,  because  the  energy  and 
momentum  of  the  radar  pulses  are  insufficient  to  change  the  car’s 
motion  significantly.  But  on  the  subatomic  scale  it  is  a  very  real 
problem.  Making  a  videotape  through  a  microscope  of  an  electron 
orbiting  a  nucleus  is  not  just  difficult,  it  is  theoretically  impossi¬ 
ble.  The  video  camera  makes  pictures  of  things  using  light  that  has 
bounced  off  them  and  come  into  the  camera.  If  even  a  single  photon 
of  visible  light  was  to  bounce  off  of  the  electron  we  were  trying  to 
study,  the  electron’s  recoil  would  be  enough  to  change  its  behavior 
significantly. 

The  Heisenberg  uncertainty  principle 

This  insight,  that  measurement  changes  the  thing  being  mea¬ 
sured,  is  the  kind  of  idea  that  clove-cigarette-smoking  intellectuals 
outside  of  the  physical  sciences  like  to  claim  they  knew  all  along.  If 
only,  they  say,  the  physicists  had  made  more  of  a  habit  of  reading 
literary  journals,  they  could  have  saved  a  lot  of  work.  The  anthro¬ 
pologist  Margaret  Mead  has  recently  been  accused  of  inadvertently 
encouraging  her  teenaged  Samoan  informants  to  exaggerate  the  free¬ 
dom  of  youthful  sexual  experimentation  in  their  society.  If  this  is 
considered  a  damning  critique  of  her  work,  it  is  because  she  could 
have  done  better:  other  anthropologists  claim  to  have  been  able  to 
eliminate  the  observer-as-participant  problem  and  collect  untainted 
data. 

The  German  physicist  Werner  Heisenberg,  however,  showed  that 
in  quantum  physics,  any  measuring  technique  runs  into  a  brick  wall 
when  we  try  to  improve  its  accuracy  beyond  a  certain  point.  Heisen¬ 
berg  showed  that  the  limitation  is  a  question  of  what  there  is  to  be 
known,  even  in  principle,  about  the  system  itself,  not  of  the  ability 
or  inability  of  a  specific  measuring  device  to  ferret  out  information 
that  is  knowable  but  not  previously  hidden. 

Suppose,  for  example,  that  we  have  constructed  an  electron  in  a 
box  (quantum  dot)  setup  in  our  laboratory,  and  we  are  able  to  adjust 
the  length  L  of  the  box  as  desired.  All  the  standing  wave  patterns 
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pretty  much  fill  the  box,  so  our  knowledge  of  the  electron’s  position 
is  of  limited  accuracy.  If  we  write  Ax  for  the  range  of  uncertainty 
in  our  knowledge  of  its  position,  then  Ax  is  roughly  the  same  as  the 
length  of  the  box: 

Ax  ps  L 

If  we  wish  to  know  its  position  more  accurately,  we  can  certainly 
squeeze  it  into  a  smaller  space  by  reducing  L,  but  this  has  an  unin¬ 
tended  side-effect.  A  standing  wave  is  really  a  superposition  of  two 
traveling  waves  going  in  opposite  directions.  The  equation  p  =  h/X 
really  only  gives  the  magnitude  of  the  momentum  vector,  not  its 
direction,  so  we  should  really  interpret  the  wave  as  a  50/50  mixture 
of  a  right-going  wave  with  momentum  p  =  h/X  and  a  left-going  one 
with  momentum  p  =  —h/X.  The  uncertainty  in  our  knowledge  of 
the  electron’s  momentum  is  A p  =  2h/X,  covering  the  range  between 
these  two  values.  Even  if  we  make  sure  the  electron  is  in  the  ground 
state,  whose  wavelength  A  =  2L  is  the  longest  possible,  we  have  an 
uncertainty  in  momentum  of  A p  =  h/L.  In  general,  we  find 

&P>h/L, 

with  equality  for  the  ground  state  and  inequality  for  the  higher- 
energy  states.  Thus  if  we  reduce  L  to  improve  our  knowledge  of  the 
electron’s  position,  we  do  so  at  the  cost  of  knowing  less  about  its 
momentum.  This  trade-off  is  neatly  summarized  by  multiplying  the 
two  equations  to  give 

ApAx  >  h. 

Although  we  have  derived  this  in  the  special  case  of  a  particle  in  a 
box,  it  is  an  example  of  a  principle  of  more  general  validity: 


The  Heisenberg  uncertainty  principle 

It  is  not  possible,  even  in  principle,  to  know  the  momentum  and  the 
position  of  a  particle  simultaneously  and  with  perfect  accuracy.  The 
uncertainties  in  these  two  quantities  are  always  such  that  ApAx  > 
h. 


(This  approximation  can  be  made  into  a  strict  inequality,  ApAx  > 
h/ 47T,  but  only  with  more  careful  definitions,  which  we  will  not 
bother  with.) 

Note  that  although  I  encouraged  you  to  think  of  this  deriva¬ 
tion  in  terms  of  a  specific  real-world  system,  the  quantum  dot,  no 
reference  was  ever  made  to  any  specific  laboratory  equipment  or  pro¬ 
cedures.  The  argument  is  simply  that  we  cannot  know  the  particle’s 
position  very  accurately  unless  it  has  a  very  well  defined  position, 
it  cannot  have  a  very  well  defined  position  unless  its  wave-pattern 
covers  only  a  very  small  amount  of  space,  and  its  wave-pattern  can¬ 
not  be  thus  compressed  without  giving  it  a  short  wavelength  and 
a  correspondingly  uncertain  momentum.  The  uncertainty  princi¬ 
ple  is  therefore  a  restriction  on  how  much  there  is  to  know  about 
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a  particle,  not  just  on  what  we  can  know  about  it  with  a  certain 
technique. 

An  estimate  for  electrons  in  atoms  example  1 7 

>  A  typical  energy  for  an  electron  in  an  atom  is  on  the  order  of 
(1  volt)-e,  which  corresponds  to  a  speed  of  about  1%  of  the  speed 
of  light.  If  a  typical  atom  has  a  size  on  the  order  of  0.1  nm,  how 
close  are  the  electrons  to  the  limit  imposed  by  the  uncertainty 
principle? 

>  If  we  assume  the  electron  moves  in  all  directions  with  equal 
probability,  the  uncertainty  in  its  momentum  is  roughly  twice  its 
typical  momentum.  This  only  an  order-of-magnitude  estimate,  so 
we  take  Ap  to  be  the  same  as  a  typical  momentum: 

ApAx  =  PtyPicaAx 

=  (/ne/ecfron)(0.01C)(0.1  x  10~9  m) 

=  3  x  1(T34  J  s 

This  is  on  the  same  order  of  magnitude  as  Planck’s  constant,  so 
evidently  the  electron  is  “right  up  against  the  wall.”  (The  fact  that 
it  is  somewhat  less  than  h  is  of  no  concern  since  this  was  only  an 
estimate,  and  we  have  not  stated  the  uncertainty  principle  in  its 
most  exact  form.) 

self-check  F 

If  we  were  to  apply  the  uncertainty  principle  to  human-scale  objects, 
what  would  be  the  significance  of  the  small  numerical  value  of  Planck’s 
constant?  >  Answer,  p.  969 

Measurement  and  Schrodinger’s  cat 

On  p.  855  I  briefly  mentioned  an  issue  concerning  measurement 
that  we  are  now  ready  to  address  carefully.  If  you  hang  around  a 
laboratory  where  quantum-physics  experiments  are  being  done  and 
secretly  record  the  physicists’  conversations,  you’ll  hear  them  say 
many  things  that  assume  the  probability  interpretation  of  quantum 
mechanics.  Usually  they  will  speak  as  though  the  randomness  of 
quantum  mechanics  enters  the  picture  when  something  is  measured. 
In  the  digital  camera  experiments  of  section  13.2,  for  example,  they 
would  casually  describe  the  detection  of  a  photon  at  one  of  the  pixels 
as  if  the  moment  of  detection  was  when  the  photon  was  forced  to 
“make  up  its  mind.”  Although  this  mental  cartoon  usually  works 
fairly  well  as  a  description  of  things  they  experience  in  the  lab,  it 
cannot  ultimately  be  correct,  because  it  attributes  a  special  role  to 
measurement,  which  is  really  just  a  physical  process  like  all  other 
physical  processes.4 

4This  interpretation  of  quantum  mechanics  is  called  the  Copenhagen  inter¬ 
pretation,  because  it  was  originally  developed  by  a  school  of  physicists  centered 
in  Copenhagen  and  led  by  Niels  Bohr. 
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If  we  are  to  find  an  interpretation  that  avoids  giving  any  spe¬ 
cial  role  to  measurement  processes,  then  we  must  think  of  the  entire 
laboratory,  including  the  measuring  devices  and  the  physicists  them¬ 
selves,  as  one  big  quantum-mechanical  system  made  out  of  protons, 
neutrons,  electrons,  and  photons.  In  other  words,  we  should  take 
quantum  physics  seriously  as  a  description  not  just  of  microscopic 
objects  like  atoms  but  of  human-scale  (“macroscopic”)  things  like 
the  apparatus,  the  furniture,  and  the  people. 

The  most  celebrated  example  is  called  the  Schrodinger’s  cat  ex¬ 
periment.  Luckily  for  the  cat,  there  probably  was  no  actual  experi¬ 
ment  —  it  was  simply  a  “thought  experiment”  that  the  German  the¬ 
orist  Schrodinger  discussed  with  his  colleagues.  Schrodinger  wrote: 


One  can  even  construct  quite  burlesque  cases.  A  cat  is 
shut  up  in  a  steel  container,  together  with  the  following 
diabolical  apparatus  (which  one  must  keep  out  of  the 
direct  clutches  of  the  cat):  In  a  Geiger  tube  [radiation 
detector]  there  is  a  tiny  mass  of  radioactive  substance,  so 
little  that  in  the  course  of  an  hour  perhaps  one  atom  of 
it  disintegrates,  but  also  with  equal  probability  not  even 
one;  if  it  does  happen,  the  counter  [detector]  responds 
and  ...  activates  a  hammer  that  shatters  a  little  flask 
of  prussic  acid  [filling  the  chamber  with  poison  gas].  If 
one  has  left  this  entire  system  to  itself  for  an  hour,  then 
one  will  say  to  himself  that  the  cat  is  still  living,  if  in 
that  time  no  atom  has  disintegrated.  The  first  atomic 
disintegration  would  have  poisoned  it. 


Now  comes  the  strange  part.  Quantum  mechanics  describes  the 
particles  the  cat  is  made  of  as  having  wave  properties,  including  the 
property  of  superposition.  Schrodinger  describes  the  wavefunction 
of  the  box’s  contents  at  the  end  of  the  hour: 

The  wavefunction  of  the  entire  system  would  express  this  sit¬ 
uation  by  having  the  living  and  the  dead  cat  mixed  ...  in  equal 
parts  [50/50  proportions].  The  uncertainty  originally  restricted  to 
the  atomic  domain  has  been  transformed  into  a  macroscopic  uncer¬ 
tainty.  . . 

At  first  Schrodinger’s  description  seems  like  nonsense.  When  you 
opened  the  box,  would  you  see  two  ghostlike  cats,  as  in  a  doubly 
exposed  photograph,  one  dead  and  one  alive?  Obviously  not.  You 
would  have  a  single,  fully  material  cat,  which  would  either  be  dead 
or  very,  very  upset.  But  Schrodinger  has  an  equally  strange  and 
logical  answer  for  that  objection.  In  the  same  way  that  the  quantum 
randomness  of  the  radioactive  atom  spread  to  the  cat  and  made  its 
wavefunction  a  random  mixture  of  life  and  death,  the  randomness 
spreads  wider  once  you  open  the  box,  and  your  own  wavefunction 


j  /  Schrodinger’s  cat. 
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direction  of  motion 
(speeding  up) 

- > 


high  PE  low  PE 

low  KE  high  KE 

low  momentum  high  momentum 

long  wavelength  short  wavelength 

weak  curvature  strong  curvature 

k  /  An  electron  in  a  gentle 
electric  field  gradually  shortens 
its  wavelength  as  it  gains  energy. 
(As  discussed  on  p.  864,  it  is 
actually  not  quite  correct  to  graph 
the  wavefunction  of  an  electron 
as  a  real  number  unless  it  is  a 
standing  wave,  which  isn’t  the 
case  here.) 


I  /  The  wavefunction’s  tails 
go  where  classical  physics  says 
they  shouldn’t. 


becomes  a  mixture  of  a  person  who  has  just  killed  a  cat  and  a  person 
who  hasn’t.5 

Discussion  Questions 

A  Compare  A p  and  Ax  for  the  two  lowest  energy  levels  of  the  one¬ 
dimensional  particle  in  a  box,  and  discuss  how  this  relates  to  the  uncer¬ 
tainty  principle. 

B  On  a  graph  of  A p  versus  Ax,  sketch  the  regions  that  are  allowed  and 
forbidden  by  the  Heisenberg  uncertainty  principle.  Interpret  the  graph: 
Where  does  an  atom  lie  on  it?  An  elephant?  Can  either  p  or  x  be  mea¬ 
sured  with  perfect  accuracy  if  we  don’t  care  about  the  other? 

13.3.5  Electrons  in  electric  fields 

So  far  the  only  electron  wave  patterns  we’ve  considered  have 
been  simple  sine  waves,  but  whenever  an  electron  finds  itself  in  an 
electric  field,  it  must  have  a  more  complicated  wave  pattern.  Let’s 
consider  the  example  of  an  electron  being  accelerated  by  the  elec¬ 
tron  gun  at  the  back  of  a  TV  tube.  Newton’s  laws  are  not  useful, 
because  they  implicitly  assume  that  the  path  taken  by  the  particle  is 
a  meaningful  concept.  Conservation  of  energy  is  still  valid  in  quan¬ 
tum  physics,  however.  In  terms  of  energy,  the  electron  is  moving 
from  a  region  of  low  voltage  into  a  region  of  higher  voltage.  Since 
its  charge  is  negative,  it  loses  electrical  energy  by  moving  to  a  higher 
voltage,  so  its  kinetic  energy  increases.  As  its  electrical  energy  goes 
down,  its  kinetic  energy  goes  up  by  an  equal  amount,  keeping  the 
total  energy  constant.  Increasing  kinetic  energy  implies  a  growing 
momentum,  and  therefore  a  shortening  wavelength,  k. 

The  wavefunction  as  a  whole  does  not  have  a  single  well-defined 
wavelength,  but  the  wave  changes  so  gradually  that  if  you  only  look 
at  a  small  part  of  it  you  can  still  pick  out  a  wavelength  and  relate 
it  to  the  momentum  and  energy.  (The  picture  actually  exagger¬ 
ates  by  many  orders  of  magnitude  the  rate  at  which  the  wavelength 
changes.) 

But  what  if  the  electric  field  was  stronger?  The  electric  field  in 
an  old-fashioned  vacuum  tube  TV  screen  is  only  105  N/C,  but  the 
electric  field  within  an  atom  is  more  like  1012  N/C.  In  figure  m,  the 
wavelength  changes  so  rapidly  that  there  is  nothing  that  looks  like 
a  sine  wave  at  all.  We  could  get  a  rough  idea  of  the  wavelength  in  a 
given  region  by  measuring  the  distance  between  two  peaks,  but  that 
would  only  be  a  rough  approximation.  Suppose  we  want  to  know 
the  wavelength  at  point  P.  The  trick  is  to  construct  a  sine  wave,  like 
the  one  shown  with  the  dashed  line,  which  matches  the  curvature  of 
the  actual  wavefunction  as  closely  as  possible  near  P.  The  sine  wave 
that  matches  as  well  as  possible  is  called  the  “osculating”  curve,  from 
a  Latin  word  meaning  “to  kiss.”  The  wavelength  of  the  osculating 


sThis  interpretation,  known  as  the  many-worlds  interpretation,  was  developed 
by  Hugh  Everett  in  1957. 
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curve  is  the  wavelength  that  will  relate  correctly  to  conservation  of 
energy. 


m/A  typical  wavefunction  of  an  electron  in  an  atom  (heavy  curve) 
and  the  osculating  sine  wave  (dashed  curve)  that  matches  its  curvature 
at  point  P. 


Tunneling 

We  implicitly  assumed  that  the  particle-in-a-box  wavefunction 
would  cut  off  abruptly  at  the  sides  of  the  box,  1/1,  but  that  would  be 
unphysical.  A  kink  has  infinite  curvature,  and  curvature  is  related 
to  energy,  so  it  can’t  be  infinite.  A  physically  realistic  wavefunction 
must  always  “tail  off”  gradually,  1/2.  In  classical  physics,  a  particle 
can  never  enter  a  region  in  which  its  interaction  energy  U  would  be 
greater  than  the  amount  of  energy  it  has  available.  But  in  quantum 
physics  the  wavefunction  will  always  have  a  tail  that  reaches  into 
the  classically  forbidden  region.  If  it  was  not  for  this  effect,  called 
tunneling,  the  fusion  reactions  that  power  the  sun  would  not  occur 
due  to  the  high  electrical  energy  nuclei  need  in  order  to  get  close 
together!  Tunneling  is  discussed  in  more  detail  in  the  following 
subsection. 


13.3.6  The  Schrodinger  equation 

In  subsection  13.3.5  we  were  able  to  apply  conservation  of  energy 
to  an  electron’s  wavefunction,  but  only  by  using  the  clumsy  graph¬ 
ical  technique  of  osculating  sine  waves  as  a  measure  of  the  wave’s 
curvature.  You  have  learned  a  more  convenient  measure  of  curvature 
in  calculus:  the  second  derivative.  To  relate  the  two  approaches,  we 
take  the  second  derivative  of  a  sine  wave: 


A 

dx2 


sin(27rx/A) 


_d 

dx 


27T  27tx\ 

Tcos—  ) 


27TX 

A 


Taking  the  second  derivative  gives  us  back  the  same  function, 
but  with  a  minus  sign  and  a  constant  out  in  front  that  is  related 
to  the  wavelength.  We  can  thus  relate  the  second  derivative  to  the 
osculating  wavelength: 
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[1] 


$  I' 

dx2 


2vr 

T 


2 


This  could  be  solved  for  A  in  terms  of  'll,  but  it  will  turn  out 
below  to  be  more  convenient  to  leave  it  in  this  form. 

Applying  this  to  conservation  of  energy,  we  have 


[2] 


E  =  K  +  U 

„2 

+  u 


p 

2m 

Ml! 

2m 


+  U 


Note  that  both  equation  [1]  and  equation  [2]  have  A2  in  the  denom¬ 
inator.  We  can  simplify  our  algebra  by  multiplying  both  sides  of 
equation  [2]  by  T  to  make  it  look  more  like  equation  [1]: 


E-m  =  + 


2m 


h  \  2  /  27t 


=  —  (  —  )  (w1)  ^  +  U •’F 
2m  \2ir  )  \  A 


1 

2m 


h 

2n 


cPT 

dx2 


+  U-V 


Further  simplification  is  achieved  by  using  the  symbol  h  (h  with  a 
slash  through  it,  read  “h-bar”)  as  an  abbreviation  for  h/2ir.  We  then 
have  the  important  result  known  as  the  Schrodinger  equation: 


E-V  =  - 


h2  cPt 

2m  dx2 


+  U-V 


(Actually  this  is  a  simplified  version  of  the  Schrodinger  equation, 
applying  only  to  standing  waves  in  one  dimension.)  Physically  it 
is  a  statement  of  conservation  of  energy.  The  total  energy  E  must 
be  constant,  so  the  equation  tells  us  that  a  change  in  interaction 
energy  U  must  be  accompanied  by  a  change  in  the  curvature  of 
the  wavefunction.  This  change  in  curvature  relates  to  a  change 
in  wavelength,  which  corresponds  to  a  change  in  momentum  and 
kinetic  energy. 

self-check  G 

Considering  the  assumptions  that  were  made  in  deriving  the  Schrodinger 
equation,  would  it  be  correct  to  apply  it  to  a  photon?  To  an  electron  mov¬ 
ing  at  relativistic  speeds?  >  Answer,  p. 
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Usually  we  know  right  off  the  bat  how  U  depends  on  x,  so  the 
basic  mathematical  problem  of  quantum  physics  is  to  find  a  function 
’I'(x)  that  satisfies  the  Schrodinger  equation  for  a  given  interaction- 
energy  function  U(x).  An  equation,  such  as  the  Schrodinger  equa¬ 
tion,  that  specifies  a  relationship  between  a  function  and  its  deriva¬ 
tives  is  known  as  a  differential  equation. 

The  detailed  study  of  the  solution  of  the  Schrodinger  equation 
is  beyond  the  scope  of  this  book,  but  we  can  gain  some  important 
insights  by  considering  the  easiest  version  of  the  Schrodinger  equa¬ 
tion,  in  which  the  interaction  energy  U  is  constant.  We  can  then 
rearrange  the  Schrodinger  equation  as  follows: 

<#  T  _  2 m(U  -E) 
dx2  h2 


which  boils  down  to 


cP  T 


where,  according  to  our  assumptions,  a  is  independent  of  x.  We  need 
to  find  a  function  whose  second  derivative  is  the  same  as  the  original 
function  except  for  a  multiplicative  constant.  The  only  functions 
with  this  property  are  sine  waves  and  exponentials: 


cP 

dx2 


[  q  sin(rx  +  s )  ] 


_cf^ 

dx2 


[qerx+s\ 


— qr 2  sin(rx  +  s) 


qr2erx+s 


The  sine  wave  gives  negative  values  of  a,  a  =  —  r2,  and  the 
exponential  gives  positive  ones,  a  =  r2.  The  former  applies  to  the 
classically  allowed  region  with  U  <  E. 

This  leads  us  to  a  quantitative  calculation  of  the  tunneling  ef¬ 
fect  discussed  briefly  in  the  preceding  subsection.  The  wavefunction 
evidently  tails  off  exponentially  in  the  classically  forbidden  region. 
Suppose,  as  shown  in  figure  n,  a  wave-particle  traveling  to  the  right 
encounters  a  barrier  that  it  is  classically  forbidden  to  enter.  Al¬ 
though  the  form  of  the  Schrodinger  equation  we’re  using  technically 
does  not  apply  to  traveling  waves  (because  it  makes  no  reference 
to  time),  it  turns  out  that  we  can  still  use  it  to  make  a  reasonable 
calculation  of  the  probability  that  the  particle  will  make  it  through 
the  barrier.  If  we  let  the  barrier’s  width  be  w,  then  the  ratio  of  the 
wavefunction  on  the  left  side  of  the  barrier  to  the  wavefunction  on 
the  right  is 

nprx+s 

V _  _  p—rw 

qer(x+w)+s 

Probabilities  are  proportional  to  the  squares  of  wavefunctions,  so 


U 


x 


n/ Tunneling  through  a  bar¬ 
rier.  (As  discussed  on  p.  864,  it  is 
actually  not  quite  correct  to  graph 
the  wavefunction  of  an  electron 
as  a  real  number  unless  it  is  a 
standing  wave,  which  isn’t  the 
case  here.) 
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the  probability  of  making  it  through  the  barrier  is 


u 


elec 


r 


o/The  electrical,  nuclear, 
and  total  interaction  energies  for 
an  alpha  particle  escaping  from  a 
nucleus. 


p  _  g— 2rw 

=  exp  -  E) 


self-check  H 

If  we  were  to  apply  this  equation  to  find  the  probability  that  a  person  can 
walk  through  a  wall,  what  would  the  small  value  of  Planck’s  constant 
imply?  >  Answer,  p.  969 

Tunneling  in  alpha  decay  example  1 8 

Naively,  we  would  expect  alpha  decay  to  be  a  very  fast  process. 
The  typical  speeds  of  neutrons  and  protons  inside  a  nucleus  are 
extremely  high  (see  problem  20).  If  we  imagine  an  alpha  particle 
coalescing  out  of  neutrons  and  protons  inside  the  nucleus,  then 
at  the  typical  speeds  we’re  talking  about,  it  takes  a  ridiculously 
small  amount  of  time  for  them  to  reach  the  surface  and  try  to 
escape.  Clattering  back  and  forth  inside  the  nucleus,  we  could 
imagine  them  making  a  vast  number  of  these  “escape  attempts” 
every  second. 

Consider  figure  o,  however,  which  shows  the  interaction  energy 
for  an  alpha  particle  escaping  from  a  nucleus.  The  electrical  en¬ 
ergy  is  kq-\q2/r  when  the  alpha  is  outside  the  nucleus,  while  its 
variation  inside  the  nucleus  has  the  shape  of  a  parabola,  as  a 
consequence  of  the  shell  theorem.  The  nuclear  energy  is  con¬ 
stant  when  the  alpha  is  inside  the  nucleus,  because  the  forces 
from  all  the  neighboring  neutrons  and  protons  cancel  out;  it  rises 
sharply  near  the  surface,  and  flattens  out  to  zero  over  a  distance 
of  ~  1  fm,  which  is  the  maximum  distance  scale  at  which  the 
strong  force  can  operate.  There  is  a  classically  forbidden  region 
immediately  outside  the  nucleus,  so  the  alpha  particle  can  only 
escape  by  quantum  mechanical  tunneling.  (It’s  true,  but  some¬ 
what  counterintuitive,  that  a  repulsive  electrical  force  can  make  it 
more  difficult  for  the  alpha  to  get  out.) 

In  reality,  alpha-decay  half-lives  are  often  extremely  long  —  some¬ 
times  billions  of  years  —  because  the  tunneling  probability  is  so 
small.  Although  the  shape  of  the  barrier  is  not  a  rectangle,  the 
equation  for  the  tunneling  probability  on  page  878  can  still  be 
used  as  a  rough  guide  to  our  thinking.  Essentially  the  tunneling 
probability  is  so  small  because  U  -  E  is  fairly  big,  typically  about 
30  MeV  at  the  peak  of  the  barrier. 
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The  correspondence  principle  for  E  >  U  example  19 

The  correspondence  principle  demands  that  in  the  classical  limit 
h  ->•  0,  we  recover  the  correct  result  for  a  particle  encountering 
a  barrier  U,  for  both  E  <  U  and  E  >  U.  The  E  <  U  case 
was  analyzed  in  self-check  H  on  p.  878.  In  the  remainder  of  this 
example,  we  analyze  E  >  U,  which  turns  out  to  be  a  little  trickier. 

The  particle  has  enough  energy  to  get  over  the  barrier,  and  the 
classical  result  is  that  it  continues  forward  at  a  different  speed  (a 
reduced  speed  if  U  >  0,  or  an  increased  one  if  U  <  0),  then  re¬ 
gains  its  original  speed  as  it  emerges  from  the  other  side.  What 
happens  quantum-mechanically  in  this  case?  We  would  like  to 
get  a  “tunneling”  probability  of  1  in  the  classical  limit.  The  expres¬ 
sion  derived  on  p.  878,  however,  doesn’t  apply  here,  because  it 
was  derived  under  the  assumption  that  the  wavefunction  inside 
the  barrier  was  an  exponential;  in  the  classically  allowed  case, 
the  barrier  isn’t  classically  forbidden,  and  the  wavefunction  inside 
it  is  a  sine  wave. 


p/A  particle  encounters  a 
step  of  height  U  <  E  in  the 
interaction  energy.  Both  sides  are 
classically  allowed.  A  reflected 
wave  exists,  but  is  not  shown  in 
the  figure. 


We  can  simplify  things  a  little  by  letting  the  width  w  of  the  barrier 
go  to  infinity.  Classically,  after  all,  there  is  no  possibility  that  the 
particle  will  turn  around,  no  matter  how  wide  the  barrier.  We  then 
have  the  situation  shown  in  figure  p.6 

The  analysis  is  the  same  as  for  any  other  wave  being  partially 
reflected  at  the  boundary  between  two  regions  where  its  velocity 
differs,  and  the  result  is  the  same  as  the  one  found  on  p.  371 .  The 
ratio  of  the  amplitude  of  the  reflected  wave  to  that  of  the  incident 
wave  is  Ft  =  (v2  -  v1)/(v2  +  v-|).  The  probability  of  reflection  is  R2. 
(Counterintuitively,  R2  is  nonzero  even  if  U  <  0,  i.e.,  v2  >  v-\ .) 

This  seems  to  violate  the  correspondence  principle.  There  is  no 
m  or  h  anywhere  in  the  result,  so  we  seem  to  have  the  result  that, 
even  classically,  the  marble  in  figure  q  can  be  reflected! 

The  solution  to  this  paradox  is  that  the  step  in  figure  p  was  taken 
to  be  completely  abrupt  —  an  idealized  mathematical  discontinu¬ 
ity.  Suppose  we  make  the  transition  a  little  more  gradual,  as  in 
figure  r.  As  shown  in  problem  17  on  p.  384,  this  reduces  the  am¬ 
plitude  with  which  a  wave  is  reflected.  By  smoothing  out  the  step 
more  and  more,  we  continue  to  reduce  the  probability  of  reflec¬ 
tion,  until  finally  we  arrive  at  a  barrier  shaped  like  a  smooth  ramp. 
More  detailed  calculations  show  that  this  results  in  zero  reflection 
in  the  limit  where  the  width  of  the  ramp  is  large  compared  to  the 
wavelength. 

6As  in  several  previous  examples,  we  cheat  by  representing  a  traveling  wave 
as  a  real-valued  function.  See  p.  864. 


P=0 


q  /  The  marble  has  zero  proba¬ 
bility  of  being  reflected  from  the 
edge  of  the  table.  (This  example 
has  U  <  0,  not  U  >  0  as  in 
figures  p  and  r). 


U 


x 

r/ Making  the  step  more  grad¬ 
ual  reduces  the  probability  of 
reflection. 
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Three  dimensions 


s/1.  The  one-dimensional 
version  of  the  Laplacian  is  the 
second  derivative.  It  is  positive 
here  because  the  average  of  the 
two  nearby  points  is  greater  than 
the  value  at  the  center.  2.  The 
Laplacian  of  the  function  A  in 
example  20  is  positive  because 
the  average  of  the  four  nearby 
points  along  the  perpendicular 
axes  is  greater  than  the  function’s 
value  at  the  center.  3.  V2C  =  0. 
The  average  is  the  same  as  the 
value  at  the  center. 


For  simplicity,  we’ve  been  considering  the  Schrodinger  equation 
in  one  dimension,  so  that  'F  is  only  a  function  of  x ,  and  has  units  of 
nr-1/2  rather  than  m~3/2.  Since  the  Schrodinger  equation  is  a  state¬ 
ment  of  conservation  of  energy,  and  energy  is  a  scalar,  the  general¬ 
ization  to  three  dimensions  isn’t  particularly  complicated.  The  total 
energy  term  E  ■  *F  and  the  interaction  energy  term  U  ■  T  involve  noth¬ 
ing  but  scalars,  and  don’t  need  to  be  changed  at  all.  In  the  kinetic 
energy  term,  however,  we’re  essentially  basing  our  computation  of 
the  kinetic  energy  on  the  squared  magnitude  of  the  momentum,  p2, 
and  in  three  dimensions  this  would  clearly  have  to  be  generalized  to 
Px  d~Py  +p2z-  The  obvious  way  to  achieve  this  is  to  replace  the  second 
derivative  <#  T/  dx2  with  the  sum  d2^ /dx2  +  d2'S>/dy2  +  d2'f>/d z2. 
Here  the  partial  derivative  symbol  <9,  introduced  on  page  216,  indi¬ 
cates  that  when  differentiating  with  respect  to  a  particular  variable, 
the  other  variables  are  to  be  considered  as  constants.  This  operation 
on  the  function  T  is  notated  V2T,  and  the  derivative- like  operator 
V2  =  d2 /dx2  +  d2 /dy2  +  d2/dz2  is  called  the  Laplacian.  It  occurs 
elswehere  in  physics.  For  example,  in  classical  electrostatics,  the 
voltage  in  a  region  of  vacuum  must  be  a  solution  of  the  equation 
V2V  =  0.  Like  the  second  derivative,  the  Laplacian  is  essentially  a 
measure  of  curvature.  Or,  as  shown  in  figure  s,  we  can  think  of  it 
as  a  measure  of  how  much  the  value  of  a  function  at  a  certain  point 
differs  from  the  average  of  its  value  on  nearby  points. 

Examples  of  the  Laplacian  in  two  dimensions  example  20 

>  Compute  the  Laplacians  of  the  following  functions  in  two  dimen¬ 
sions,  and  interpret  them:  A  =  x2  +  y2,  B  =  -x2  -  y2,  C  =  x2  -  y2. 

>  The  first  derivative  of  function  A  with  respect  to  x  is  dA/dx  = 
2x.  Since  y  is  treated  as  a  constant  in  the  computation  of  the 
partial  derivative  d/dx,  the  second  term  goes  away.  The  second 
derivative  of  A  with  respect  to  x  is  d2A/dx2  =  2.  Similarly  we  have 
d2A/dy2  =  2,  so  V2A  =  4. 

All  derivative  operators,  including  V2,  have  the  linear  property 
that  multiplying  the  input  function  by  a  constant  just  multiplies  the 
output  function  by  the  same  constant.  Since  B  =  -A,  and  we 
have  V26  =  -4. 

For  function  C,  the  x  term  contributes  a  second  derivative  of  2, 
but  the  y  term  contributes  -2,  so  V2C  =  0. 

The  interpretation  of  the  positive  sign  in  V2A  =  4  is  that  A’s  graph 
is  shaped  like  a  trophy  cup,  and  the  cup  is  concave  up.  v2e  < 
0  is  because  B  is  concave  down.  Function  C  is  shaped  like  a 
saddle.  Since  its  curvature  along  one  axis  is  concave  up,  but  the 
curvature  along  the  other  is  down  and  equal  in  magnitude,  the 
function  is  considered  to  have  zero  concavity  over  all. 
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A  classically  allowed  region  with  constant  U  example  21 

In  a  classically  allowed  region  with  constant  U,  we  expect  the 
solutions  to  the  Schrodinger  equation  to  be  sine  waves.  A  sine 
wave  in  three  dimensions  has  the  form 


W  =  sin  (kxx  +  kyy  +  kzz ) . 

When  we  compute  d2W/dx2,  double  differentiation  of  sin  gives 
-sin,  and  the  chain  rule  brings  out  a  factor  of  k2.  Applying  all 
three  second  derivative  operators,  we  get 

V2T  =  (^-k2  -  k2  -  k2^j  sin  ( kxx  +  kyy  +  kzz ) 

=  -(/C?  +  /C* +  **)*. 


The  Schrodinger  equation  gives 


h2  o 

E-W  =  -  —  V2¥+  U  -V 

2  m 

=  -^--(k2  +  k2  +  k2)w+U-W 
2m 

h2 


E  —  U  =  - —  (  ky  +  kf,  +  k‘ 
2m 


which  can  be  satisfied  since  we’re  in  a  classically  allowed  region 
with  E  -  U  >  0,  and  the  right-hand  side  is  manifestly  positive. 


Use  of  complex  numbers 

In  a  classically  forbidden  region,  a  particle’s  total  energy,  U  +  K, 
is  less  than  its  U,  so  its  K  must  be  negative.  If  we  want  to  keep  be¬ 
lieving  in  the  equation  K  =  p2 /2m,  then  apparently  the  momentum 
of  the  particle  is  the  square  root  of  a  negative  number.  This  is  a 
symptom  of  the  fact  that  the  Schrodinger  equation  fails  to  describe 
all  of  nature  unless  the  wavefunction  and  various  other  quantities 
are  allowed  to  be  complex  numbers.  In  particular  it  is  not  possible 
to  describe  traveling  waves  correctly  without  using  complex  wave- 
functions.  Complex  numbers  were  reviewed  in  subsection  10.5.5, 

p.  608. 

This  may  seem  like  nonsense,  since  real  numbers  are  the  only 
ones  that  are,  well,  real!  Quantum  mechanics  can  always  be  re¬ 
lated  to  the  real  world,  however,  because  its  structure  is  such  that 
the  results  of  measurements  always  come  out  to  be  real  numbers. 
For  example,  we  may  describe  an  electron  as  having  non-real  mo¬ 
mentum  in  classically  forbidden  regions,  but  its  average  momentum 
will  always  come  out  to  be  real  (the  imaginary  parts  average  out  to 
zero),  and  it  can  never  transfer  a  non- real  quantity  of  momentum 
to  another  particle. 


Section  1 3.3  Matter  As  a  Wave 


881 


t/1.  Oscillations  can  go  back 
and  forth,  but  it’s  also  possible 
for  them  to  move  along  a  path 
that  bites  its  own  tail,  like  a  cir¬ 
cle.  Photons  act  like  one,  elec¬ 
trons  like  the  other. 

2.  Back-and-forth  oscillations  can 
naturally  be  described  by  a  seg¬ 
ment  taken  from  the  real  num¬ 
ber  line,  and  we  visualize  the  cor¬ 
responding  type  of  wave  as  a 
sine  wave.  Oscillations  around  a 
closed  path  relate  more  naturally 
to  the  complex  number  system. 
The  complex  number  system  has 
rotation  built  into  its  structure, 
e.g.,  the  sequence  1,  /,  /'2,  /'3, 
. . .  rotates  around  the  unit  circle  in 
90-degree  increments. 

3.  The  double  slit  experiment  em¬ 
bodies  the  one  and  only  mystery 
of  quantum  physics.  Either  type 
of  wave  can  undergo  double-slit 
interference. 


CO 

1  1 

1 

1 

1  1 

A  complete  investigation  of  these  issues  is  beyond  the  scope  of 
this  book,  and  this  is  why  we  have  normally  limited  ourselves  to 
standing  waves,  which  can  be  described  with  real-valued  wavefunc- 
tions.  Figure  t  gives  a  visual  depiction  of  the  difference  between 
real  and  complex  wavefunctions.  The  following  remarks  may  also 
be  helpful. 

Neither  of  the  graphs  in  t/2  should  be  interpreted  as  a  path 
traveled  by  something.  This  isn’t  anything  mystical  about  quantum 
physics.  It’s  just  an  ordinary  fact  about  waves,  which  we  first  en¬ 
countered  in  subsection  6.1.1,  p.  344,  where  we  saw  the  distinction 
between  the  motion  of  a  wave  and  the  motion  of  a  wave  pattern.  In 
both  examples  in  t/2,  the  wave  pattern  is  moving  in  a  straight  line 
to  the  right. 

The  helical  graph  in  t/2  shows  a  complex  wavefunction  whose 
value  rotates  around  a  circle  in  the  complex  plane  with  a  frequency  / 
related  to  its  energy  by  E  =  hf.  As  it  does  so,  its  squared  magnitude 
|T|2  stays  the  same,  so  the  corresponding  probability  stays  constant. 
Which  direction  does  it  rotate?  This  direction  is  purely  a  matter 
of  convention,  since  the  distinction  between  the  symbols  i  and  —i  is 
arbitrary  —  both  are  equally  valid  as  square  roots  of  —1.  We  can, 
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for  example,  arbitrarily  say  that  electrons  with  positive  energies 
have  wavefunctions  whose  phases  rotate  counterclockwise,  and  as 
long  as  we  follow  that  rule  consistently  within  a  given  calculation, 
everything  will  work.  Note  that  it  is  not  possible  to  define  anything 
like  a  right-hand  rule  here,  because  the  complex  plane  shown  in  the 
right-hand  side  of  t/2  doesn’t  represent  two  dimensions  of  physical 
space;  unlike  a  screw  going  into  a  piece  of  wood,  an  electron  doesn’t 
have  a  direction  of  rotation  that  depends  on  its  direction  of  travel. 

Superposition  of  complex  wavefunctions  example  22 

>  The  right  side  of  figure  t/3  is  a  cartoonish  representation  of 
double-slit  interference;  it  depicts  the  situation  at  the  center,  where 
symmetry  guarantees  that  the  interference  is  constuctive.  Sup¬ 
pose  that  at  some  off-center  point,  the  two  wavefunctions  being 
superposed  are  W-\  =  b  and  ¥2  =  bi,  where  b  is  a  real  number 
with  units.  Compare  the  probability  of  finding  the  electron  at  this 
position  with  what  it  would  have  been  if  the  superposition  had 
been  purely  constructive,  b  +  b  =  2b. 

>  The  probability  per  unit  volume  is  proportional  to  the  square  of 
the  magnitude  of  the  total  wavefunction,  so  we  have 

^off center  _  \b+bi\2  _  12+12  _  1 
P center  '  \b  +  b\*  ~  2*  +  02  '  2' 

Figure  u  shows  a  method  for  visualizing  complex  wavefunctions. 
The  idea  is  to  use  colors  to  represent  complex  numbers,  accord¬ 
ing  to  the  arbitrary  convention  defined  in  figure  u/1.  Brightness 
indicates  magnitude,  and  the  rainbow  hue  shows  the  argument.  Be¬ 
cause  this  representation  can’t  be  understood  in  a  black  and  white 
printed  book,  the  figure  is  also  reproduced  on  the  back  cover  of 
printed  copies.  To  avoid  any  confusion,  note  that  the  use  of  rain¬ 
bow  colors  does  not  mean  that  we  are  representing  actual  visible 
light.  In  fact,  we  will  be  using  these  visual  conventions  to  represent 
the  wavefunctions  of  a  material  particle  such  as  an  electron.  It  is  ar¬ 
bitrary  that  we  use  red  for  positive  real  numbers  and  blue-green  for 
negative  numbers,  and  that  we  pick  a  handedness  for  the  diagram 
such  that  going  from  red  toward  yellow  means  going  counterclock¬ 
wise.  Although  physically  the  rainbow  is  a  linear  spectrum,  we  are 
not  representing  physical  colors  here,  and  we  are  exploiting  the  fact 
that  the  human  brain  tends  to  perceive  color  as  a  circle  rather  than 
a  line,  with  violet  and  red  being  perceptually  similar.  One  of  the 
limitations  of  this  representation  is  that  brightness  is  limited,  so  we 
can’t  represent  complex  numbers  with  arbitrarily  large  magnitudes. 


Figure  u/2  shows  a  traveling  wave  as  it  propagates  to  the  right. 
The  standard  convention  in  physics  is  that  for  a  wave  moving  in  a 
certain  direction,  the  phase  in  the  forward  direction  is  farther  coun¬ 
terclockwise  in  the  complex  plane,  and  you  can  verify  for  yourself 
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u  / 1 .  A  representation  of  complex 
numbers  using  color  and  bright¬ 
ness.  2.  A  wave  traveling  toward 
the  right.  3.  A  wave  traveling  to¬ 
ward  the  left.  4.  A  standing  wave 
formed  by  superposition  of  waves 
2  and  3.  5.  A  two-dimensional 
standing  wave.  6.  A  double-slit 
diffraction  pattern. 


that  this  is  the  case  by  comparing  with  the  convention  defined  by 
u/1.  The  function  being  plotted  here  is  T  =  elkx ,  where  k  =  2ir/\ 
is  the  spatial  analog  of  frequency,  with  an  extra  factor  of  2-7r  for 
convenience.  For  the  use  of  the  complex  exponential,  see  sec.  10.5.6, 
p  .610;  it  simply  represents  a  point  on  the  unit  circle  in  the  complex 
plane.  The  wavelength  A  is  a  constant  and  can  be  measured,  for 
example,  from  one  yellow  point  to  the  next.  The  wavelength  is  not 
different  at  different  points  on  the  figure,  because  we  are  using  the 
colors  merely  as  a  visual  encoding  of  the  complex  numbers  —  so, 
for  example,  a  red  point  on  the  figure  is  not  a  point  where  the  wave 
has  a  longer  wavelength  than  it  does  at  a  blue  point. 

Figure  u/3  represents  a  wave  travaling  to  the  left. 

Figure  u/4  shows  a  standing  wave  created  by  superimposing  the 
traveling  waves  from  u/2  and  u/3,  T4  =  (T2  +  ^3) /2.  (The  rea¬ 
son  for  the  factor  of  2  is  simply  that  otherwise  some  portions  of  ’F4 
would  have  magnitudes  too  great  to  be  represented  using  the  avail¬ 
able  range  of  brightness.)  All  points  on  this  wave  have  real  values, 
represented  by  red  and  blue-green.  We  made  the  superposition  real 
by  an  appropriate  choice  of  the  phases  of  ^2  and  T3.  This  is  always 
possible  to  do  when  we  have  a  standing  wave,  but  it  is  only  possible 
for  a  standing  wave,  and  this  is  the  reason  for  all  of  the  disclaimers 
in  the  captions  of  previous  figures  in  which  I  took  the  liberty  of 
representing  a  traveling  wave  as  a  sine- wave  graph. 

Figure  u/5  shows  a  two-dimensional  standing  wave  of  a  particle 
in  a  box,  and  u/6  shows  a  double-slit  interference  pattern.  (In  the 
latter,  I’ve  cheated  by  making  the  amplitude  of  the  wave  on  the 
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right-hand  half  of  the  picture  much  greater  than  it  would  actually 
be.) 

A  paradox  resolved  example  23 

Consider  the  following  paradox.  Suppose  we  have  an  electron 
that  is  traveling  wave,  and  its  wavefunction  looks  like  a  wave-train 
consisting  of  5  cycles  of  a  sine  wave.  Call  the  distance  between 
the  leading  and  trailing  edges  of  the  wave-train  L,  so  that  A  = 
L/5.  By  sketching  the  wave,  you  can  easily  check  that  there  are 
1 1  points  where  its  value  equals  zero.  Therefore  at  a  particular 
moment  in  time,  there  are  1 1  points  where  a  detector  has  zero 
probability  of  detecting  the  electron. 

But  now  consider  how  this  would  look  in  a  frame  of  reference 
where  the  electron  is  moving  more  slowly,  at  one  fifth  of  the  speed 
we  saw  in  the  original  frame.  In  this  frame,  L  is  the  same,  but  A 
is  five  times  greater,  because  A  =  h/p.  Therefore  in  this  frame 
we  see  only  one  cycle  in  the  wave-train.  Now  there  are  only  3 
points  where  the  probability  of  detection  is  zero.  But  how  can  this 
be?  All  observers,  regardless  of  their  frames  of  reference,  should 
agree  on  whether  a  particular  detector  detects  the  electron. 

The  resolution  to  this  paradox  is  that  it  starts  from  the  assumption 
that  we  can  depict  a  traveling  wave  as  a  real-valued  sine  wave, 
which  is  zero  in  certain  places.  Actually,  we  can’t.  It  has  to  be 
a  complex  number  with  a  rotating  phase  angle  in  the  complex 
plane,  as  in  figure  u/2,  and  a  constant  magnitude. 

Linearity  of  the  Schrodinger  equation 

Some  mathematical  relationships  and  operations  are  linear,  and 
some  are  not.  For  example,  2  x  (3  +  2)  is  the  same  as  2  x  3+2  x  2,  but 
\/l  +  1  /  y/l  +  y/l.  Differentiation  is  a  linear  operation,  (/  +  g )'  = 
/'  +  </.  The  Schrodinger  equation  is  built  out  of  derivatives,  so 
it  is  linear  as  well.  That  is,  if  Ti  and  \&2  are  both  solutions  of  the 
Schrodinger  equation,  then  so  is  \hi  +  'I,2.  Linearity  normally  implies 
linearity  with  respect  both  to  addition  and  to  multiplication  by  a 
scalar.  For  example,  if  T  is  a  solution,  then  so  is  T  +  T  +  T,  which 
is  the  same  as  34/. 

Linearity  guarantees  that  the  phase  of  a  wavefunction  makes  no 
difference  as  to  its  validity  as  a  solution  to  the  Schrodinger  equa¬ 
tion.  If  sin  kx  is  a  solution,  then  so  is  the  sine  wave  —  sin  kx  with 
the  opposite  phase.  This  fact  is  logically  interdependent  with  the 
fact  that,  as  discussed  on  p.  864,  the  phase  of  a  wavefunction  is  un¬ 
observable.  For  measuring  devices  and  humans  are  material  objects 
that  can  be  described  by  wavefunctions.  So  suppose,  for  example, 
that  we  flip  the  phase  of  all  the  particles  inside  the  entire  labora¬ 
tory.  By  linearity,  the  evolution  of  this  measurement  process  is  still 
a  valid  solution  of  the  Schrodinger  equation. 

The  Schrodinger  equation  is  a  wave  equation,  and  its  linearity 
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implies  that  the  waves  obey  the  principle  of  superposition.  In  most 
cases  in  nature,  we  find  that  the  principle  of  superposition  for  waves 
is  at  best  an  approximation.  For  example,  if  the  amplitude  of  a 
tsunami  is  so  huge  that  the  trough  of  the  wave  reaches  all  the  way 
down  to  the  ocean  floor,  exposing  the  rocks  and  sand  as  it  passes 
overhead,  then  clearly  there  is  no  way  to  double  the  amplitude  of  the 
wave  and  still  get  something  that  obeys  the  laws  of  physics.  Even 
at  less  extreme  amplitudes,  superposition  is  only  an  approximation 
for  water  waves,  and  so  for  example  it  is  only  approximately  true 
that  when  two  sets  of  ripples  intersect  on  the  surface  of  a  pond,  they 
pass  through  without  “seeing”  each  other. 

It  is  therefore  natural  to  ask  whether  the  apparent  linearity  of 
the  Schrodinger  equation  is  only  an  approximation  to  some  more 
precise,  nonlinear  theory.  This  is  not  currently  believed  to  be  the 
case.  If  we  are  to  make  sense  of  Schrodinger’s  cat  (p.  872),  then 
the  experimenter  who  sees  a  live  cat  and  the  one  who  sees  a  dead 
cat  must  remain  oblivious  to  their  other  selves,  like  the  ripples  on 
the  pond  that  intersect  without  “seeing”  each  other.  Attempts  to 
create  slightly  nonlinear  versions  of  standard  quantum  mechanics 
have  been  shown  to  have  implausible  physical  properties,  such  as 
allowing  the  propagation  of  signals  faster  than  c. 

If  you  have  had  a  course  in  linear  algebra,  then  it  is  worth  noting 
that  the  linearity  of  the  Schrodinger  equation  allows  us  to  talk  about 
its  solutions  as  vectors  in  a  vector  space.  For  example,  if  Ti  repre¬ 
sents  an  unstable  nucleus  that  has  not  yet  gamma  decayed,  and  T2 
is  its  state  after  the  decay,  then  any  superposition  aTi  + /?' F2,  with 
real  or  complex  coefficients  a  and  /?,  is  a  possible  wavefunction,  and 
we  can  notate  this  as  a  vector,  (a,/3),  in  a  two-dimensional  vector 
space. 

Discussion  Questions 

A  The  zero  level  of  interaction  energy  U  is  arbitrary,  e.g.,  it’s  equally 
valid  to  pick  the  zero  of  gravitational  energy  to  be  on  the  floor  of  your  lab 
or  at  the  ceiling.  Suppose  we’re  doing  the  double-slit  experiment,  t/3,  with 
electrons.  We  define  the  zero-level  of  U  so  that  the  total  energy  E  =  U+K 
of  each  electron  is  positive,  and  we  observe  a  certain  interference  pattern 
like  the  one  in  figure  i  on  p.  852.  What  happens  if  we  then  redefine  the 
zero-level  of  U  so  that  the  electrons  have  E  <  0? 

B  The  figure  shows  a  series  of  snapshots  in  the  motion  of  two  pulses 
on  a  coil  spring,  one  negative  and  one  positive,  as  they  move  toward  one 
another  and  superpose.  The  final  image  is  very  close  to  the  moment  at 
which  the  two  pulses  cancel  completely.  The  following  discussion  is  sim¬ 
pler  if  we  consider  infinite  sine  waves  rather  than  pulses.  How  can  the 
cancellation  of  two  such  mechanical  waves  be  reconciled  with  conserva¬ 
tion  of  energy?  What  about  the  case  of  colliding  electromagnetic  waves? 

Quantum-mechanically,  the  issue  isn’t  conservation  of  energy,  it’s  conser¬ 
vation  of  probability,  i.e.,  if  there’s  initially  a  1 00%  probability  that  a  particle 
exists  somewhere,  we  don’t  want  the  probability  to  be  more  than  or  less 
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than  100%  at  some  later  time.  What  happens  when  the  colliding  waves 
have  real-valued  wavefunctions  4'?  Complex  ones?  What  happens  with 
standing  waves? 

C  The  figure  shows  a  skateboarder  tipping  over  into  a  swimming 
pool  with  zero  initial  kinetic  energy.  There  is  no  friction,  the  corners  are 
smooth  enough  to  allow  the  skater  to  pass  over  the  smoothly,  and  the 
vertical  distances  are  small  enough  so  that  negligible  time  is  required  for 
the  vertical  parts  of  the  motion.  The  pool  is  divided  into  a  deep  end  and  a 
shallow  end.  Their  widths  are  equal.  The  deep  end  is  four  times  deeper. 
(1)  Classically,  compare  the  skater’s  velocity  in  the  left  and  right  regions, 
and  infer  the  probability  of  finding  the  skater  in  either  of  the  two  halves  if 
an  observer  peeks  at  a  random  moment.  (2)  Quantum-mechanically,  this 
could  be  a  one-dimensional  model  of  an  electron  shared  between  two 
atoms  in  a  diatomic  molecule.  Compare  the  electron’s  kinetic  energies, 
momenta,  and  wavelengths  in  the  two  sides.  For  simplicity,  let’s  assume 
that  there  is  no  tunneling  into  the  classically  forbidden  regions.  What  is 
the  simplest  standing-wave  pattern  that  you  can  draw,  and  what  are  the 
probabilities  of  finding  the  electron  in  one  side  or  the  other?  Does  this 
obey  the  correspondence  principle? 
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13.4  The  Atom 

You  can  learn  a  lot  by  taking  a  car  engine  apart,  but  you  will  have 
learned  a  lot  more  if  you  can  put  it  all  back  together  again  and  make 
it  run.  Half  the  job  of  reductionism  is  to  break  nature  down  into 
its  smallest  parts  and  understand  the  rules  those  parts  obey.  The 
second  half  is  to  show  how  those  parts  go  together,  and  that  is  our 
goal  in  this  chapter.  We  have  seen  how  certain  features  of  all  atoms 
can  be  explained  on  a  generic  basis  in  terms  of  the  properties  of 
bound  states,  but  this  kind  of  argument  clearly  cannot  tell  us  any 
details  of  the  behavior  of  an  atom  or  explain  why  one  atom  acts 
differently  from  another. 

The  biggest  embarrassment  for  reductionists  is  that  the  job  of 
putting  things  back  together  job  is  usually  much  harder  than  the 
taking  them  apart.  Seventy  years  after  the  fundamentals  of  atomic 
physics  were  solved,  it  is  only  beginning  to  be  possible  to  calcu¬ 
late  accurately  the  properties  of  atoms  that  have  many  electrons. 
Systems  consisting  of  many  atoms  are  even  harder.  Supercomputer 
manufacturers  point  to  the  folding  of  large  protein  molecules  as  a 
process  whose  calculation  is  just  barely  feasible  with  their  fastest 
machines.  The  goal  of  this  chapter  is  to  give  a  gentle  and  visually 
oriented  guide  to  some  of  the  simpler  results  about  atoms. 

13.4.1  Classifying  states 

We’ll  focus  our  attention  first  on  the  simplest  atom,  hydrogen, 
with  one  proton  and  one  electron.  We  know  in  advance  a  little  of 
what  we  should  expect  for  the  structure  of  this  atom.  Since  the 
electron  is  bound  to  the  proton  by  electrical  forces,  it  should  dis¬ 
play  a  set  of  discrete  energy  states,  each  corresponding  to  a  certain 
standing  wave  pattern.  We  need  to  understand  what  states  there 
are  and  what  their  properties  are. 
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What  properties  should  we  use  to  classify  the  states?  The  most 
sensible  approach  is  to  used  conserved  quantities.  Energy  is  one 
conserved  quantity,  and  we  already  know  to  expect  each  state  to 
have  a  specific  energy.  It  turns  out,  however,  that  energy  alone  is 
not  sufficient.  Different  standing  wave  patterns  of  the  atom  can 
have  the  same  energy. 

Momentum  is  also  a  conserved  quantity,  but  it  is  not  particularly 
appropriate  for  classifying  the  states  of  the  electron  in  a  hydrogen 
atom.  The  reason  is  that  the  force  between  the  electron  and  the  pro¬ 
ton  results  in  the  continual  exchange  of  momentum  between  them. 
(Why  wasn’t  this  a  problem  for  energy  as  well?  Kinetic  energy  and 
momentum  are  related  by  K  =  p2 /2m,  so  the  much  more  massive 
proton  never  has  very  much  kinetic  energy.  We  are  making  an  ap¬ 
proximation  by  assuming  all  the  kinetic  energy  is  in  the  electron, 
but  it  is  quite  a  good  approximation.) 

Angular  momentum  does  help  with  classification.  There  is  no 
transfer  of  angular  momentum  between  the  proton  and  the  electron, 
since  the  force  between  them  is  a  center-to-center  force,  producing 
no  torque. 

Like  energy,  angular  momentum  is  quantized  in  quantum  physics. 
As  an  example,  consider  a  quantum  wave-particle  confined  to  a  cir¬ 
cle,  like  a  wave  in  a  circular  moat  surrounding  a  castle.  A  sine 
wave  in  such  a  “quantum  moat”  cannot  have  any  old  wavelength, 
because  an  integer  number  of  wavelengths  must  fit  around  the  cir¬ 
cumference,  C,  of  the  moat.  The  larger  this  integer  is,  the  shorter 
the  wavelength,  and  a  shorter  wavelength  relates  to  greater  momen¬ 
tum  and  angular  momentum.  Since  this  integer  is  related  to  angular 
momentum,  we  use  the  symbol  i  for  it: 

A  =  C/t 


The  angular  momentum  is 


L  =  rp. 


Here,  r  =  C / 2tt,  and  p  =  h/\  =  hi/C,  so 


l-£. 

2t r 

=  —i 
2ir 


hi 

~C 


In  the  example  of  the  quantum  moat,  angular  momentum  is  quan¬ 
tized  in  units  of  h/2-K.  This  makes  h/2ir  a  pretty  important  number, 
so  we  define  the  abbreviation  h  =  h/2ir.  This  symbol  is  read  “h- 
bar.” 

In  fact,  this  is  a  completely  general  fact  in  quantum  physics,  not 
just  a  fact  about  the  quantum  moat: 


a/1.  Eight  wavelengths  fit 
around  this  circle  {i  =  8).  This  is 
a  standing  wave.  2.  A  traveling 
wave  with  t  =  8,  depicted  ac¬ 
cording  to  the  color  conventions 
defined  in  figure  u,  p.  884. 
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Quantization  of  angular  momentum 

The  angular  momentum  of  a  particle  due  to  its  motion  through 
space  is  quantized  in  units  of  h. 


self-check  I 


What  is  the  angular  momentum  of  the  wavefunction  shown  at  the  be¬ 
ginning  of  the  section?  o  Answer,  p. 

969 
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b  /  Reconciling  the  uncertainty 
principle  with  the  definition  of 
angular  momentum. 


13.4.2  Three  dimensions 

Our  discussion  of  quantum-mechanical  angular  momentum  has 
so  far  been  limited  to  rotation  in  a  plane,  for  which  we  can  sim¬ 
ply  use  positive  and  negative  signs  to  indicate  clockwise  and  coun¬ 
terclockwise  directions  of  rotation.  A  hydrogen  atom,  however,  is 
unavoidably  three-dimensional.  The  classical  treatment  of  angular 
momentum  in  three-dimensions  has  been  presented  in  section  4.3;  in 
general,  the  angular  momentum  of  a  particle  is  defined  as  the  vector 
cross  product  r  x  p. 

There  is  a  basic  problem  here:  the  angular  momentum  of  the 
electron  in  a  hydrogen  atom  depends  on  both  its  distance  r  from 
the  proton  and  its  momentum  p,  so  in  order  to  know  its  angular 
momentum  precisely  it  would  seem  we  would  need  to  know  both 
its  position  and  its  momentum  simultaneously  with  good  accuracy. 
This,  however,  seems  forbidden  by  the  Heisenberg  uncertainty  prin¬ 
ciple. 

Actually  the  uncertainty  principle  does  place  limits  on  what  can 
be  known  about  a  particle’s  angular  momentum  vector,  but  it  does 
not  prevent  us  from  knowing  its  magnitude  as  an  exact  integer  mul¬ 
tiple  of  h.  The  reason  is  that  in  three  dimensions,  there  are  really 
three  separate  uncertainty  principles: 

ApxAx  >  h 
ApyAy  >  h 
ApzAz  >  h 

Now  consider  a  particle,  b/1,  that  is  moving  along  the  x  axis  at 
position  x  and  with  momentum  px.  We  may  not  be  able  to  know 
both  x  and  px  with  unlimited  accurately,  but  we  can  still  know  the 
particle’s  angular  momentum  about  the  origin  exactly:  it  is  zero, 
because  the  particle  is  moving  directly  away  from  the  origin. 

Suppose,  on  the  other  hand,  a  particle  finds  itself,  b/2,  at  a 
position  x  along  the  x  axis,  and  it  is  moving  parallel  to  the  y  axis 
with  momentum  py.  It  has  angular  momentum  xpy  about  the  z 
axis,  and  again  we  can  know  its  angular  momentum  with  unlimited 
accuracy,  because  the  uncertainty  principle  only  relates  x  to  px  and 
y  to  py.  It  does  not  relate  x  to  py. 


890 


Chapter  13  Quantum  Physics 


As  shown  by  these  examples,  the  uncertainty  principle  does 
not  restrict  the  accuracy  of  our  knowledge  of  angular  momenta  as 
severely  as  might  be  imagined.  However,  it  does  prevent  us  from 
knowing  all  three  components  of  an  angular  momentum  vector  si¬ 
multaneously.  The  most  general  statement  about  this  is  the  follow¬ 
ing  theorem: 


The  angular  momentum  vector  in  quantum  physics 

The  most  that  can  be  known  about  a  (nonzero)  orbital  angular 
momentum  vector  is  its  magnitude  and  one  of  its  three  vector  com¬ 
ponents.  Both  are  quantized  in  units  of  h. 


To  see  why  this  is  true,  consider  the  example  wavefunction  shown 
in  figure  c.  This  is  the  like  the  quantum  moat  of  figure  a,  p.  889, 
but  extended  to  one  more  dimension.  If  we  slice  the  sphere  in  any 
plane  perpendicular  to  the  z  axis,  we  get  an  8-cycle  circular  rainbow 
exactly  like  figure  a.  This  is  required  because  Lz  =  8 h.  But  if  we 
take  a  slice  perpendicular  to  some  other  axis,  such  as  the  y  axis,  we 
don’t  get  a  circular  rainbow  as  we  would  for  a  state  with  a  definite 
value  of  Ly.  It  is  obviously  not  possible  to  get  circular  rainbows  for 
slices  perpendicular  to  more  than  one  axis. 

For  those  with  a  taste  for  rigor,  here  is  a  complete  argument: 

Theorem:  On  the  sphere,  if  a  wavefunction  has  definite  values  of 
both  Lz  and  Lx,  then  it  is  a  wavefunction  that  is  constant  every¬ 
where,  so  L  =  0. 

Lemma  1:  If  component  of  l a  along  a  certain  axis  A  has  a  definite 
value  and  is  nonzero,  then  (a)  T  =  0  at  the  poles,  and  (b)  T  is  of 
the  form  Ae on  any  circle  in  a  plane  perpendicular  to  the  axis. 
Part  a  holds  because  L  =  0  if  r±  =  0.  For  b,  see  p.  889. 

Lemma  2:  If  the  component  of  L  along  a  certain  axis  has  a  definite 
value  and  is  zero,  then  T  is  constant  in  any  plane  perpendicular  to 
that  axis.  This  follows  from  lemma  1  in  the  case  where  Ia  =  0. 

Case  I:  tz  and  lx  are  both  nonzero.  We  have  T  =  0  at  the  poles 
along  both  the  x  axis  and  the  z  axis.  The  z-axis  pole  is  a  point 
on  the  great  circle  perpendicular  to  the  x  axis,  and  vice  versa,  so 
applying  lb,  A  =  0  and  'L  vanishes  on  both  of  these  great  circles. 
But  now  if  we  apply  lb  along  any  slice  perpendicular  to  either  axis, 
we  get  T  =  0  everywhere  on  that  slice,  so  *P  =  0  everywhere. 

Case  II:  iz  and  tx  are  both  zero.  By  lemma  2,  f  is  a  constant 
everywhere. 

Case  III:  One  component  is  zero  and  the  other  nonzero.  Let  lz  be 
the  one  that  is  zero.  By  la,  T  =  0  at  the  x-axis  pole,  so  by  2,  T  =  0 
on  the  great  circle  perpendicular  to  z.  But  then  lb  tells  us  that 
T  =  0  everywhere. 


c  /  A  wavefunction  on  the  sphere 
with  |L|  =  1 1fi  and  Lz  =  8 h,  shown 
using  the  color  conventions 
defined  in  figure  u,  p.  884. 
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d  /  Example  24. 


Completeness  example  24 

For  a  given  value  of  l,  consider  the  set  of  angular  momentum 
states  with  |L|  =M  and  the  various  possible  values  of  the  angular 
momentum’s  component  along  some  fixed  axis.  This  set  of  states 
is  complete,  meaning  that  they  encompass  all  the  possible  states 
with  this  l. 

For  example,  figure  d  shows  wavefunctions  with  t  =  1  that  are 
solutions  of  the  Schrodinger  equation  for  a  particle  that  is  con¬ 
fined  to  the  surface  of  a  sphere.  Although  the  formulae  for  these 
wavefunctions  are  not  particularly  complicated,7  they  are  not  our 
main  focus  here,  so  to  help  with  getting  a  feel  for  the  idea  of  com¬ 
pleteness,  I  have  simply  selected  three  points  on  the  sphere  at 
which  to  give  numerical  samples  of  the  value  of  the  wavefunction. 
These  are  the  top  (where  the  sphere  is  intersected  by  the  pos¬ 
itive  z  axis),  left  (x),  and  front  (y).  (Although  the  wavefunctions 
are  shown  using  the  color  conventions  defined  in  figure  u,  p.  884, 
these  numerical  samples  should  make  the  example  understand¬ 
able  if  you’re  looking  at  a  black  and  white  copy  of  the  book.) 

Suppose  we  arbitrarily  choose  the  z  axis  as  the  one  along  which 
to  quantize  the  component  of  the  angular  momentum.  With  this 
choice,  we  have  three  possible  values  for  tz\  -1 , 0,  and  1 .  These 
three  states  are  shown  in  the  three  boxes  surrounded  by  the  black 
rectangle.  This  set  of  three  states  is  complete. 

Consider,  for  example,  the  fourth  state,  shown  on  the  right  outside 
the  box.  This  state  is  clearly  identifiable  as  a  copy  of  the  4  =  0 
state,  rotated  by  90  degrees  counterclockwise,  so  it  is  the  4  = 
0  state.  We  might  imagine  that  this  would  be  an  entirely  new 
prize  to  be  added  to  our  stamp  collection.  But  it  is  actually  not 
a  state  that  we  didn’t  possess  before.  We  can  obtain  it  as  the 
sum  of  the  4  =  -1  and  4  =  1  states,  divided  by  an  appropriate 
normalization  factor.  Although  I’m  avoiding  making  this  example 
an  exercise  in  manipulating  formulae,  it  is  easy  to  check  that  the 
sum  does  work  out  properly  at  the  three  sample  points. 


7They  are  ¥i ,_i  =  sinGe-'*,  Ti0  =  v^COSG,  and  Vn  =  sinGe'^,  wher  G  is 
the  angle  measured  down  from  the  Z  axis,  and  4>  is  the  angle  running  counter¬ 
clockwise  around  the  Z  axis.  These  functions  are  called  spherical  harmonics. 
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e  /  A  cross-section  of  a  hydrogen  wavefunction. 


13.4.3  The  hydrogen  atom 

Deriving  the  wavefunctions  of  the  states  of  the  hydrogen  atom 
from  first  principles  would  be  mathematically  too  complex  for  this 
book,  but  it’s  not  hard  to  understand  the  logic  behind  such  a  wave- 
function  in  visual  terms.  Consider  the  wavefunction  from  the  begin¬ 
ning  of  the  section,  which  is  reproduced  in  figure  e.  Although  the 
graph  looks  three-dimensional,  it  is  really  only  a  representation  of 
the  part  of  the  wavefunction  lying  within  a  two-dimensional  plane. 
The  third  (up-down)  dimension  of  the  plot  represents  the  value  of 
the  wavefunction  at  a  given  point,  not  the  third  dimension  of  space. 
The  plane  chosen  for  the  graph  is  the  one  perpendicular  to  the  an¬ 
gular  momentum  vector. 

Each  ring  of  peaks  and  valleys  has  eight  wavelengths  going  around 
in  a  circle,  so  this  state  has  L  =  8 h,  i.e. ,  we  label  it  l  =  8.  The  wave¬ 
length  is  shorter  near  the  center,  and  this  makes  sense  because  when 
the  electron  is  close  to  the  nucleus  it  has  a  lower  electrical  energy, 
a  higher  kinetic  energy,  and  a  higher  momentum. 

Between  each  ring  of  peaks  in  this  wavefunction  is  a  nodal  cir¬ 
cle,  i.e.,  a  circle  on  which  the  wavefunction  is  zero.  The  full  three- 
dimensional  wavefunction  has  nodal  spheres:  a  series  of  nested  spher¬ 
ical  surfaces  on  which  it  is  zero.  The  number  of  radii  at  which  nodes 
occur,  including  r  =  oo,  is  called  n,  and  n  turns  out  to  be  closely 
related  to  energy.  The  ground  state  has  n  =  1  (a  single  node  only 
at  r  =  oo),  and  higher-energy  states  have  higher  n  values.  There 
is  a  simple  equation  relating  n  to  energy,  which  we  will  discuss  in 
subsection  13.4.4. 

The  numbers  n  and  £,  which  identify  the  state,  are  called  its 


n=3 


A 


n=2 


> 

E? 

CD 

C 

CD 


n=l 

f  /  The  energy  of  a  state  in 
the  hydrogen  atom  depends  only 
on  its  n  quantum  number. 
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quantum  numbers.  A  state  of  a  given  n  and  £  can  be  oriented 
in  a  variety  of  directions  in  space.  We  might  try  to  indicate  the 
orientation  using  the  three  quantum  numbers  £x  =  Lx/h,  £y  =  Ly/K , 
and  iz  =  Lz/h.  But  we  have  already  seen  that  it  is  impossible  to 
know  all  three  of  these  simultaneously.  To  give  the  most  complete 
possible  description  of  a  state,  we  choose  an  arbitrary  axis,  say  the 
2  axis,  and  label  the  state  according  to  n,  £,  and  £z.8 

Angular  momentum  requires  motion,  and  motion  implies  kinetic 
energy.  Thus  it  is  not  possible  to  have  a  given  amount  of  angular 
momentum  without  having  a  certain  amount  of  kinetic  energy  as 
well.  Since  energy  relates  to  the  n  quantum  number,  this  means 
that  for  a  given  n  value  there  will  be  a  maximum  possible  .  It  turns 
out  that  this  maximum  value  of  equals  n  —  1. 

In  general,  we  can  list  the  possible  combinations  of  quantum 
numbers  as  follows: 
n  can  equal  1,2,3,... 

£  can  range  from  0  to  n  —  1,  in  steps  of  1 
£z  can  range  from  —  £  to  £,  in  steps  of  1 

Applying  these  rules,  we  have  the  following  list  of  states: 


o 

o' 

II 

t-H 

ii 

£ 

one  state 

n  =  2,  £  =  0,  4  =  0 

one  state 

n  =  2,  £  =  1,  4  =  —  1,  0,  or  1 

three  states 

self-check  J 

Continue  the  list  for  n  =  3.  >  Answer,  p.  969 

Figure  g  on  page  895  shows  the  lowest-energy  states  of  the  hy¬ 
drogen  atom.  The  left-hand  column  of  graphs  displays  the  wave- 
functions  in  the  x  —  y  plane,  and  the  right-hand  column  shows  the 
probability  distribution  in  a  three-dimensional  representation. 


Discussion  Questions 

A  The  quantum  number  n  is  defined  as  the  number  of  radii  at  which 
the  wavefunction  is  zero,  including  r  =  oo.  Relate  this  to  the  features  of 
figure  g. 

B  Based  on  the  definition  of  n,  why  can’t  there  be  any  such  thing  as 
an  n  =  0  state? 

C  Relate  the  features  of  the  wavefunction  plots  in  figure  g  to  the 
corresponding  features  of  the  probability  distribution  pictures. 

D  How  can  you  tell  from  the  wavefunction  plots  in  figure  g  which  ones 
have  which  angular  momenta? 

E  Criticize  the  following  incorrect  statement:  “The  l  =  8  wavefunction 


8  See  page  902  for  a  note  about  the  two  different  systems  of  notations  that 
are  used  for  quantum  numbers. 
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in  figure  e  has  a  shorter  wavelength  in  the  center  because  in  the  center 
the  electron  is  in  a  higher  energy  level.” 

F  Discuss  the  implications  of  the  fact  that  the  probability  cloud  in  of  the 
n  =  2,  t  =  1  state  is  split  into  two  parts. 

13.4.4  Energies  of  states  in  hydrogen 

History 

The  experimental  technique  for  measuring  the  energy  levels  of 
an  atom  accurately  is  spectroscopy:  the  study  of  the  spectrum  of 
light  emitted  (or  absorbed)  by  the  atom.  Only  photons  with  certain 
energies  can  be  emitted  or  absorbed  by  a  hydrogen  atom,  for  ex- 
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ample,  since  the  amount  of  energy  gained  or  lost  by  the  atom  must 
equal  the  difference  in  energy  between  the  atom’s  initial  and  final 
states.  Spectroscopy  had  become  a  highly  developed  art  several 
decades  before  Einstein  even  proposed  the  photon,  and  the  Swiss 
spectroscopist  Johann  Balrner  determined  in  1885  that  there  was  a 
simple  equation  that  gave  all  the  wavelengths  emitted  by  hydrogen. 
In  modern  terms,  we  think  of  the  photon  wavelengths  merely  as  in¬ 
direct  evidence  about  the  underlying  energy  levels  of  the  atom,  and 
we  rework  Balmer’s  result  into  an  equation  for  these  atomic  energy 
levels: 

w  2.2  x  1(T18  J 

—  9  ? 

nz 

This  energy  includes  both  the  kinetic  energy  of  the  electron  and 
the  electrical  energy.  The  zero-level  of  the  electrical  energy  scale 
is  chosen  to  be  the  energy  of  an  electron  and  a  proton  that  are 
infinitely  far  apart.  With  this  choice,  negative  energies  correspond 
to  bound  states  and  positive  energies  to  unbound  ones. 

Where  does  the  mysterious  numerical  factor  of  2.2  x  10-18  J  come 
from?  In  1913  the  Danish  theorist  Niels  Bohr  realized  that  it  was 
exactly  numerically  equal  to  a  certain  combination  of  fundamental 
physical  constants: 

mk2e 4  1 

En  =  2ft?~  '  n?r 

where  m  is  the  mass  of  the  electron,  and  k  is  the  Coulomb  force 
constant  for  electric  forces. 

Bohr  was  able  to  cook  up  a  derivation  of  this  equation  based  on 
the  incomplete  version  of  quantum  physics  that  had  been  developed 
by  that  time,  but  his  derivation  is  today  mainly  of  historical  interest. 
It  assumes  that  the  electron  follows  a  circular  path,  whereas  the 
whole  concept  of  a  path  for  a  particle  is  considered  meaningless  in 
our  more  complete  modern  version  of  quantum  physics.  Although 
Bohr  was  able  to  produce  the  right  equation  for  the  energy  levels, 
his  model  also  gave  various  wrong  results,  such  as  predicting  that 
the  atom  would  be  flat,  and  that  the  ground  state  would  have  i  =  1 
rather  than  the  correct  t  =  0. 

Approximate  treatment 

Rather  than  leaping  straight  into  a  full  mathematical  treatment, 
we’ll  start  by  looking  for  some  physical  insight,  which  will  lead  to 
an  approximate  argument  that  correctly  reproduces  the  form  of  the 
Bohr  equation. 

A  typical  standing-wave  pattern  for  the  electron  consists  of  a 
central  oscillating  area  surrounded  by  a  region  in  which  the  wave- 
function  tails  off.  As  discussed  in  subsection  13.3.6,  the  oscillating 
type  of  pattern  is  typically  encountered  in  the  classically  allowed 
region,  while  the  tailing  off  occurs  in  the  classically  forbidden  re¬ 
gion  where  the  electron  has  insufficient  kinetic  energy  to  penetrate 
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according  to  classical  physics.  We  use  the  symbol  r  for  the  radius 
of  the  spherical  boundary  between  the  classically  allowed  and  clas¬ 
sically  forbidden  regions.  Classically,  r  would  be  the  distance  from 
the  proton  at  which  the  electron  would  have  to  stop,  turn  around, 
and  head  back  in. 


If  r  had  the  same  value  for  every  standing-wave  pattern,  then 
we’d  essentially  be  solving  the  particle-in-a-box  problem  in  three 
dimensions,  with  the  box  being  a  spherical  cavity.  Consider  the 
energy  levels  of  the  particle  in  a  box  compared  to  those  of  the  hy¬ 
drogen  atom,  h.  They’re  qualitatively  different.  The  energy  levels 
of  the  particle  in  a  box  get  farther  and  farther  apart  as  we  go  higher 
in  energy,  and  this  feature  doesn’t  even  depend  on  the  details  of 
whether  the  box  is  two-dimensional  or  three-dimensional,  or  its  ex¬ 
act  shape.  The  reason  for  the  spreading  is  that  the  box  is  taken  to 
be  completely  impenetrable,  so  its  size,  r,  is  fixed.  A  wave  pattern 
with  n  humps  has  a  wavelength  proportional  to  r/n,  and  therefore  a 
momentum  proportional  to  n,  and  an  energy  proportional  to  v? .  In 
the  hydrogen  atom,  however,  the  force  keeping  the  electron  bound 
isn’t  an  infinite  force  encountered  when  it  bounces  off  of  a  wall,  it’s 
the  attractive  electrical  force  from  the  nucleus.  If  we  put  more  en¬ 
ergy  into  the  electron,  it’s  like  throwing  a  ball  upward  with  a  higher 
energy  —  it  will  get  farther  out  before  coming  back  down.  This 
means  that  in  the  hydrogen  atom,  we  expect  r  to  increase  as  we  go 
to  states  of  higher  energy.  This  tends  to  keep  the  wavelengths  of 
the  high  energy  states  from  getting  too  short,  reducing  their  kinetic 
energy.  The  closer  and  closer  crowding  of  the  energy  levels  in  hydro¬ 
gen  also  makes  sense  because  we  know  that  there  is  a  certain  energy 
that  would  be  enough  to  make  the  electron  escape  completely,  and 
therefore  the  sequence  of  bound  states  cannot  extend  above  that 
energy. 

When  the  electron  is  at  the  maximum  classically  allowed  distance 
r  from  the  proton,  it  has  zero  kinetic  energy.  Thus  when  the  electron 
is  at  distance  r,  its  energy  is  purely  electrical: 

Up 2 

m  e  =  ~— 


Now  comes  the  approximation.  In  reality,  the  electron’s  wavelength 
cannot  be  constant  in  the  classically  allowed  region,  but  we  pretend 
that  it  is.  Since  n  is  the  number  of  nodes  in  the  wavefunction,  we 
can  interpret  it  approximately  as  the  number  of  wavelengths  that 
fit  across  the  diameter  2 r.  We  are  not  even  attempting  a  derivation 
that  would  produce  all  the  correct  numerical  factors  like  2  and  7r 
and  so  on,  so  we  simply  make  the  approximation 

r 

[2]  A  ~  . 

n 

Finally  we  assume  that  the  typical  kinetic  energy  of  the  electron  is 
on  the  same  order  of  magnitude  as  the  absolute  value  of  its  total 


particle  in  a  box  hydrogen  atom 

h  /  The  energy  levels  of  a 
particle  in  a  box,  contrasted  with 
those  of  the  hydrogen  atom. 
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energy.  (This  is  true  to  within  a  factor  of  two  for  a  typical  classical 
system  like  a  planet  in  a  circular  orbit  around  the  sun.)  We  then 
have 


[3]  absolute  value  of  total  energy 

fee2 

r 

~  K 
=  p2 /2m 
=  (h/X)2 /2m 

i  2  2  /q  2 

~  h  n  /  Zmr 


We  now  solve  the  equation  ke2/r  ~  li2n2 /2mr2  for  r  and  throw 
away  numerical  factors  we  can’t  hope  to  have  gotten  right,  yielding 

h2n 2 
mke 2 


Plugging  n  =  1  into  this  equation  gives  r  =  2  nm,  which  is  indeed 
on  the  right  order  of  magnitude.  Finally  we  combine  equations  [4] 
and  [1]  to  find 


E  ~ 


mk2e4 

h2n2 


which  is  correct  except  for  the  numerical  factors  we  never  aimed  to 
find. 


Exact  treatment  of  the  ground  state 

The  general  proof  of  the  Bohr  equation  for  all  values  of  n  is 
beyond  the  mathematical  scope  of  this  book,  but  it’s  fairly  straight¬ 
forward  to  verify  it  for  a  particular  n,  especially  given  a  lucky  guess 
as  to  what  functional  form  to  try  for  the  wavefunction.  The  form 
that  works  for  the  ground  state  is 

g,  =  ue~r/a, 


where  r  =  \/ x2  +  y2  +  z2  is  the  electron’s  distance  from  the  pro¬ 
ton,  and  u  provides  for  normalization.  In  the  following,  the  result 
dr/dx  =  x/r  comes  in  handy.  Computing  the  partial  derivatives 
that  occur  in  the  Laplacian,  we  obtain  for  the  x  term 


<9T 

dx 


d 2T 
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d'ft  dr 
dr  dx 
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so 


V2^=  +  4)  *■ 

\  ar  az J 

The  Schrodinger  equation  gives 

fp 

E-'V  = - V2T  +  C/-T 

2  m 

ti2  (  2  1  \  fee2 

=  - - „  I  T - T 

2m  \ar  az  J  r 

If  we  require  this  equation  to  hold  for  all  r,  then  we  must  have 
equality  for  both  the  terms  of  the  form  (constant)  X  T  and  for  those 
of  the  form  (constant/r)  X  T.  That  means 

e  =  -—2 

2maz 


and 


0  = 


h2 


mar 


ke2 

r 


These  two  equations  can  be  solved  for  the  unknowns  a  and  E,  giving 

h2 


a  = 


mke2 


and 


mk2e 4 

e  =  -^HT’ 

where  the  result  for  the  energy  agrees  with  the  Bohr  equation  for 
n  =  1.  The  calculation  of  the  normalization  constant  u  is  relegated 
to  homework  problem  36. 

seif-check  K 

We’ve  verified  that  the  function  W  =  he~da  is  a  solution  to  the  Schrodinger 
equation,  and  yet  it  has  a  kink  in  it  at  r  =  0.  What’s  going  on  here?  Didn’t 
I  argue  before  that  kinks  are  unphysical?  c>  Answer,  p.  969 

Wave  phases  in  the  hydrogen  molecule  example  25 

In  example  16  on  page  869,  I  argued  that  the  existence  of  the 
H2  molecule  could  essentially  be  explained  by  a  particle-in-a-box 
argument:  the  molecule  is  a  bigger  box  than  an  individual  atom, 
so  each  electron’s  wavelength  can  be  longer,  its  kinetic  energy 
lower.  Now  that  we’re  in  possession  of  a  mathematical  expression 
for  the  wavefunction  of  the  hydrogen  atom  in  its  ground  state,  we 
can  make  this  argument  a  little  more  rigorous  and  detailed.  Sup¬ 
pose  that  two  hydrogen  atoms  are  in  a  relatively  cool  sample  of 
monoatomic  hydrogen  gas.  Because  the  gas  is  cool,  we  can  as¬ 
sume  that  the  atoms  are  in  their  ground  states.  Now  suppose  that 
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the  two  atoms  approach  one  another.  Making  use  again  of  the  as¬ 
sumption  that  the  gas  is  cool,  it  is  reasonable  to  imagine  that  the 
atoms  approach  one  another  slowly.  Now  the  atoms  come  a  lit¬ 
tle  closer,  but  still  far  enough  apart  that  the  region  between  them 
is  classically  forbidden.  Each  electron  can  tunnel  through  this 
classically  forbidden  region,  but  the  tunneling  probability  is  small. 
Each  one  is  now  found  with,  say,  99%  probability  in  its  original 
home,  but  with  1%  probability  in  the  other  nucleus.  Each  elec¬ 
tron  is  now  in  a  state  consisting  of  a  superposition  of  the  ground 
state  of  its  own  atom  with  the  ground  state  of  the  other  atom. 
There  are  two  peaks  in  the  superposed  wavefunction,  but  one  is 
a  much  bigger  peak  than  the  other. 

An  interesting  question  now  arises.  What  are  the  relative  phases 
of  the  two  electrons?  As  discussed  on  page  863,  the  absolute 
phase  of  an  electron’s  wavefunction  is  not  really  a  meaningful 
concept.  Suppose  atom  A  contains  electron  Alice,  and  B  elec¬ 
tron  Bob.  Just  before  the  collision,  Alice  may  have  wondered,  “Is 
my  phase  positive  right  now,  or  is  it  negative?  But  of  course  I 
shouldn’t  ask  myself  such  silly  questions,”  she  adds  sheepishly. 


i  /  Example  25. 


But  relative  phases  are  well  defined.  As  the  two  atoms  draw 
closer  and  closer  together,  the  tunneling  probability  rises,  and 
eventually  gets  so  high  that  each  electron  is  spending  essentially 
50%  of  its  time  in  each  atom.  It’s  now  reasonable  to  imagine  that 
either  one  of  two  possibilities  could  obtain.  Alice’s  wavefunction 
could  either  look  like  i/1,  with  the  two  peaks  in  phase  with  one 
another,  or  it  could  look  like  i/2,  with  opposite  phases.  Because 
relative  phases  of  wavefunctions  are  well  defined,  states  1  and  2 
are  physically  distinguishable.  In  particular,  the  kinetic  energy  of 
state  2  is  much  higher;  roughly  speaking,  it  is  like  the  two-hump 
wave  pattern  of  the  particle  in  a  box,  as  opposed  to  1 ,  which  looks 
roughly  like  the  one-hump  pattern  with  a  much  longer  wavelength. 
Not  only  that,  but  an  electron  in  state  1  has  a  large  probability  of 
being  found  in  the  central  region,  where  it  has  a  large  negative 
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electrical  energy  due  to  its  interaction  with  both  protons.  State  2, 
on  the  other  hand,  has  a  low  probability  of  existing  in  that  region. 
Thus  state  1  represents  the  true  ground-state  wavefunction  of  the 
H2  molecule,  and  putting  both  Alice  and  Bob  in  that  state  results 
in  a  lower  energy  than  their  total  energy  when  separated,  so  the 
molecule  is  bound,  and  will  not  fly  apart  spontaneously. 

State  i/3,  on  the  other  hand,  is  not  physically  distinguishable  from 
i/2,  nor  is  i/4  from  i/1 .  Alice  may  say  to  Bob,  “Isn’t  it  wonderful  that 
we’re  in  state  1  or  4?  I  love  being  stable  like  this.”  But  she  knows 
it’s  not  meaningful  to  ask  herself  at  a  given  moment  which  state 
she’s  in,  1  or  4. 

Discussion  Questions 

A  States  of  hydrogen  with  n  greater  than  about  1 0  are  never  observed 
in  the  sun.  Why  might  this  be? 

B  Sketch  graphs  of  r  and  E  versus  n  for  the  hydrogen,  and  compare 
with  analogous  graphs  for  the  one-dimensional  particle  in  a  box. 

13.4.5  Electron  spin 

It’s  disconcerting  to  the  novice  ping-pong  player  to  encounter 
for  the  first  time  a  more  skilled  player  who  can  put  spin  on  the  ball. 
Even  though  you  can’t  see  that  the  ball  is  spinning,  you  can  tell 
something  is  going  on  by  the  way  it  interacts  with  other  objects  in 
its  environment.  In  the  same  way,  we  can  tell  from  the  way  electrons 
interact  with  other  things  that  they  have  an  intrinsic  spin  of  their 
own.  Experiments  show  that  even  when  an  electron  is  not  moving 
through  space,  it  still  has  angular  momentum  amounting  to  K/ 2. 
An  important  historical  experiment  of  this  type,  the  Stern-Gerlach 
experiment,  is  described  in  detail  in  section  13.5.1. 

This  may  seem  paradoxical  because  the  quantum  moat,  for  in¬ 
stance,  gave  only  angular  momenta  that  were  integer  multiples  of 
h,  not  half-units,  and  I  claimed  that  angular  momentum  was  al¬ 
ways  quantized  in  units  of  h,  not  just  in  the  case  of  the  quantum 
moat.  That  whole  discussion,  however,  assumed  that  the  angular 
momentum  would  come  from  the  motion  of  a  particle  through  space. 
The  h/ 2  angular  momentum  of  the  electron  is  simply  a  property  of 
the  particle,  like  its  charge  or  its  mass.  It  has  nothing  to  do  with 
whether  the  electron  is  moving  or  not,  and  it  does  not  come  from  any 
internal  motion  within  the  electron.  Nobody  has  ever  succeeded  in 
finding  any  internal  structure  inside  the  electron,  and  even  if  there 
was  internal  structure,  it  would  be  mathematically  impossible  for  it 
to  result  in  a  half-unit  of  angular  momentum. 

We  simply  have  to  accept  this  K/2  angular  momentum,  called 
the  “spin”  of  the  electron  —  Mother  Nature  rubs  our  noses  in  it  as 
an  observed  fact.  Protons  and  neutrons  have  the  same  h/2  spin, 
while  photons  have  an  intrinsic  spin  of  h.  In  general,  half-integer 
spins  are  typical  of  material  particles.  Integral  values  are  found  for 


spin,  s 


orbital  angular 
momentum,  K 


j/The  top  has  angular  mo¬ 
mentum  both  because  of  the 
motion  of  its  center  of  mass 
through  space  and  due  to  its 
internal  rotation.  Electron  spin  is 
roughly  analogous  to  the  intrinsic 
spin  of  the  top. 
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the  particles  that  carry  forces:  photons,  which  embody  the  electric 
and  magnetic  fields  of  force,  as  well  as  the  more  exotic  messengers 
of  the  nuclear  and  gravitational  forces. 

As  was  the  case  with  ordinary  angular  momentum,  we  can  de¬ 
scribe  spin  angular  momentum  in  terms  of  its  magnitude,  and  its 
component  along  a  given  axis.  We  write  s  and  sz  for  these  quantities, 
expressed  in  units  of  h,  so  an  electron  has  s  =  1/2  and  sz  =  +1/2 
or  —1/2. 

Taking  electron  spin  into  account,  we  need  a  total  of  four  quan¬ 
tum  numbers  to  label  a  state  of  an  electron  in  the  hydrogen  atom: 
n,  £,  £z,  and  sz.  (We  omit  s  because  it  always  has  the  same  value.) 
The  symbols  and  include  only  the  angular  momentum  the  electron 
has  because  it  is  moving  through  space,  not  its  spin  angular  mo¬ 
mentum.  The  availability  of  two  possible  spin  states  of  the  electron 
leads  to  a  doubling  of  the  numbers  of  states: 


n  = 

1, 

e  = 

0, 

4  =  0, 

sz  =  +1/2  or  —1/2 

two  states 

n  = 

2, 

£  = 

0, 

4  =  o, 

sz  =  +1/2  or  —1/2 

two  states 

n  = 

2, 

£  = 

1, 

4  =  -1,  0,  or  1, 

sz  =  +1/2  or  —1/2 

six  states 

A  note  about  notation 

There  are  unfortunately  two  inconsistent  systems  of  notation 
for  the  quantum  numbers  we’ve  been  discussing.  The  notation  I’ve 
been  using  is  the  one  that  is  used  in  nuclear  physics,  but  there  is  a 
different  one  that  is  used  in  atomic  physics. 


nuclear  physics 

atomic  physics 

n 

same 

t 

same 

4 

no  notation 

ly 

no  notation 

4 

m 

s  =  1/2 

no  notation  (sometimes  a) 

$X 

no  notation 

Sy 

no  notation 

s 

The  nuclear  physics  notation  is  more  logical  (not  giving  special  sta¬ 
tus  to  the  2  axis)  and  more  memorable  (£z  rather  than  the  obscure 
m),  which  is  why  I  use  it  consistently  in  this  book,  even  though 
nearly  all  the  applications  we’ll  consider  are  atomic  ones. 

We  are  further  encumbered  with  the  following  historically  de¬ 
rived  letter  labels,  which  deserve  to  be  eliminated  in  favor  of  the 
simpler  numerical  ones: 

£  =  0  £=1  £  =  2  £  =  3 

s  p  d  f 
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n  =  1  n  =  2  n  =  3  n  =  4  n  =  5  n  =  6  n  =  7 

K  L  M  N  0  P  Q 

The  spdf  labels  are  used  in  both  nuclear9  and  atomic  physics,  while 
the  KLMNOPQ  letters  are  used  only  to  refer  to  states  of  electrons. 

And  finally,  there  is  a  piece  of  notation  that  is  good  and  useful, 
but  which  I  simply  haven’t  mentioned  yet.  The  vector  j  =  t  +  s 
stands  for  the  total  angular  momentum  of  a  particle  in  units  of  h , 
including  both  orbital  and  spin  parts.  This  quantum  number  turns 
out  to  be  very  useful  in  nuclear  physics,  because  nuclear  forces  tend 
to  exchange  orbital  and  spin  angular  momentum,  so  a  given  energy 
level  often  contains  a  mixture  of  i  and  s  values,  while  remaining 
fairly  pure  in  terms  of  j. 

13.4.6  Atoms  with  more  than  one  electron 

What  about  other  atoms  besides  hydrogen?  It  would  seem  that 
things  would  get  much  more  complex  with  the  addition  of  a  second 
electron.  A  hydrogen  atom  only  has  one  particle  that  moves  around 
much,  since  the  nucleus  is  so  heavy  and  nearly  immobile.  Helium, 
with  two,  would  be  a  mess.  Instead  of  a  wavefunction  whose  square 
tells  us  the  probability  of  finding  a  single  electron  at  any  given  lo¬ 
cation  in  space,  a  helium  atom  would  need  to  have  a  wavefunction 
whose  square  would  tell  us  the  probability  of  finding  two  electrons 
at  any  given  combination  of  points.  Ouch!  In  addition,  we  would 
have  the  extra  complication  of  the  electrical  interaction  between  the 
two  electrons,  rather  than  being  able  to  imagine  everything  in  terms 
of  an  electron  moving  in  a  static  field  of  force  created  by  the  nucleus 
alone. 

Despite  all  this,  it  turns  out  that  we  can  get  a  surprisingly  good 
description  of  many-electron  atoms  simply  by  assuming  the  elec¬ 
trons  can  occupy  the  same  standing-wave  patterns  that  exist  in  a 
hydrogen  atom.  The  ground  state  of  helium,  for  example,  would 
have  both  electrons  in  states  that  are  very  similar  to  the  n  =  1 
states  of  hydrogen.  The  second-lowest-energy  state  of  helium  would 
have  one  electron  in  an  n  =  1  state,  and  the  other  in  an  n  =  2  states. 
The  relatively  complex  spectra  of  elements  heavier  than  hydrogen 
can  be  understood  as  arising  from  the  great  number  of  possible  com¬ 
binations  of  states  for  the  electrons. 

A  surprising  thing  happens,  however,  with  lithium,  the  three- 
electron  atom.  We  would  expect  the  ground  state  of  this  atom  to 
be  one  in  which  all  three  electrons  settle  down  into  n  =  1  states. 
What  really  happens  is  that  two  electrons  go  into  n  =  1  states,  but 
the  third  stays  up  in  an  n  =  2  state.  This  is  a  consequence  of  a  new 
principle  of  physics: 


9After  f,  the  series  continues  in  alphabetical  order.  In  nuclei  that  are  spinning 
rapidly  enough  that  they  are  almost  breaking  apart,  individual  protons  and 
neutrons  can  be  stirred  up  to  £  values  as  high  as  7,  which  is  j. 
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I  0 

in  ii  in  iv  v  vi  vii  v-r 

H  He 

3  4  I  5  I  6  I  7  I  8  I  9  10 

Li  Be  I  B  I  C  I  N  I  O  I  F  I  Ne 

n 

Na 


k/The  beginning  of  the  peri¬ 
odic  table. 


I  /  Hydrogen  is  highly  reactive. 


The  Pauli  Exclusion  Principle 

Only  one  electron  can  ever  occupy  a  given  state. 


There  are  two  n  =  1  states,  one  with  sz  =  +1/2  and  one  with 
sz  =  —1/2,  but  there  is  no  third  n  =  1  state  for  lithium’s  third 
electron  to  occupy,  so  it  is  forced  to  go  into  an  n  =  2  state. 

It  can  be  proved  mathematically  that  the  Pauli  exclusion  prin¬ 
ciple  applies  to  any  type  of  particle  that  has  half-integer  spin.  Thus 
two  neutrons  can  never  occupy  the  same  state,  and  likewise  for  two 
protons.  Photons,  however,  are  immune  to  the  exclusion  principle 
because  their  spin  is  an  integer. 

Deriving  the  periodic  table 

We  can  now  account  for  the  structure  of  the  periodic  table,  which 
seemed  so  mysterious  even  to  its  inventor  Mendeleev.  The  first  row 
consists  of  atoms  with  electrons  only  in  the  n  =  1  states: 

H  1  electron  in  an  n  =  1  state 
He  2  electrons  in  the  two  n  =  1  states 

The  next  row  is  built  by  filling  the  n  =  2  energy  levels: 

Li  2  electrons  in  n  =  1  states,  1  electron  in  an  n  =  2  state 
Be  2  electrons  in  n  =  1  states,  2  electrons  in  n  =  2  states 

O  2  electrons  in  n  =  1  states,  6  electrons  in  n  =  2  states 

F  2  electrons  in  n  =  1  states,  7  electrons  in  n  =  2  states 

Ne  2  electrons  in  n  =  1  states,  8  electrons  in  n  =  2  states 

In  the  third  row  we  start  in  on  the  n  =  3  levels: 

Na  2  electrons  in  n  =  1  states,  8  electrons  in  n  =  2  states,  1 
electron  in  an  n  =  3  state 


We  can  now  see  a  logical  link  between  the  filling  of  the  energy 
levels  and  the  structure  of  the  periodic  table.  Column  0,  for  exam¬ 
ple,  consists  of  atoms  with  the  right  number  of  electrons  to  fill  all 
the  available  states  up  to  a  certain  value  of  n.  Column  I  contains 
atoms  like  lithium  that  have  just  one  electron  more  than  that. 

This  shows  that  the  columns  relate  to  the  filling  of  energy  levels, 
but  why  does  that  have  anything  to  do  with  chemistry?  Why,  for 
example,  are  the  elements  in  columns  I  and  VII  dangerously  reac¬ 
tive?  Consider,  for  example,  the  element  sodium  (Na),  which  is  so 
reactive  that  it  may  burst  into  flames  when  exposed  to  air.  The 
electron  in  the  n  =  3  state  has  an  unusually  high  energy.  If  we  let 
a  sodium  atom  come  in  contact  with  an  oxygen  atom,  energy  can 
be  released  by  transferring  the  n  =  3  electron  from  the  sodium  to 
one  of  the  vacant  lower-energy  n  =  2  states  in  the  oxygen.  This 
energy  is  transformed  into  heat.  Any  atom  in  column  I  is  highly 
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reactive  for  the  same  reason:  it  can  release  energy  by  giving  away 
the  electron  that  has  an  unusually  high  energy. 

Column  VII  is  spectacularly  reactive  for  the  opposite  reason: 
these  atoms  have  a  single  vacancy  in  a  low-energy  state,  so  energy 
is  released  when  these  atoms  steal  an  electron  from  another  atom. 

It  might  seem  as  though  these  arguments  would  only  explain 
reactions  of  atoms  that  are  in  different  rows  of  the  periodic  table, 
because  only  in  these  reactions  can  a  transferred  electron  move  from 
a  higher-n  state  to  a  lower-ro  state.  This  is  incorrect.  An  n  =  2  elec¬ 
tron  in  fluorine  (F),  for  example,  would  have  a  different  energy  than 
an  n  =  2  electron  in  lithium  (Li),  due  to  the  different  number  of 
protons  and  electrons  with  which  it  is  interacting.  Roughly  speak¬ 
ing,  the  n  =  2  electron  in  fluorine  is  more  tightly  bound  (lower  in 
energy)  because  of  the  larger  number  of  protons  attracting  it.  The 
effect  of  the  increased  number  of  attracting  protons  is  only  partly 
counteracted  by  the  increase  in  the  number  of  repelling  electrons, 
because  the  forces  exerted  on  an  electron  by  the  other  electrons  are 
in  many  different  directions  and  cancel  out  partially. 


13.5  -k  Some  Optional  Topics 

13.5.1  The  Stern-Gerlach  experiment 

In  1921,  Otto  Stern  proposed  an  experiment  about  angular  mo¬ 
mentum,  shown  in  figure  a  on  p.  906,  that  his  boss  at  the  University 
of  Frankfurt  and  many  of  his  colleagues  were  certain  wouldn’t  work. 
At  this  time,  quantization  of  angular  momentum  had  been  proposed 
by  Niels  Bohr,  but  most  physicists,  if  they  had  heard  of  it  at  all, 
thought  of  the  idea  as  a  philosophical  metaphor  or  a  mathemati¬ 
cal  trick  that  just  happened  to  give  correct  results.  World  War  I 
was  over,  hyperinflation  was  getting  under  way  in  Germany  (a  pa¬ 
per  mark  was  worth  a  few  percent  of  its  prewar  value),  and  the 
Nazi  coup  was  still  in  the  future,  so  that  Stern,  a  Jew,  had  not  yet 
been  forced  to  flee  to  America.  Because  of  the  difficult  economic 
situation,  Stern  and  his  colleague  Walther  Gerlach  scraped  up  some 
of  the  funds  to  carry  out  the  experiment  from  US  banker  Henry 
Goldman,  cofounder  of  the  investment  house  Goldman-Sachs. 


Section  13.5  *  Some  Optional  Topics 
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a /Bottom:  A  schematic  di¬ 

agram  of  the  Stern-Gerlach 
experiment.  The  z  direction 

is  out  of  the  page.  The  entire 
apparatus  is  about  10  cm  long. 
Top:  A  portion  of  Gerlach’s  cel¬ 
ebratory  1922  postcard  to  Niels 
Bohr,  with  a  photo  showing  the 
results.  A  coordinate  system  is 
superimposed.  The  orientation  is 
flipped  downward  by  90  degrees 
compared  to  the  schematic.  The 
photo  was  taken  through  a  micro¬ 
scope,  and  Gerlach  drew  the  1 .0 
mm  scale  on  after  the  magnified 
photo  had  been  printed. 


The  entire  apparatus  was  sealed  inside  a  vacuum  chamber  with 
the  best  vacuum  obtainable  at  the  time.  A  sample  of  silver  was 
heated  to  1000°C,  evaporating  it.  The  atoms  leaving  the  oven  en¬ 
countered  two  narrow  slits,  so  that  what  emerged  was  a  beam  with 
a  width  of  only  0.03  mm,  or  about  a  third  of  the  width  of  a  human 
hair.  The  atoms  then  encountered  a  magnetic  field.  Because  the 
atoms  were  electrically  neutral,  we  would  normally  expect  them  to 
be  unaffected  by  a  magnetic  field.  But  in  the  planetary  model  of 
the  atom,  we  imagine  the  electrons  as  orbiting  in  circles  like  lit¬ 
tle  current  loops,  which  would  give  the  atom  a  magnetic  dipole 
moment  m.  Even  if  we  are  sophisticated  enough  about  quantum 
mechanics  not  to  believe  in  the  circular  orbits,  it  is  reasonable  to 
imagine  that  such  a  dipole  moment  would  exist.  When  a  dipole 
encounters  a  nonuniform  field,  it  experiences  a  force  (example  6, 
p.  573).  In  this  example,  the  forces  in  the  x  and  z  directions  would 
be  Fx  =  m  •  (OB /dx)  and  Fz  =  m  •  (OB /dz).  (Because  of  Gauss’s 
law  for  magnetism,  these  two  derivatives  are  not  independent  —  we 
have  dBx/dx-\-dB~/dz  =  0.)  The  rapidly  varying  magnetic  field  for 
this  experiment  was  provided  by  a  pair  of  specially  shaped  magnet 
poles,  described  in  example  27,  p.  722. 

Because  electrons  have  charge,  we  expect  the  motion  of  an  elec¬ 
tron  to  give  it  a  magnetic  dipole  moment  m.  But  they  also  have 
mass,  so  for  exactly  the  same  reasons,  we  expect  there  to  be  some 
angular  momentum  L  as  well.  The  analogy  is  in  fact  mathemati¬ 
cally  exact  —  charge  is  to  mass  as  m  is  to  L  -  so  that  classically  we 
expect  m  and  L  to  be  exactly  the  same  vector,  except  for  a  rescaling 
by  the  charge-to-mass  ratio  —e/m.  Therefore  this  experiment  with 
dipoles  and  magnetic  fields  is  actually  a  probe  of  the  behavior  of 
angular  momentum  at  the  atomic  level. 

Luckily  for  Stern  and  Gerlach,  who  had  no  modern  knowledge  of 
atomic  structure,  the  silver  atoms  that  they  chose  to  use  do  happen 
to  have  nonzero  total  L,  and  therefore  nonzero  m.  The  atoms  come 
out  of  the  oven  with  random  orientations. 

Classically,  we  would  expect  the  following.  Each  atom  has  an 
energy  m  •  B  due  to  its  interaction  with  the  magnetic  field,  and 
this  energy  is  conserved,  so  that  the  component  mx  stays  constant. 
However,  there  is  a  torque  m  x  B,  and  this  causes  the  direction  of 
the  atom’s  angular  momentum  to  precess,  i.e.,  wobble  like  a  top, 
with  its  angular  momentum  forming  a  cone  centered  on  the  x  axis 
(example  25,  p.  283).  This  precession  is  extremely  fast,  carrying  out 
about  1010  wobbles  per  second,  so  that  the  atom  precesses  about  106 
times  while  traveling  the  3.5  cm  length  of  the  spectrometer.  So  even 
though  the  forces  Fx  and  Fz  are  typically  about  the  same  size,  the 
rapid  precession  causes  Fz  to  average  out  to  nearly  zero,  and  only 
a  deflection  in  the  x  direction  is  expected.  Because  the  orientations 
of  the  atoms  are  random  as  they  enter  the  magnetic  field,  they  will 
have  every  possible  value  of  Lx  ranging  from  —  |Lj  to  +|L|,  and 
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therefore  we  expect  that  when  the  magnetic  field  is  turned  on,  the 
effect  should  be  to  smear  out  the  image  on  the  glass  plate  from  a 
vertical  line  to  a  somewhat  wider  oval.  The  atoms  are  dispersed  from 
left  to  right  along  a  certain  scale  of  measurement  according  to  their 
random  value  of  Lx.  The  spectrometer  is  a  device  for  determining 
Lx,  a  continuously  varying  number. 

But  that’s  all  the  classical  theory.  Quantum  mechanically,  Lx  is 
quantized,  so  that  only  certain  very  specific  values  of  Fx  can  occur. 
Although  the  discussion  of  precession  above  is  really  classical  rather 
than  quantum-mechanical,  the  result  of  Fz  averaging  out  to  zero 
turns  out  to  be  approximately  right  if  the  field  is  strong.  Therefore 
we  expect  to  see  well  separated  vertical  bands  on  the  glass  plate 
corresponding  to  the  quantized  values  of  Lx.  This  is  approximately 
what  is  seen  in  figure  a,  although  the  field  rapidly  weakens  outside 
the  x-y  plane,  so  we  get  the  slightly  more  complicated  pattern  like 
a  sideways  lipstick  kiss.  Since  we  observe  two  values  of  Lx  (the  two 
“lips”),  we  conclude  from  these  results  that  a  silver  atom  has  spin 
1/2,  so  that  Lx  takes  on  the  values  —h/ 2  and  +h/2.  Although  it 
took  about  five  years  for  the  experiment  to  be  interpreted  completely 
correctly,  we  now  understand  the  Stern-Gerlach  experiment  to  be 
not  just  a  confirmation  of  the  quantization  of  angular  momentum 
along  any  given  axis  but  also  the  first  experimental  evidence  that 
the  electron  has  an  intrinsic  spin  of  1/2. 

Discussion  Questions 

A  Could  the  Stern-Gerlach  experiment  be  carried  out  with  a  beam  of 
electrons? 

B  A  few  weeks  after  the  Stern-Gerlach  experiment’s  results  became 
public,  Einstein  and  Ehrenfest  carried  out  the  following  reasoning,  which 
seemed  to  them  to  make  the  results  inexplicable.  Before  a  particular  sil¬ 
ver  atom  enters  the  magnetic  field,  its  magnetic  moment  m  is  randomly 
oriented.  Once  it  enters  the  magnetic  field,  it  has  an  energy  m  •  B.  Unless 
there  is  a  mechanism  for  the  transfer  of  energy  in  or  out  of  the  atom,  this 
energy  can’t  change,  and  therefore  the  magnetic  moment  can  only  pre- 
cess  about  the  B  vector,  but  the  angle  between  m  and  B  must  remain  the 
same.  Therefore  the  atom  cannot  align  itself  with  the  field.  (They  consid¬ 
ered  various  mechanisms  of  energy  loss,  such  as  collisions  and  radiation, 
and  concluded  that  all  of  them  were  too  slow  by  orders  of  magnitude  to 
have  an  effect  during  the  atom’s  time  of  flight.)  It  seemed  to  them  that 
as  soon  as  the  atom  left  the  oven,  it  was  somehow  required  to  have  an¬ 
ticipated  the  direction  of  the  field  and  picked  one  of  two  orientations  with 
respect  to  it.  How  can  this  paradox  be  resolved? 

C  Suppose  we  send  a  beam  of  oxygen  molecules,  with  L  =  h,  through  a 
Stern-Gerlach  spectrometer,  throwing  away  the  emerging  parts  with  4  = 
-1  and  +1  to  make  a  beam  of  the  pure  4  =  0  state.  Now  we  let  this  beam 
pass  through  a  second  spectrometer  that  is  identical  but  oriented  along 
the  z  axis.  Can  we  produce  a  beam  in  which  every  molecule  has  both 
4  =  0  and  4  =  +1  ?  Hint:  See  example  24,  p.  892. 
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b  /  Visible  spectrum  of  N2.  Violet 
is  on  the  left,  red  on  the  right. 


c/1.  The  molecule  N2.  2.  The 
nucleus  178Hf. 


13.5.2  Rotation  and  vibration 

Types  of  excitations 

Figure  b  shows  the  visible- light  spectrum  of  the  molecule  N2.  Be¬ 
cause  this  particular  chemical  bond  is  unusually  strong,  the  molecule 
does  not  break  apart,  even  at  the  high  temperature  of  a  gas  discharge 
tube,  so  we  see  the  spectrum  of  the  molecule,  not  of  monoatomic 
nitrogen.  This  spectrum  is  more  complex  than  the  spectrum  of  the 
hydrogen  atom,  and  that’s  not  surprising,  because  the  number  of 
different  states  grows  exponentially  with  the  number  of  particles 
(here,  14  electrons  plus  2  nuclei). 


What  is  more  surprising  is  that  there  are  some  clear,  simple  pat¬ 
terns  in  this  spectrum  patterns  simpler  than  any  that  we  would 
see  in  the  spectrum  of  a  monoatomic  gas  with  the  same  number  of 
particles.  To  start  to  understand  this,  we  note  that  the  N2  lacks 
the  spherical  symmetry  of  an  individual  atom,  but  it  does  have  an 
axis  of  symmetry,  c/1.  These  properties  are  also  possessed  by  many 
nuclei,  e.g.,  c/2.  We  now  consider  three  different  ways  in  which  an 
excited  energy  state  could  occur  in  N2: 


•  Individual  particles  (electrons)  can  be  raised  to  a  higher  energy 
level. 

•  The  molecule  can  vibrate  along  its  long  axis,  so  that  the  nu¬ 
clei  (which  have  nearly  all  the  inertia)  move  back  and  forth, 
elongating  and  compressing  the  system. 

•  The  molecule  can  rotate. 


Vibration 

Particle  excitations  would  produce  the  type  of  very  complex,  dis¬ 
organized  spectrum  that  we  normally  see  in  atoms  that  have  many 
electrons,  so  that  isn’t  what  we’re  seeing  in  figure  b.  What  about 
vibrations?  For  a  classical  harmonic  oscillator,  we  know  that  the 
frequency  of  vibration  is  independent  of  the  amplitude.  If  a  clas¬ 
sical  oscillator  contains  electric  charge,  it  will  emit  electromagnetic 
radiation  at  this  frequency,  smoothly  and  continuously  draining  it¬ 
self  of  energy.  As  the  energy  is  lost,  the  frequency  stays  the  same. 
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By  the  correspondence  principle,  we  expect  that  when  the  quantum 
mechanical  version  of  such  a  system  is  highly  excited,  it  should  emit 
a  large  number  of  photons  of  this  frequency  /,  so  that  the  discrete 
quantum  jumps  are  undetectable  and  the  radiation  appears  as  a 
classical  wave.  We  can  thus  infer  that  for  a  quantum  vibrator,  the 
excited  states  should  show  an  evenly  spaced  ladder  of  energy  levels. 

Figure  d  shows  how  the  series  of  red  lines  in  figure  b  arises.  For 
an  excitation  consisting  only  of  vibrational  motion,  we  expect  based 
on  the  correspondence  principle  to  see  the  evenly  spaced  ladder  of 
states  shown  in  a  stack  built  above  the  ground  state,  with  all  of  the 
photons  having  the  same  energy.  These  states  and  transitions  do 
exist,  but  the  light  lies  in  the  infrared  spectrum  and  so  is  not  seen 
in  figure  b.  The  red  visible- light  lines  arise  as  shown  in  the  second 
box,  from  states  that  involve  both  a  certain  particle  excitation  and 
some  vibration.  Because  the  spacing  of  the  two  ladders  is  slightly 
unequal,  the  red  lines  all  have  slightly  different  wavelengths. 

Rotation 

What  about  rotation?  An  interesting  thing  happens  here  due 
to  the  structure  of  quantum  mechanics.  Quantum  mechanics  can 
describe  motion  only  as  a  wave,  with  the  value  of  the  wave  oscillat¬ 
ing  from  one  place  to  another.  But  this  implies  that  according  to 
quantum  mechanics,  no  object  can  rotate  about  one  of  its  axes  of 
symmetry,  for  the  rotated  version  of  a  state  would  then  be  the  same 
state.  This  is  why  rotational  excitations  are  never  seen  in  individ¬ 
ual  atoms,  or  in  nuclei  that  have  spherical  shapes.  In  examples  like 
the  ones  in  figure  c,  which  have  a  single  axis  of  symmetry,  we  can 
therefore  have  end-over-end  rotation,  but  never  rotation  about  the 
symmetry  axis.  Such  end-over-end  rotational  states  are  observed  in 
N2,  for  example,  but  because  this  involves  large  motions  by  the  high- 
mass  nuclei,  the  moment  of  inertia  I  is  quite  large,  and  therefore  the 
rotational  energies  —  classically,  K  =  L2 /2I  —  are  very  small,  and 
infrared  rather  than  visible  photons  are  emitted.  If  rotation  about 
the  symmetry  axis  were  possible,  then  the  moment  of  inertia  would 
be  thousands  of  times  smaller,  because  in  such  a  rotation  the  nu¬ 
clei  would  not  move.  The  energies  involved  would  be  thousands  of 
times  higher,  and  the  photons  would  lie  approximately  in  the  visible 
region  of  the  spectrum.  No  such  visible  lines  are  actually  observed. 

Perhaps  more  vivid  evidence  for  the  nonexistence  of  rotation 
about  a  symmetry  axis  is  shown  in  figure  e.  The  states  involving 
end-over-end  rotation  of  the  nucleus  as  a  whole  (“collective”  rota¬ 
tion)  are  approximately  a  parabola  on  this  graph,  which  is  reason¬ 
able  given  the  classical  relation  K  =  L2 /2I.  But  angular  momentum 
cannot  be  generated  along  the  symmetry  axis  through  collective  ro¬ 
tation.  Instead,  we  see  an  irregular  set  of  energy  levels  in  which  first 
one  particle  (for  L  <  8 K)  and  then  two  (14  and  16  h)  are  excited. 
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e  /  Excited  states  of  the  nu¬ 
cleus  178Hf.  Black  squares 
represent  states  that  are  inter¬ 
preted  as  end-over-end  rotation, 
while  white  diamonds  show  parti¬ 
cle  excitations.  For  each  angular 
momentum,  the  graph  shows  the 
lowest-energy  state  of  each  type, 
where  known. 
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Note  that  only  even  multiples  of  h  are  observed  in  collective 
rotation  in  figure  e.  This  is  because  the  nucleus’s  shape  has  an 
additional  mirror  symmetry,  so  that  it  is  unaffected  by  a  180-degree 
rotation.  This  means  that  the  wavefunction  describing  the  collective 
rotation  must  oscillate  twice  as  we  pass  through  a  full  rotation. 

Corrections  to  serniclassical  energies 

So  far  we’ve  been  using  the  correspondence  principle  to  make 
educated  guesses  about  quantum-mechanical  expressions  for  the  en¬ 
ergies  of  vibrators  and  rotors.  This  style  of  reasoning  is  called 
serniclassical,  because  it  combines  ideas  from  classical  and  quantum 
physics.  These  expressions  are  guaranteed  to  be  good  approxima¬ 
tions  in  the  classical  limit  obtained  when  the  quantum  numbers  are 
large,  but  figure  f  shows  that  the  approximations  can  be  poorer 
when  the  quantum  numbers  are  small. 


f  /  Quantum-mechanical  correc¬ 
tions  to  the  serniclassical  results 
for  the  energy  of  a  vibrator  and 
a  rotor.  The  rotational  levels  are 
shown  for  the  case  of  a  rotor  with 
mirror  symmetry,  so  that  only 
even  values  of  t  occur. 
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In  the  case  of  the  nth  excited  state  of  a  vibrator,  the  energy  is 
(n  +  1/2 )hf,  where  the  +1/2  term  represents  a  quantum  correction 
to  the  serniclassical  approximation.  This  shifts  the  entire  ladder  up¬ 
ward  in  energy  by  half  a  step.  In  particular,  the  energy  of  the  ground 
state  is  not  zero  but  rather  (1/2 )hf .  This  can  be  verified  quantita¬ 
tively  by  calculating  the  energy  for  the  solution  to  the  Schrodinger 
discussed  using  the  guess-and-check  method  in  problem  23,  p.  936. 
It  is  easy  to  see  why  the  answer  cannot  be  zero,  for  if  it  were,  then 
the  particle  in  the  ground  state  would  have  zero  kinetic  energy  and 
zero  potential  energy.  To  have  zero  kinetic  energy,  it  would  have 
to  have  a  momentum  of  exactly  zero,  so  A p  =  0,  but  to  have  zero 
potential  energy  it  would  also  have  to  sit  still  at  exactly  the  equi- 
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librium  position,  so  Ax  =  0.  But  this  would  violate  the  Heisenberg 
uncertainty  principle  and  so  is  impossible. 

The  inevitable  motion  that  is  present  even  in  the  ground  state  is 
known  as  zero-point  motion,  and  its  energy  is  the  zero-point  energy. 
Relativity  tells  us  that  E  =  me2,  so  the  zero-point  energy  of  par¬ 
ticles  is  equivalent  to  a  certain  amount  of  mass.  In  fact,  nearly  all 
the  mass  of  ordinary  matter  arises  from  the  zero-point  energy  of  the 
quarks  inside  the  neutrons  and  protons.  Another  interesting  appli¬ 
cation  is  to  spontaneous  nuclear  fission,  which  is  the  basis  for  nuclear 
energy,  providing  the  kick-off  for  a  chain  reaction.  Spontaneous  fis¬ 
sion  requires  that  a  nucleus  become  more  and  more  elongated  until 
it  breaks  apart  into  two  pieces.  The  very  elongated  shapes  have  a 
high  potential  energy,  so  that  spontaneous  fission  requires  quantum- 
mechanical  tunneling.  If  it  were  not  for  the  zero-point  vibrational 
energy  associated  with  this  motion,  the  tunneling  probability  for 
uranium  and  plutonium  isotopes  would  be  extremely  small.  These 
isotopes  would  decay  only  by  alpha  emission,  and  nuclear  reactors 
and  bombs  would  not  work. 


Figure  g  shows  visually  the  reason  for  the  correction  of  C2  to 
1(1  +  1).  Each  of  these  standing  waves  has  \iz\  =  16,  where  z  is  the 
vertical  axis.  But  only  g/2  is  a  solution  of  the  Schrodinger  equation 
or  a  state  of  definite  l.  To  be  a  solution  of  the  Schrodinger  equa¬ 
tion,  such  a  wave  must  have  the  same  kinetic  energy  everywhere. 
Each  of  these  three  has  the  same  kinetic  energy  associated  with  its 
wavelength  in  the  “east- west,”  or  azimuthal,  direction.  Wave  g/1  is 
not  a  solution,  because  near  the  equator,  it  has  an  extremely  short 
wavelength  in  the  “north-south,”  or  longitudinal,  direction,  and  this 
gives  it  a  greater  kinetic  energy  near  the  equator  than  elsewhere. 
The  opposite  problem  occurs  in  g/3,  where  the  wave  is  constant  in 
the  longitudinal  direction;  at  the  poles,  the  wavefunction  varies  in¬ 
finitely  rapidly,  and  therefore  the  kinetic  energy  blows  up  to  infinity 
there.  The  only  valid  solution  is  g/2,  which  has  a  Goldilocks-style 
just-right  wavelength  in  the  longitudinal  direction,  The  kinetic  en¬ 
ergy  associated  with  this  wavelength  is  the  difference  between  the 
semiclassical  f2  and  the  correct  quantum  mechanical  l{l  +  1). 

A  different  example  that  is  particularly  easy  to  reason  about  is 


g  /  Each  panel  of  the  figure  shows 
a  standing  wave  on  a  sphere,  with 
the  convention  that  gray  is  zero, 
white  is  a  positive  real  number, 
and  black  is  a  negative  real  num¬ 
ber.  (These  could  instead  have 
been  drawn  as  traveling  waves, 
but  then  we  would  have  needed 
to  represent  complex  numbers 
using  color,  as  in  figure  c  on 
p.  891 .)  Only  2  is  a  solution  of  the 
Schrodinger  equation. 
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the  wavefunction  Tio  shown  in  figure  d  on  p.  892,  for  £  =  1  and 
£z  =  0.  (The  odd  value  of  £  is  possible  for  a  rotor  that  doesn’t  have 
mirror  symmetry,  e.g.,  the  carbon  monoxide  molecule  CO.)  The 
ratio  of  the  correct  quantum  mechanical  energy  to  the  semiclassical 
one  is  £(£  +  l)/£2  =  2,  and  the  factor  of  two  makes  sense  because 
at  the  poles,  the  wave  has  equal  contributions  to  its  kinetic  energy 
due  to  oscillations  in  the  two  perpendicular  directions  that  occur  in 
the  Laplacian  d2/dx2  +  d2/dy2. 

13.5.3  The  underlying  structure  of  quantum  mechanics 

So  far  we  have  been  building  up  the  structure  of  quantum  me¬ 
chanics  by  casually  laying  one  brick  on  top  of  another,  but  at  this 
point  it  will  be  advantageous  to  pause  and  consider  the  broader 
blueprint. 

The  time-dependent  Schrodinger  equation 

For  simplicity  our  discussion  of  the  Schrodinger  equation  in  sec¬ 
tion  13.3.6,  p.  876,  was  limited  to  standing  waves,  allowing  us  to 
avoid  explicitly  discussing  how  the  wavefunction  changed  with  pass¬ 
ing  time.  Let’s  consider  the  generalization  to  the  full  time-dependent 
case. 

Classically,  suppose  I  show  you  a  picture  of  a  baseball  next  to 
a  tree,  and  I  ask  you  how  long  it  will  take  to  hit  the  ground.  You 
can’t  tell,  because  you  also  need  information  about  the  ball’s  initial 
velocity.  That  is,  the  future  time-evolution  of  the  system  x(t)  de¬ 
pends  not  just  on  the  initial  position  x(0)  but  also  on  its  initial  time 
derivative  a/(0). 

But  if  I  show  you  a  uranium  atom  in  its  lowest  energy  state,  you 
don’t  need  to  know  any  other  information  to  predict  everything  that 
can  be  predicted  about  its  future  decay.  Whereas  the  baseball  could 
be  thrown  downward  in  order  to  make  it  reach  the  ground  more 
quickly,  nobody  knows  of  any  way  to  prepare  the  uranium  nucleus 
in  such  a  way  that  it  is  any  more  likely  to  decay  sooner.  Knowing 
the  initial  wavefunction  'F(O)  to  be  that  of  the  ground  state  lets  us 
say  as  much  as  can  be  said  about  the  future  time-evolution  ’F(t), 
and  it’s  neither  necessary  nor  helpful  to  know  the  time  derivative 

*'(*)• 

This  is  an  example  of  a  more  general  idea  about  the  interpre¬ 
tation  of  quantum  mechanics,  which  is  that  the  wavefunction  is  a 
complete  description  of  any  system.  There  isn’t  more  information 
that  can  be  known  about  the  system.  (The  phase  and  normaliza¬ 
tion  of  the  wavefunction  are  not  considered  to  give  any  information, 
since  the  phase  is  unobservable,  and  the  normalization  can  be  stan¬ 
dardized  so  that  the  total  probability  is  1.)  This  principle  seems 
to  be  widely  agreed  upon  by  physicists,  but  doesn’t  seem  to  have  a 
standard  name. 
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Wavefunction  fundamentalism 

All  knowable  information  about  a  system  is  encoded  in  its  wave- 
function  (ignoring  phase  and  normalization). 


The  pilot  wave  hypothesis,  discussed  on  p.  854,  is  an  example  of  an 
idea  that  would  violate  this  principle. 

Another  example  that  shows  the  contrast  with  the  classical  de¬ 
scription  is  that  if  I  show  you  a  snapshot  of  a  wave  on  a  string,  you 
can’t  tell  which  direction  it’s  going  —  as  with  the  baseball,  you  need 
to  know  its  initial  velocity  in  addition.  But  if  I  show  you  a  snap¬ 
shot  of  a  quantum-mechanical  traveling  wave,  you  can  tell  which 
direction  it’s  going,  because  of  the  complex  phase,  as  shown  in  fig¬ 
ures  u/2  and  u/3  on  page  884.  Note  that  this  mechanism  wouldn’t 
work  if  wavefunctions  were  always  real  numbers,  so  wavefunction 
fundamentalism  implies  complex  wavefunctions. 


Given  the  wavefunction  at  some  initial  time,  we  can  predict  its 
evolution  into  the  future  by  making  use  of  the  principle  that  E  =  hf. 
Suppose  for  example  that  we  have  a  sinusoidal  plane  wave  traveling 
to  the  right.  Then  we  expect  the  value  of  the  wavefunction  at  a 
particular  point  in  space  to  rotate  clockwise  about  the  origin  in  the 
complex  plane  at  the  appropriate  freqency  /,  showing  a  time  depen¬ 
dence  e~lujt  (where,  as  usual,  u  =  2i r/).  Thus  the  time  derivative  of 
the  wavefunction  is  =  — icuT  =  —i(E/h)^f,  so  that  UT  =  iM''. 
Then  to  generalize  the  time-independent  Schrodinger  equation  to  its 
time-dependent  version,  the  most  obvious  thing  to  try  is  simply  to 
substitute  ihd'ft/dt  for  E'h ,  which  gives 


h 2  2 

- V2T  +  [/T. 

2  m 


Unlike  Newton’s  laws  of  motion,  which  refer  to  a  second  derivative 
with  respect  to  time,  the  Schrodinger  equation  involves  only  a  first 
time  derivative.  This  is  why  we  don’t  need  initial  data  on  d'l’/dt,  but 
only  T:  if  we  know  T,  then  the  right-hand  side  of  the  Schrodinger 
equation  is  what  gives  us  /dt.  But  the  Schrodinger  equation  has 
some  other  properties  that  match  up  with  those  of  Newton’s  laws. 
It  is  completely  deterministic,  so  that  if  we  know  T  initially,  we  can 
always  predict  it  in  the  future.  We  can  also  “predict”  backward 
in  time,  so  that  the  system’s  history  can  always  be  recovered  from 
knowledge  of  its  present  state.  Thus  there  is  never  any  loss  of  in¬ 
formation  over  time.  Furthermore,  it  can  be  shown  that  probability 
is  always  conserved,  in  the  sense  that  if  the  wavefunction  is  initially 
normalized,  it  will  also  be  normalized  at  all  later  times. 


Unitary  evolution  of  the  wavefunction 

The  wavefunction  evolves  over  time,  according  to  the  Schrodinger 
equation,  in  a  deterministic  and  unitary  manner,  meaning  that  prob¬ 
ability  is  conserved  and  information  is  never  lost. 
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Since  we  think  of  quantum  mechanics  as  being  all  about  ran¬ 
domness,  this  determinism  may  seem  surprising.  But  determinism 
in  the  time-evolution  of  the  wavefunction  isn’t  the  same  as  deter¬ 
minism  in  the  results  of  experiments  as  perceived  and  recorded  by 
a  human  brain.  Suppose  that  you  prepare  a  uranium  atom  in  its 
ground  state,  then  wait  one  half-life  and  observe  whether  or  not 
it  has  decayed,  as  in  the  thought  experiment  of  Schrodinger’s  cat. 
There  is  no  uncertainty  or  randomness  about  the  wavefunction  of 
the  whole  system  (atom  plus  you)  at  the  end.  We  know  for  sure  what 
it  looks  like.  It  consists  of  an  equal  superposition  of  two  states,  one 
in  which  the  atom  has  decayed  and  your  brain  has  observed  that 
fact,  and  one  in  which  the  atom  has  not  yet  decayed  and  that  fact 
is  instead  recorded  in  your  brain. 

To  get  more  of  a  feeling  for  what  is  meant  by  unitarity,  it  may 
be  helpful  to  consider  some  examples  of  how  it  could  be  violated. 
In  the  Copenhagen  interpretation  of  quantum  mechanics  (p.  872), 
measurement  causes  the  wavefunction  to  collapse  to  one  randomly 
chosen  possibility.  The  other  possibilities  disappear,  and  there  is 
no  way  to  tell  that  they  ever  existed.  Along  with  the  many  other 
problems  with  the  Copenhagen  interpretation,  this  shows  that  it 
requires  nonunit arity. 

Another  interesting  example  is  the  disappearance  of  matter  into 
a  black  hole.  If  I  throw  my  secret  teenage  diary  into  a  black  hole, 
then  it  contributes  a  little  bit  to  the  black  hole’s  mass,  but  the 
embarrassing  information  on  the  pages  is  lost  forever.  This  loss 
of  information  seems  to  imply  nonunitarity.  This  is  one  of  several 
arguments  suggesting  that  quantum  mechanics  cannot  fully  handle 
the  gravitational  force.  Thus  although  physicists  currently  seem  to 
possess  a  completely  successful  theory  of  gravity  (Einstein’s  theory 
of  general  relativity)  and  a  completely  successful  theory  of  the  mi¬ 
croscopic  world  (quantum  mechanics),  the  two  theories  are  irrecon¬ 
cilable,  and  we  can  only  make  educated  guesses,  for  example,  about 
the  behavior  of  a  hypothetical  microscopic  black  hole. 

A  plane  wave  example  26 

Consider  a  free  particle  of  mass  m  in  one  dimension,  with  the 
wavefunction  T  =  elkx~lwt.  Then  application  of  the  Schrodinger 
equation,  ihdW/dt  =  -{h2/2m)d2xil/dx2,  gives 


w->  = 

2m 


and  if  this  is  to  hold  true  for  all  values  of  x  and  t,  then  we  must 
have 

t  h2k2 
no)  =  — — , 

2m 

which  is  simply  an  expression  of  the  Newtonian  relation  K  = 
p2 /2m,  since  k\  =  2 n  and  p  =  h/ A. 
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Dispersion  of  a  wave  packet  example  27 

An  annoying  feature  of  example  26  is  that  the  wavefunction  can¬ 
not  be  normalized  because  it  extends  in  all  directions  to  infinity. 
This  type  of  infinite  plane  wave  is  at  best  an  idealization  of  the 
wavefunction  for  a  realistic  example  such  an  electron  launched 
by  a  cathode  ray  tube,  or  an  alpha  particle  emitted  by  a  nucleus. 
As  a  more  realistic  example,  we  might  try  something  like  a  wave 
packet,  such  as  a  pulse  with  a  certain  shape,  traveling  in  a  cer¬ 
tain  direction.  This  works  for  waves  on  a  string  or  for  electromag¬ 
netic  waves:  the  pulse  or  packet  simply  glides  along  while  rigidly 
maintaining  its  shape.  To  investigate  this  idea  using  the  time- 
dependent  Schrodinger  equation,  it  will  be  convenient  to  adopt 
the  frame  of  reference  in  which  the  particle  is  at  rest.  In  this  frame, 
we  would  expect  the  wave  to  look  frozen,  just  as  an  ocean  wave 
looks  frozen  in  place  to  a  surfer  who  is  riding  it.  It  must  therefore 
be  of  the  form 

W  =  e~iwtf(x), 

where  f  is  some  function  specifying  the  shape  of  the  wave  packet. 
But  this  M7  is  not  a  solution  to  the  Schrodinger  equation.  On 
the  left-hand  side  of  the  Schrodinger  equation,  evaluating  the 
time  derivative  gives  hwW,  which  is  just  the  original  wavefunc¬ 
tion  multiplied  by  a  constant.  If  we  are  to  satisfy  the  Schrodinger 
equation,  then  the  right-hand  side,  which  is  the  second  derivative 
with  respect  to  x,  must  also  be  equal  to  the  original  wavefunc¬ 
tion  multiplied  by  a  constant.  But  the  only  functions  for  which 
(cP  /dx2)(. . .)  =  (constant)(. . .)  are  exponentials  and  sine  waves. 
An  exponential  shape  obviously  isn’t  a  physical  realization  of  a 
wave  packet.  A  sine  wave  works,  but  it  just  describes  an  infinite 
plane  wave  like  the  one  in  example  26,  not  a  wave  packet  that 
can  be  localized  and  normalized. 

The  underlying  reason  for  this  result  is  that  the  Schrodinger  equa¬ 
tion  is  dispersive:  waves  with  different  wavelengths  travel  at  dif¬ 
ferent  speeds  (because  they  correspond  to  different  momenta). 
Suppose  a  pulse  has  the  shape  f(x)  at  t  =  0.  Since  a  pulse  is 
not  a  sine  wave,  it  doesn’t  have  a  single  well-defined  wavelength, 
and  therefore  it  doesn’t  have  a  definite  momentum  or  velocity.  In 
fact,  the  spread  in  momentum  must  be  at  least  a  certain  size  due 
to  the  Heisenberg  uncertainty  principle  ApAx  >  h.  This  causes 
the  pulse  to  spread  out  over  time. 

This  leads  to  a  strange  thought  experiment.  Suppose  that  a 
uranium  atom  in  the  Andromeda  galaxy  emits  an  alpha  particle, 
which  then  travels  thousands  of  light  years  and  eventually  flies 
past  the  earth.  Its  wave  packet  may  initially  have  been  as  nar¬ 
row  as  the  diameter  of  an  atomic  nucleus,  ~  10~15  m,  but  by  the 
time  it  arrives  perhaps  it  is  the  size  of  an  aircraft  carrier.  Will  an 
observer  see  a  gigantic  alpha  particle  flying  by?  No,  because  ob¬ 
serving  it  constitutes  a  measurement  of  its  position,  and  by  the 
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probability  interpretation  of  the  wavefunction  this  measurement 
simply  has  a  certain  probability  of  giving  a  result  that  is  anywhere 
within  some  region  the  size  of  an  aircraft  carrier. 

Observables 

By  the  time  my  first-year  mechanics  students  have  been  in  class 
for  a  week,  they  know  how  to  answer  when  I  ask  them  the  velocity 
of  the  tape  dispenser  at  the  front  of  the  classroom:  “We  don’t  know, 
it  depends  on  your  frame  of  reference.”  The  absolute  velocity  of  an 
object  is  a  meaningless  concept,  part  of  the  mythical  dungeons-and- 
dragons  cosmology  of  Aristotelian  physics.  Quantum  mechanics  is 
as  great  a  break  from  Newton  as  Newton  was  from  Aristotle,  and 
similar  care  is  required  in  redefining  what  concepts  are  observables 
—  meaningful  things  to  talk  about  measuring. 

Classically,  we  describe  the  state  of  the  system  as  a  point  in  phase 
space  (sec.  5.4.2,  p.  320)  —  which  is  just  a  fancy  way  of  saying  that 
we  specify  all  the  positions  and  momenta  of  its  particles  —  and  an 
observable  is  defined  as  a  function  that  takes  that  point  as  an  input 
and  produces  a  real  number  as  an  output.  (By  the  way,  the  word 
“phase”  in  “phase  space”  doesn’t  refer  directly  to  the  phase  of  a 
wave,  which  we’ll  also  be  discussing  below.)  For  example,  kinetic 
energy  is  a  classical  observable,  and  K(Q>)  =  0,  where  the  picture 
represents  a  tennis  ball  at  rest.  For  a  moving  tennis  ball  with  one 
unit  of  energy,  K(Z@)  =  1.  For  a  vibrating  violin  string,  we  could 
have  f7(^/>)  =  1,  and  U(\/s')  =  4  (where  doubling  the  amplitude 
gives  four  times  the  energy). 

Quantum-mechanically,  the  Heisenberg  uncertainty  principle  tells 
us  that  we  can’t  independently  dial  in  the  desired  values  of  a  par¬ 
ticle’s  position  and  momentum.  They  aren’t  two  variables  that  are 
independent  of  one  another.  Therefore  we  don’t  have  a  phase  space, 
so  an  observable  has  to  be  represented  by  a  function  whose  input  is 
a  wavefunction.  Furthermore,  we  expect  that: 

•  The  output  shouldn’t  depend  on  the  phase10  of  the  wavefunc¬ 
tion. 

•  The  output  shouldn’t  depend  on  amplitude  (because  a  dif¬ 
ferent  amplitude  might  just  mean  an  incorrectly  normalized 
state). 

•  The  output  should  be  well  defined  when  we  superpose  any  two 
states. 

These  requirements  are  hard  to  reconcile  with  the  idea  that  the 
output  of  the  observable  is  just  a  real  number  representing  the  result 
of  the  measurement.  We  could  decree  that  the  input  wavefunction 

10  “Phase”  as  in  the  phase  of  a  wave,  not  as  in  “phase  space.” 
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is  just  required  to  be  have  the  standard  normalization,  but  there’s 
no  obvious  way  to  define  a  standardization  of  phase.  And  suppose 
we  have  a  particle  in  a  one-dimensional  box,  with  the  two  lowest 
energies  being  E(/^s)  =  1  and  E(\/x)  =  4.  Then  what  should 
we  define  for  the  superposition  E(/^'  +  v/'1)?  We  could  define  it 
to  be  the  average,  2.5,  but  that  isn’t  even  a  possible  value  of  the 
measurement;  in  reality,  the  result  of  the  measurement  would  be 
either  1  or  4,  with  equal  probability. 

For  a  clue  as  to  a  better  way  to  proceed,  note  the  structure  of  the 
time-independent  Schrodinger  equation  for  a  free  particle,  omitting 
all  constant  factors  like  m,  2,  and  h.  It  isn’t  (cP  /  dx,2)'F  =  E,  it’s 
(cf  /  dx2)'F  =  EXV .  This  fixes  all  the  problems.  For  example,  if 
we  change  the  phase  of  the  wavefunction  by  flipping  its  sign,  the 
equation  still  holds  with  the  same  value  of  E.  This  equation  is  a 
specific  example  of  a  more  general  type  of  equation  that  looks  like 

operator(input)  =  number  x  input. 

Another,  simpler  example  is  (d/d x)f  =  3/,  which  is  satisfied  if 
/  =  Ae3x,  where  A  is  any  constant.  Such  an  equation  says  that 
applying  the  operator  to  the  input  just  gives  back  the  input  itself, 
multiplied  by  some  constant.  For  this  reason,  this  type  of  equation  is 
called  an  eigenvalue  equation,  because  “ eigen ”  is  the  German  word 
for  “self.”  We  say  that  3  is  the  eigenvalue  of  the  eigenvalue  equation 
(d/  d x)f  =  3/.  In  the  time-independent  Schrodinger  equation,  the 
eigenvalue  is  the  energy,  and  a  solution  T  is  called  a  state  of  definite 
energy  (or  “eigenstate”). 

All  observables  in  quantum  mechanics  are  described  by  opera¬ 
tors  such  as  derivatives.  The  second  derivative  (with  the  appropriate 
factor  of  —h2 /2m)  is  the  kinetic  energy  operator  in  quantum  me¬ 
chanics.  Given  an  operator  O  that  describes  a  certain  observable,  a 
state  T  with  a  definite  value  c  of  that  observable  is  one  for  which 
O('F)  =  c\k.  Although  it’s  common  to  use  parentheses  when  no¬ 
tating  functions,  as  in  cos(7r)  =  —  1,  they  are  optional,  and  we  can 
write  cos  ir  =  —1,  so  we  will  often  use  notationa  like  (HT  instead  of 
0(' F),  but  keep  in  mind  that  this  not  multiplication,  just  as  cos7r 
doesn’t  mean  multiplying  cos  by  7 r. 

When  we  carried  over  the  classical  kinetic  energy  observable  to 
quantum  mechanics,  we  weren’t  going  blind.  For  example,  the  factor 
of  —h2 /2m  in  front  is  tightly  constrained  by  requirements  like  units 
and  the  need  for  a  traveling  sine  wave  to  have  positive  energy.  But 
for  the  superposition  of  two  states,  classical  mechanics  will  never 
give  us  any  guidance.  For  example,  what  is  the  body  temperature 
of  Schrodinger’s  cat?  For  the  energy  operators  appearing  in  the 
Schrodinger  equation,  we  used  linear  operators.  The  result  was  that 
our  law  of  physics  was  perfectly  linear,  and  this  is  a  hard  require¬ 
ment,  for  the  reasons  described  on  p.  885.  It  therefore  seems  natural 
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to  require  that  all  observables  be  represented  by  linear  operators, 

0(^1  +  ^2)  =  O’F  1  +  0'S’2- 

Indeed,  if  they  were  not  linear,  then  quantum  mechanics  would  lack 
self-consistency,  for  the  act  of  measurement  can  be  described  by 
applying  the  Schrodinger  equation  to  a  big  system  consisting  of  the 
system  being  observed  interacting  with  the  measuring  device. 

Finally,  we  have  one  more  requirement,  which  is  that  the  linear 
operator  representing  an  observable  should  have  eigenvalues  that 
are  real.  This  isn’t  because  the  results  of  a  measurement  must  log¬ 
ically  be  real  —  e.g.,  we  can  measure  complex  impedances.  But  in 
any  real-world  application  of  the  complex  number  system,  we  must 
always  choose  some  arbitrary  phase  conventions,  such  as  that  an 
inductor  has  a  positive  imaginary  impedance  to  represent  the  fact 
that  the  voltage  leads  the  current  by  90  degrees.  (Such  phase  con¬ 
ventions  are  always  arbitrary  because  we  define  i  as  \/—l,  but  this 
doesn’t  distinguish  i  from  —  i.)  These  phase  conventions  are  all  in¬ 
dependent  of  one  another,  and  the  classical  ones  are  independent  of 
the  convention  used  for  wavefunctions  in  quantum  mechanics,  which 
is  that  a  state  with  positive  energy  twirls  clockwise  in  the  complex 
plane. 

Observables 

In  quantum  mechanics,  any  observable  is  represented  by  a  linear 
operator  that  takes  a  wavefunction  as  an  input  and  has  real  eigen¬ 
values. 


The  momentum  operator  example  28 

Quantum  mechanics  represents  motion  as  a  dependence  of  the 
wavefunction  on  position,  so  that  a  constant  wavefunction  has 
no  motion.  This  suggests  defining  the  momentum  operator  as 
the  derivative  with  respect  to  position.  This  almost  works,  but 
needs  to  be  tweaked  a  little.  We  expect  that  a  state  of  definite 
momentum  is  a  sine  wave  of  the  form  W  =  e'kx.  We  have  kA  =  2n 
and  p  =  h/ A  =  hk,  and  the  sign  is  a  matter  of  convention.  Taking 
the  derivative  of  W  gives  an  eigenvalue  /k,  which  has  the  wrong 
units  (easily  fixed  by  tacking  on  a  factor  of  h),  but  more  importantly 
is  not  real.  This  suggests  defining  the  momentum  operator  as 


A  nonexample  example  29 

Consider  the  one-dimensional  particle  in  a  box,  and  restrict  our 
attention  to  the  two  lowest-energy  states  and  their  superpositions. 
Define  an  operator  O  by  the  rule 

0(^)  =  v/' 

o(kD  =  -(^). 
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Since  O  is  linear,  defining  its  action  on  ^  and  v/^  suffices  to 
define  its  action  on  the  superpositions  of  these  states  as  well. 
This  operator  has  eigenvalues,  one  of  which  is  /,  corresponding 
to  the  state  ^  -  A/\  (It  also  has  a  second  eigenvalue,  which 
is  imaginary  as  well.)  Because  this  operator  doesn’t  have  real 
eigenvalues,  it  is  not  a  valid  observable. 

Note  that  in  examples  28  and  29,  it  doesn’t  matter  whether  the 
operator  is  defined  using  complex  numbers.  Our  definition  of  the 
momentum  operator  was  stated  using  an  equation  that  had  an  i  in 
it,  but  its  eigenvalues  are  real,  so  that’s  OK.  The  operator  O  in 
example  29  was  defined  using  only  real  numbers,  but  its  eigenvalues 
are  not  real. 

Position  is  an  observable  example  30 

If  we  have  a  wavefunction  ¥(x)  expressed  as  a  function  of  po¬ 
sition  x,  then  we  simply  take  the  operator  for  position  Ox  to  be 
multiplication  by  the  number  x, 


Ox{W)  =  xW. 


For  example,  if  W  =  elx  (ignoring  units),  then  Ox(T)  =  xelx.  This 
operator  is  definitely  linear,  because  multiplication  by  a  number  is 
linear,  e.g.,  7(a  +  b)  =  7a  +  7b.  The  only  question  is  whether  it  has 
eigenvalues,  and  whether  those  are  real.  A  state  of  definite  x,  say 
a  state  with  x  =  0,  would  have  to  be  represented  by  a  wavefunc¬ 
tion  T(x)  for  which  there  was  zero  probability  of  having  x  =/  0,  and 
this  requires  us  to  have  ¥(x)  =  0  for  nonzero  x.  But  what  would 
be  the  value  of  T(0)?  It  has  to  be  infinite  if  T  is  to  be  properly  nor¬ 
malized.  With  this  motivation,  the  physicist  P.A.M.  Dirac  defined 
Dirac  delta  function, 


6(x)  = 


for  x  0 
for  x  =  0 


Its  graph  is  an  infinitely  narrow,  infinitely  tall  spike  at  x  =  0,  and  it 
has  5(x)dx  =  1.  Mathematicians  will  shake  their  heads  and 
say  that  this  is  not  a  definition  of  a  function,  but  it’s  very  useful  to 
pretend  that  it  is,  and  the  delta  “function”  is  widely  used  in  a  vari¬ 
ety  of  fields  such  as  electrical  engineering.  Because  it  was  useful, 
mathematicians  felt  obliged  to  define  a  theory  in  which  functions 
are  generalized  to  things  called  distributions  or  generalized  func¬ 
tions. 

Becase  we  represent  an  observable  as  an  operator  that  changes  a 
wavefunction  into  a  new  wavefunction,  a  common  misconception  is 
that  this  change  represents  the  effect  of  measurement  on  the  system. 
Although  it  is  often  true  that  microscopic  systems  are  delicate,  so 
that  the  act  of  measurement  may  have  a  significant  effect  on  them, 
that  action  of  the  operator  on  the  wavefunction  does  not  represent 
that  effect.  For  example,  the  position  operator  Ox  from  example  30 
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consists  simply  of  multiplication  of  the  wavefunction  by  x.  Suppose 
we  have  a  particle  in  a  box  with  a  wavefunction  given  by  =  sin  x, 
where  we  ignore  units  and  normalization,  and  the  box  is  defined  by 
0  <  x  <  7T.  Then  Ox^>  eats  the  input  wavefunction  sin  a;  and  poops 
out  the  new  function  x  sin  x.  But  the  act  of  measuring  the  particle’s 
position  clearly  can’t  do  anything  like  this  —  for  one  thing,  the 
function  xsinx  has  larger  values  on  the  right  side  of  the  box  than 
on  the  left,  but  there  is  nothing  to  create  such  an  asymmetry  in 
either  the  original  state  or  the  measuring  process.  The  real-world 
effect  of  the  measurement  would  probably  be  to  knock  the  particle 
out  of  the  box  completely,  since  a  high-resolution  measurement  will 
have  a  small  uncertainty  Ax,  which  by  the  Heisenberg  uncertainty 
principle  means  creating  a  large  A p. 

Nor  is  it  always  true  that  measuring  a  system  disturbs  it.  For 
example,  suppose  that  we  prepare  a  beam  of  silver  atoms,  as  in  the 
Stern-Gerlach  experiment,  in  such  a  way  that  every  atom  is  guar¬ 
anteed  to  be  in  either  a  state  of  definite  Lx  =  +1/2  or  Lx  =  —1/2. 
That  is,  the  beam  may  be  a  mixture  of  both  of  these  possibilities, 
but  each  atom  is  guaranteed  have  its  spin  either  exactly  aligned 
with  the  magnetic  field  or  exactly  antiparallel  to  it.  Then  the  effect 
of  the  magnetic  field  is  simply  to  sort  out  the  two  types  of  atoms 
according  to  spin,  without  having  the  slightest  effect  on  those  spins. 

Phase  is  not  an  observable  example  31 

On  p.  916  we  listed  three  criteria  for  implementing  the  concept  of 
an  observable  in  quantum  mechanics,  and  one  of  these  was  that 
since  wavefunctions  that  differ  only  by  a  phase  describe  the  same 
state,  the  result  of  an  observation  should  not  depend  on  phase. 
For  this  reason,  it  should  not  be  a  surprise  that  the  mathematical 
definition  of  an  observable  that  we  came  up  with  does  not  allow 
for  the  creation  of  an  observable  to  describe  measurement  of  a 
phase. 

By  way  of  rigorous  proof,  suppose  to  the  contrary  that  we  did 
have  such  an  observable  Oph.  By  our  definition  of  an  observ¬ 
able,  it  would  have  to  have  some  set  of  eigenvalues  that  were 
real  numbers.  Consider  such  an  eigenvalue  cp,  which  might  per¬ 
haps  be  the  argument  of  the  wavefunction  in  the  complex  plane, 
although  we  will  not  need  to  assume  that.  Let  ¥  be  the  state  of 
definite  phase  having  the  phase  (p,  so  that 

[1]  OphW  =  cp¥. 

We  can  change  the  phase  of  ¥  to  create  a  new  wavefunction. 
Let’s  retard  its  phase  by  90  degrees,  creating  /¥.  Since  ¥  was  a 
state  of  definite  phase,  clearly  /¥  is  as  well,  and  and  it  must  have 
some  different  eigenvalue  cp'.  Perhaps  cp'  =  cp  +  n/2,  but  in  any 
case  we  must  have  cp'  =/  cp.  Then 

[2]  0ph(/¥)  =  cp'(/¥). 
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But  by  linearity  equation  [2]  is  equivalent  to  iOp^W  =  i(p'W,  or 
G>phT  =  rp'T,  and  therefore  by  comparison  with  equation  [1],  <p  = 
cp',  which  is  a  contradiction,  so  we  conclude  that  there  cannot  be 
an  observable  representing  phase. 

The  result  of  example  31  was  a  bit  of  a  foregone  conclusion,  since 
we  specifically  designed  our  notion  of  an  observable  to  be  insensitive 
to  phase.  Therefore  this  argument  is  subject  to  the  objection  that 
perhaps  there  is  some  way  to  measure  a  quantum-mechanical  phase, 
but  our  definition  of  an  observable  is  just  too  restrictive  to  describe 
it.  However,  we  will  see  on  p.  928  that  there  are  more  concrete 
reasons  why  phase  cannot  be  measured. 

Time  is  not  an  observable  example  32 

We  do  not  expect  to  have  a  time  operator  in  quantum  mechan¬ 
ics.  This  follows  simply  because  an  operator  is  supposed  to  be 
a  function  that  takes  a  wavefunction  as  an  input,  but  we  typically 
can’t  tell  what  time  it  is  by  looking  at  the  wavefunction.  For  exam¬ 
ple,  if  the  electron  in  a  hydrogen  atom  is  in  its  ground  state,  then 
we  could  say  its  energy  is  zero,  so  its  frequency  is  zero,  the  pe¬ 
riod  is  infinite,  and  the  wavefunction  doesn’t  vary  at  all  with  time. 
(We  can  choose  our  reference  level  for  the  electrical  energy  Ue\ec 
to  be  anything  we  like.  Even  if  we  choose  it  such  that  the  energy 
of  the  ground  state  is  nonzero,  the  only  change  in  the  electron’s 
wavefunction  over  time  will  be  a  phase  rotation,  which  by  example 
31  is  not  observable.) 

Of  course  this  doesn’t  mean  that  quantum  mechanics  forbids  us 
from  building  clocks.  It  just  tells  us  that  many  quantum  mechan¬ 
ical  systems  are  too  simple  to  function  as  clocks.  In  particular, 
we  would  be  misled  if  we  pictured  a  hydrogen  atom  classically  in 
terms  of  an  electron  traveling  in  a  circular  orbit  around  a  proton, 
in  which  case  it  really  could  act  like  the  hand  on  a  tiny  clock.  For 
further  discussion  of  this  idea,  see  p.  929 

Since  you’ve  already  studied  relativity,  you’ve  been  carefully  in¬ 
culcated  with  the  idea  that  space  and  time  are  to  be  treated  sym¬ 
metrically,  as  parts  of  a  more  general  thing  called  spacetime.  The 
differing  results  of  examples  30  and  32  are  clearly  not  consistent  with 
relativity.  This  is  to  be  expected  because  the  Schrodinger  equation 
is  nonrelativistic  (cf.  self-check  G,  p.  876),  and  the  principles  laid 
out  in  this  section  are  the  principles  of  nonrelativistic  quantum  me¬ 
chanics. 

Parity  example  33 

In  freshman  calculus  you  will  have  encountered  the  notion  of  even 
and  odd  functions.  In  quantum  mechanics,  we  can  have  even 
and  odd  wavefunctions,  and  they  can  be  distinguished  from  one 
another  using  the  parity  operator  V.  If  ¥(x)  is  a  wavefunction, 
then  VW  is  a  new  wavefunction,  call  it  W,  such  that  T'(x)  =  T(-x). 
In  other  words,  the  parity  operator  flips  the  wavefunction  across 
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the  origin.  (In  three  dimensions,  we  negate  all  three  coordinates.) 
States  of  definite  parity  are  represented  by  wavefunctions  that  are 
even  (eigenvalue  +1)  or  odd  (-1). 

States  of  definite  angular  momentum  example  34 

In  section  13.5.2,  p.  910,  we  saw  that  the  kinetic  energy  of  a 
quantum  mechanical  rotor  is  proportional  not  to  £2  but  instead 
to£{£+'\).  This  was  justified  qualitatively  in  terms  of  the  solutions 
of  the  Schrodinger  equation  for  a  particle  on  a  sphere,  but  in  fact 
there  is  a  deeper  reason,  which  is  that  the  eigenvalues  of  the 
orbital  angular  momentum  operator  turn  out  to  be  £(£  +  1).  The 
parity  of  such  a  state  is  (-1)£,  which  can  be  seen  in  figure  g  on 
p.  895. 

If  we  have  two  observables,  it  may  or  may  not  be  possible  to 
measure  them  both  on  the  same  state  and  get  exact  and  meaningful 
results.  Position  and  momentum  p  and  x  are  incompatible  observ¬ 
ables,  as  expressed  by  the  Heisenberg  uncertainty  principle.  No  state 
is  simultaneously  a  state  of  definite  p  and  of  definite  x.  The  mag¬ 
nitude  of  an  angular  momentum  L  and  its  component  along  some 
axis  Lz  are  compatible.  It  is  common  to  have  a  state  thare  is  simul¬ 
taneously  a  state  of  definite  L  and  of  definite  Lz.  Another  example 
of  incompatible  observables  is  Lz  and  Lx ,  as  proved  on  p.  891. 

The  inner  product 

We’ve  defined  the  normalization  of  a  wavefunction  as  the  re¬ 
quirement  T'*d/  dx  =  1,  which  means  that  the  total  probability 
that  the  particle  is  somewhere  equals  1.  (Another  way  of  writing 
'h*'h  would  be  |\k|2.)  This  assumes  that  the  wavefunction  is  writ¬ 
ten  as  a  function  of  the  position  x.  But  it  is  also  possible  to  have 
a  wavefunction  that  depends  on  some  other  variable,  such  as  spin 
or  momentum,  or  on  some  combination  of  variables,  e.g.,  both  the 
spin  s  and  the  position  x  of  an  electron,  H/(x,  s).  We  can  also  use 
a  wavefunction  to  describe  a  correlation  between  multiple  particles, 
in  which  case  the  wavefunction  might  look  like  \I'(xi,X2).  The  vari¬ 
ables  that  the  wavefunction  depends  on  may  be  either  continuous, 
like  position  and  momentum,  or  discrete,  like  spin  or  angular  mo¬ 
mentum.  Given  all  of  these  possibilities,  we  need  to  figure  out  an 
appropriate  generalization  of  the  integral  over  x  that  we  originally 
used  to  define  our  normalization  condition.  To  provide  for  flexibility 
and  generality,  we  will  start  by  simply  defining  a  new  notation  that 
looks  like  this: 

(T|T)  =  1. 

In  the  case  where  T  is  a  function  of  x  alone,  the  angle  brackets 
(. . .  |  . . .)  basically  mean  just  an  integral  over  x,  and  we  think  of  the 
(. . .  |  part  as  automatically  implying  the  complex  conjugation  of  the 
thing  inside  it.  The  operation  (. . .  |  . . .)  is  called  the  inner  product. 

Because  negative  probabilities  don’t  make  sense,  we  require  that 
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the  inner  product  of  a  wavefunction  with  itself  always  be  positive, 

(u\u)  >  0. 

This  makes  it  similar  to  the  dot  product  used  with  vectors  in  Eu¬ 
clidean  geometry. 

In  the  case  of  Euclidean  geometry,  the  ability  to  add  vectors 
and  measure  their  lengths  automatically  gives  us  a  way  to  judge 
the  similarity  of  two  vectors.  For  example,  if  |u|  =  1,  |u|  =  1,  and 
\u  +  v |  =  2,  then  we  conclude  that  u  and  v  are  in  the  same  direction. 
On  the  other  hand,  if  |u|  =  1,  |u|  =  1,  and  |w  +  u|  =  \f2 ,  then  we  can 
tell  that  u  and  v  are  perpendicular,  which  makes  them  as  different 
as  two  unit-length  vectors  can  be.  More  generally,  (u  +  v)  ■  (u  +  v)  = 
\u\2  +  \v\2+2u-v,  because  the  dot  product  is  linear,  so  we  can  see  that 
the  information  about  how  similar  u  and  v  are  is  all  contained  in 
their  dot  product  u-v.  Making  the  analogy  with  quantum  mechanics, 
we  expect  that  since  we  can  define  normalization  of  wavefunctions, 
we  should  automatically  get,  “for  free,”  a  way  of  measuring  how 
similar  two  states  are. 

With  this  motivation,  we  assume  that  there  is  an  inner  product 
on  wavefunctions  that  has  properties  analogous  to  those  of  the  dot 
product.  We  assume  linearity,  so  that  if  u.  v,  and  w  are  wavefunc¬ 
tions,  then 


(u\av  +  (3w )  =  a(u\v)  +  /3(u\w) 


and 


(era  +  j3v\w)  =  a*  (u\w)  +  (3*(v\w ). 

In  the  second  equation,  we  need  to  take  the  complex  conjugates  a* 
and  (3*,  for  if  we  omitted  the  conjugation,  then  when  (u\u)  =  1 
we  would  have  {iu\iu)  =  —  1,  describing  a  negative  probability.  For 
similar  reasons,  we  require  that 

(u\v)  =  (v\u)* 

rather  than  the  more  familiar  property  of  the  Euclidean  dot  product 
u  ■  v  =  v  ■  u. 


Inner  product 

Wavefunctions  come  equipped  with  an  inner  product  that  has  the 
properties  described  above. 


If  we’re  dealing  with  wavefunctions  that  are  expressed  as  func¬ 
tions  of  position,  then  it’s  pretty  clear  how  to  define  an  appropriate 
inner  product:  (u\v)  =  f  u*vdx.  The  inner  product  axiom  stated 
above  then  requires  that  this  (possibly  improper)  integral  converge 
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in  all  cases,  which  means,  for  example,  that  we  have  to  exclude 
infinite  plane  waves  from  consideration.  However,  because  it’s  so 
convenient  sometimes  to  talk  about  plane  waves,  we  may  break  this 
rule  when  nobody  is  looking.  Note  the  similarity  between  the  ex¬ 
pression  f  u*vdx  and  the  expression  uxvx  +  uyvy  +  uzvz  for  a  dot 
product;  the  integral  is  a  continuous  sum,  and  the  dot  product  is  a 
discrete  sum. 

Two  wavefunctions  have  a  zero  inner  product  if  and  only  if  they 
are  completely  distinguishable  from  each  other  by  the  measurement 
of  some  observable.  By  analogy  with  vectors  in  Euclidean  space, 
we  say  that  the  two  wavefunctions  are  orthogonal.  For  example, 
Mv/')  =  0,  as  can  be  verified  from  the  integral  sin  x  sin  2x  dx  = 
0.  These  states  are  also  distinguishable  by  measuring  either  their 
momentum  or  their  energy.  Let’s  consider  more  carefully  the  gen¬ 
eral  justification  for  this  assertion  that  perfect  distinguishability  is 
logically  equivalent  to  a  zero  inner  product. 

We  have  described  valid  observables  in  quantum  mechanics  as 
being  represented  by  operators  that  have  real  eigenvalues.  An  alter¬ 
native  description  of  such  an  operator  O,  called  a  Hermitian  opera¬ 
tor  after  Charles  Hermite,  is  that  it  is  one  such  that  (Ou\v)  =  (u\Ov) 
for  any  u  and  v.  To  see  this,  let  e  be  an  eigenvalue,  On  =  eu  for 
ti  /  0.  Then  (Ou\u)  =  (u\Ou),  so  (eu\u)  =  (u\eu),  and  e*{u\u)  = 
e(u\u),  so  e*  =  e,  meaning  that  e  is  real. 

Using  this  fact  about  observables,  we  now  consider  states  u  and 
v  that  can  be  distinguished  empirically  because  they  are  states  with 
different  values  of  some  observable,  Ou  =  e\ u  and  Ov  =  e2V.  Be¬ 
cause  O  is  Hermitian,  we  have  (Ou\v)  =  (u\Ov),  so  e\(u\v)  = 
e2(u\v).  But  since  e\  and  e2  are  real  and  unequal,  we  must  have 
(tt|v)  =  0. 

Going  in  the  opposite  direction,  suppose  that  (u\v)  =  0.  How 
can  we  then  conclude  that  there  exists  some  observable  O  that  can 
distinguish  them?  There  is  no  straightforward  mathematical  reason 
why  this  must  be  true,  but  it  would  not  make  sense  physically  to  talk 
about  two  states  that  were  utterly  distinct  and  yet  indistinguishable 
by  any  experiment.  We  therefore  take  this  as  a  postulate. 


Completeness 

For  any  system  of  interest,  there  exists  a  set  of  compatible  observ¬ 
ables,  called  a  complete  set,  such  that  any  state  of  the  system  can  be 
expressed  as  a  sum  of  wavefunctions  having  definite  values  of  these 
observables. 


For  example,  we  have  seen  in  example  24  on  p.  892  that  for  a 
rotor,  the  state  with  1=1  and  lx  =  0  can  be  written  as  a  sum 
of  the  states  with  lz  =  —  1  and  iz  =  1.  In  the  language  of  the 
completeness  postulate,  we  can  express  this  as  follows.  Let  our 
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system  be  the  set  of  possible  states  of  a  rotor.  The  observables  L 
and  Lz  are  compatible,  and  they  turn  out  (although  we  will  not 
prove  it  here)  to  be  a  complete  set  of  observables  for  this  system. 
The  completeness  postulate  is  satisfied  in  this  example  because,  as 
shown  in  example  24,  the  state  with  £x  =  0  can  be  expressed  as 

*ez=-i/V2  +  *ez=i/V2. 

Because  the  physical  motivation  for  the  completeness  postulate 
is  so  appealing,  physicists  are  willing  to  stretch  the  definition  of  the 
word  “sum”  in  order  to  make  it  true.  The  sum  can  be  an  infinite 
sum,  and  in  certain  cases  we  may  even  need  to  make  it  an  integral, 
which  is  a  kind  of  continuous  sum.  For  example,  consider  a  one¬ 
dimensional  particle  in  a  box.  A  complete  set  of  observables  for  this 
system  can  be  found  by  picking  the  energy  operator  alone.  Now  sup¬ 
pose  we  throw  a  particle  in  the  box,  in  such  a  way  that  its  position  is 
equally  likely  to  be  anywhere  in  the  box,  i.e. ,  its  wavefunction  is  sup¬ 
posed  to  be  constant  throughout  the  box.  Ignoring  normalization, 
this  constant  wavefunction  can  be  expressed  as  an  infinite  series  in 
terms  of  the  states  of  definite  energy  as  +  itVV'  +  •  •  •  This 

kind  of  representation  of  a  function  as  an  infinite  sum  of  sine  waves 
is  called  a  Fourier  series. 

13.5.4  Energy-time  uncertainty 

Classical  uncertainty  relations 

Consider  the  following  classical  system  of  analogies. 

space  x  k  AxAk  >  1 

time  t  uj  AtAoo  >  1 

Here  the  quantity  k  =  2n /lambda  is  called  the  wavenumber.  The  in¬ 
equality  AxAk  >  1  is  a  kind  of  classical  uncertainty  relation  that  is 
closely  related  to  the  Heisenberg  uncertainty  principle.  Its  classical 
nature  is  immediately  apparent  because  it  doesn’t  involve  Planck’s 
constant.  If  you  look  back  at  the  argument  given  on  p.  871  to  jus¬ 
tify  the  Heisenberg  uncertainty  principle,  you  will  see  that  it  carries 
through  equally  well  if  we  simply  omit  the  quantum-mechanical  in¬ 
gredients  and  use  it  to  put  a  bound  on  AxAk  instead  of  AxAp. 
Once  we’ve  established  the  bound  on  AxAk,  the  one  on  Ax  A p  fol¬ 
lows  immediately  because  p  =  h/\  =  hk. 

The  second  line  of  the  table  is  in  strict  analogy  to  the  first  line. 
A  good  practical  example  is  the  high-speed  transmission  of  digital 
data  over  transmission  lines  such  as  fiber-optic  cables.  Suppose  that 
we  wish  to  send  a  string  of  0’s  and  l’s,  and  a  1  is  to  be  represented  by 
a  square  pulse.  If  we  want  to  transmit  the  data  at  high  speed,  then 
we  need  the  duration  At  of  this  pulse  to  be  short,  perhaps  in  the 
microsecond  or  even  nanosecond  range.  This  cannot  be  done  if  the 
signal  consists  only  of  a  single  frequency.  A  signal  that  only  contains 
a  single,  pure  frequency  is  just  a  sinusoidal  wave  that  has  existed 
infinitely  far  back  in  the  past  and  will  exist  infinitely  far  into  the 
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future.  Such  a  wave  carries  uo  information  at  all.  Out  frequency¬ 
time  uncertainty  relation  tells  us  that  if  the  duration  of  a  pulse  is  to 
be,  say,  a  microsecond,  then  the  signal’s  spread  in  frequency  much 
be  at  least  on  the  order  of  1  MHz.  This  is  why  we  use  the  term 
“bandwidth”  to  describe  the  speed  of  a  communication  channel. 

Energy-time  uncertainty 

In  a  quantum- mechanical  context,  we  have  E  =  huj,  so  there  is 
an  energy-time  uncertainty  relation, 

AEAt  >  h. 

As  with  the  Heisenberg  uncertainty  principle  for  momentum  and 
position,  the  symbol  >  means  that  we  leave  out  a  numerical  factor, 
which  can  only  be  precisely  defined  if  we  fix  some  specific  statistical 
definition  of  A,  e.g.,  a  standard  deviation. 

The  interpretation  of  the  energy-time  uncertainty  relation  is  a 
little  tricky,  because  although  the  classical  analogy  between  space 
and  time  is  exact,  the  quantum- mechanical  analogy  breaks  down. 
This  is  because  time  in  nonrelativistic  quantum  mechanics,  unlike 
position,  is  not  an  observable  (example  32).  Time  in  this  theory 
is  just  a  universal  parameter.  The  physicist  Lev  Landau  liked  to 
tell  his  students  that  there  was  no  energy-time  uncertainty  relation, 
because  “I  can  measure  the  energy,  and  look  at  my  watch;  then  I 
know  both  energy  and  time!”  One  good  way  of  interpreting  it  is 
that  if  there  is  a  transfer  of  energy  between  two  systems,  then  it 
relates  the  uncertainty  A E  in  the  amount  of  energy  transferred  to 
the  duration  At  of  the  interaction. 

For  example,  suppose  we  wish  to  bounce  a  photon  off  of  a  hydro¬ 
gen  atom  in  order  to  determine  whether  the  atom  is  in  its  ground 
state.  This  is  not  necessarily  an  easy  thing  to  do  by  extracting  what¬ 
ever  information  we  get  from  the  reflected  photon,  but  the  ground 
state  is  orthogonal  to  the  other  states,  so  we  are  at  least  encour¬ 
aged  to  believe  that  it  is  not  theoretically  impossible.  But  there  is  a 
hard  theoretical  limit  on  how  quickly  we  can  make  such  a  determi¬ 
nation.  The  difference  in  energy  between  the  ground  state  and  the 
first  excited  state  is  1.6  x  10-18  J,  so  we  must  use  a  photon  with 
an  energy  less  than  this  amount,  or  else  the  act  of  observing  the 
atom  may  in  fact  destroy  the  property  we  were  hoping  to  measure. 
By  the  energy-time  uncertainty  relation,  this  implies  that  the  mea¬ 
surement  process  cannot  be  done  in  less  than  about  10~15  seconds. 
This  example  may  seem  impractical,  but  in  fact  computer  memories 
are  starting  to  reach  the  level  of  speed  and  miniaturization  at  which 
such  fundamental  constraints  become  relevant. 

Mortality  for  hydrogen  example  35 

In  atomic  physics,  when  a  photon  is  emitted  or  absorbed  it  is  al¬ 
most  always  in  a  wave  pattern  with  angular  momentum  1  (i.e.,  1  h) 
and  negative  parity  (example  33).  Classically,  this  is  the  type  of 
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radiation  pattern  that  we  would  get  from  an  electric  dipole  spin¬ 
ning  end  over  end,  so  we  call  it  an  electric  dipole  transition.  Be¬ 
cause  the  electromagnetic  interaction  has  a  symmetry  between 
left-  and  right-handedness  (section  11.1.5,  p.  665),  this  means 
that  an  electric  dipole  transition  can  never  cause  a  transition  from 
one  state  of  an  atom  to  another  state  with  the  same  parity. 

Now  the  ground  state  of  the  hydrogen  atom  has  t  =  0  and  is 
therefore  a  state  of  positive  parity.  One  of  the  first  excited  states, 
referred  to  as  the  2s  state,  also  has  these  properties,  and  there¬ 
fore  it  is  impossible  for  the  2s  state  to  decay  to  the  ground  state 
by  emitting  an  electric  dipole  photon.  The  happy  atom  proba¬ 
bly  believes  that  once  it’s  in  the  exalted  2s  state,  it  can  stay  that 
way  forever.  One  way  for  it  to  be  cheated  of  immortality  is  if  it 
undergoes  a  collision  with  another  atom,  but  in  some  so-called 
planetary  nebulae  (hot  clouds  of  gas  cast  off  by  dying  stars),  the 
density  can  be  so  low  that  collisions  are  very  infrequent.  In  this 
situation,  the  dominant  process  for  decay  of  the  2s  state  can  be 
the  simultaneous  emission  of  two  photons.  An  exact  and  rigorous 
calculation  of  the  rate  of  decay  for  this  process  is  quite  technical, 
but  a  fairly  reasonable  estimate  can  be  obtained  by  the  following 
semiclassical  argument  based  on  the  energy-time  uncertainty  re¬ 
lation. 

The  typical  rate  of  emission  for  a  photon,  when  not  forbidden  by 
parity,  is  R  ~  109  s~\  i.e.,  it  takes  about  a  nanosecond.  We  can 
think  of  the  two-photon  decay  as  an  energy-nonconserving  jump 
up  to  some  higher-energy  state,  with  the  emission  of  a  photon, 
followed  by  the  emission  of  a  second  photon  leading  down  to  the 
ground  state.  The  first  jump  can  happen  because  of  the  energy¬ 
time  uncertainty  relation,  which  allows  the  electron  to  stay  in  the 
intermediate  state  for  a  time  t  ~  h/E,  which  is  on  the  order  of 
10~15  s.  The  probability  for  the  second  photon  to  be  emitted 
within  this  time  is  Rt,  so  the  rate  for  the  whole  two-photon  pro¬ 
cess  is  R2t  ~  10  s'1.  Considering  the  extremely  crude  nature 
of  this  calculation,  the  result  is  in  good  agreement  with  the  ob¬ 
served  rate  of  about  0.1  s_1.  The  process  is  actually  observed, 
and  contributes  a  continuous  background  spectrum  in  addition  to 
the  discrete  line  spectrum  when  such  nebulae  are  observed  with 
a  spectrometer  through  a  telescope. 

Randomization  of  phase  in  a  measurement 

The  energy-time  uncertainty  relation  can  help  us  to  understand 
one  of  the  most  puzzling  issues  in  quantum  mechanics,  which  is  the 
problem  of  measurement.  What  happens  when  we  use  a  macroscopic 
measuring  device,  which  is  well  described  by  classical  physics,  to 
observe  a  microscopic  system,  which  is  quantum-mechanical?  How 
do  we  reconcile  these  two  seemingly  incompatible  descriptions  of 
reality  when  both  appear  to  be  in  play  simultaneously? 
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h  /  Spying  on  one  slit  in  the 
double-slit  experiment. 


Consider  an  electron  passing  through  a  double-slit  apparatus. 
We  have  already  considered  the  possibility  of  covering  one  slit  (dis¬ 
cussion  question  D,  p.  857).  Suppose  instead  that  we  carefully  watch 
one  slit  through  a  microscope,  and  see  whether  or  not  the  electron 
passed  through  it.  If  we  could  perform  this  observation  without  dis¬ 
turbing  the  electron,  then  a  paradox  would  arise.  For  if  we  haven’t 
disturbed  the  electron,  then  there  should  still  be  a  double-slit  in¬ 
terference  pattern.  But  if  we  watch  one  slit,  then  half  of  the  time 
we  should  see  that  the  electron  did  not  go  through  it,  and  therefore 
the  slit’s  existence  is  of  no  importance,  and  we  can’t  possible  get  a 
double-slit  interference  pattern. 

To  avoid  this  contradiction,  it  appears  that  nature  must  conspire 
against  us  in  such  a  way  that  observing  the  slit  inevitably  does 
disturb  the  electron.  The  energy-time  uncertainty  relation  explains 
why  this  is  so.  Our  observation  of  the  electron  is  an  interaction 
between  the  electron  and  our  macroscopic  measuring  device.  This 
interaction  will  presumably  transfer  some  amount  of  energy  E  into 
or  out  of  the  electron,  and  if  our  goal  was  to  avoid  disturbing  the 
electron,  we  would  imagine  that  it  would  be  best  to  make  E  very 
small.  But  the  energy-time  uncertainty  AEAt  >  h  relation  tells  us 
that  if  this  energy  is  to  have  a  value  that  is  confined  to  some  small 
range  AE,  then  the  time  At  it  takes  for  the  interaction  to  occur 
must  be  at  least  ~  h/ AE.  While  the  electron  is  being  subjected  to 
this  interaction,  its  phase  is  rotating  around  the  complex  plane  like 
elU}t  _  e iEt/h'  pjie  change  in  the  phase  angle  <f>  =  EAt/h  is 
uncertain  because  E  is  uncertain,  so  our  observation  will  inevitably 
change  the  phase  by  some  random  amount,  which  is  uncertain  by 
an  amount  A <f>  =  AEAt/h,  so  A</>  >  1. 

Thus  is  won’t  actually  help  us  if  we  make  the  interaction  very 
gentle,  because  the  lengthening  of  the  time  has  a  compensating  ef¬ 
fect.  Any  slight  alteration  in  the  frequency  will  have  more  time  to 
accumulate  into  a  big  phase  difference,  and  we  still  end  up  with  a 
phase  uncertainty  that  is  at  least  on  the  order  of  1.  Although  we 
haven’t  stated  our  uncertainty  relations  with  enough  mathemati¬ 
cal  precision  to  state  this  lower  bound  with  all  the  right  factors  of 
2  and  7 r,  it  turns  out  that  Acj)  >  2ir.  That  is,  any  such  observa¬ 
tion  will  have  the  effect  of  completely  randomizing  the  phase  of  the 
thing  being  observed.  In  fact,  macroscopic  measuring  devices  nor¬ 
mally  exceed  the  bounds  set  by  the  uncertainty  relations  by  many 
orders  of  magnitude,  so  there  will  typically  be  a  vast  amount  of 
overkill  in  this  randomization.  This  is  a  general  rule  for  reasoning 
about  quantum-mechanical  measurements:  they  always  completely 
randomize  the  quantum-mechanical  phase  of  the  thing  being  mea¬ 
sured.  This  provides  a  more  physical  justification  for  our  more  ab¬ 
stract  mathematical  proof  in  example  31  on  p.  920  that  phase  is  not 
an  observable. 

In  our  example  of  the  double  slit,  what  will  be  the  effect  of  this 
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randomization  of  the  electron’s  phase?  In  our  usual  description  of 
the  double  slit,  we  assume  that  the  circular  waves  emerging  from  one 
slit  are  in  phase  with  those  that  come  out  through  the  other  one, 
so  that  the  double-slit  interference  pattern  has  a  maximum  in  the 
center.  But  if,  for  example,  one  of  the  waves  has  its  phase  inverted, 
then  all  the  maxima  of  our  interference  pattern  will  become  minima 
and  vice  versa.  If  the  phase  is  randomized,  then  the  positions  of  the 
maxima  and  minima  are  randomized  as  well,  and  thus  if  we  try  to 
collect  data  on  enough  electrons  to  see  an  interference  pattern,  we 
will  not  see  maxima  and  minima  at  all. 

Another  fundamental  application  of  the  energy-time  uncertainty 
relation  is  to  the  explication  of  what  it  means  to  measure  time  in 
quantum  mechanics.  In  example  32  on  p.  921  we  argued  that  time 
is  not  an  observable  in  quantum  mechanics  because  time  cannot  in 
general  be  measured  by  looking  at  a  quantum-mechanical  system: 
many  quantum-mechanical  systems  are  too  simple  to  function  as 
clocks.  We  can  now  see  in  more  detail  what  “too  simple”  might 
mean  here.  Microscopic  systems,  unlike  macroscopic  ones,  are  of¬ 
ten  encountered  in  a  definite  state  of  energy,  such  as  the  ground 
state.  Such  a  state  has  A E  =  0  and  therefore  by  the  energy-time 
uncertainty  relation  it  has  At  =  oo.  In  other  words,  the  only  time 
evolution  in  such  a  system  consists  of  the  system’s  over-all  phase 
twirling  in  the  complex  plane  at  a  steady  rate,  but  phase  isn’t  mea¬ 
surable,  so  we  can’t  use  this  rotation  like  the  hand  on  a  clock.  To 
make  a  clock,  we  need,  at  a  bare  minimum,  a  system  that  is  in  a 
superposition  of  two  different  energy  levels.  We  then  have  two  in¬ 
dependent  phases.  Although  absolute  phases  are  not  measurable, 
relative  ones  are,  and  for  example  when  we  measure  a  double-slit 
interference  pattern,  that  is  exactly  what  we  are  doing:  observing 
(statistically)  the  difference  between  two  phases.  As  a  loose  concep¬ 
tual  analogy,  this  is  like  the  idea  that  a  figure-eight  Lissajous  pattern 
has  an  identifiable  feature  where  it  crosses  itself,  the  crossing  being 
like  the  tick  of  a  clock. 


i/The  Lissajous  figure  x  =  cost, 
y  =  sin  2 1. 


13.5.5  Quantum  computing  and  the  no-cloning  theorem 

Computers  and  information  transmission  systems  such  as  the 
internet  are  currently  implemented  as  classical  devices.  For  example, 
the  wavelengths  of  the  electrons  that  carry  signals  in  a  computer 
chip  are  currently  orders  of  magnitude  shorter  than  the  size  of  the 
logic  gates,  so  that  wave  effects  such  as  diffraction  and  interference 
are  not  important  (problem  22,  p.  936).  Even  if  the  current  devices 
such  as  silicon  chips  and  fiber-optic  cables  could  simply  be  scaled 
down  to  sizes  comparable  to  the  electrons’  wavelengths,  quantum 
effects  would  at  some  point  simply  make  them  start  breaking  down 
or  behaving  unreliably. 

It  is  possible,  however,  to  design  qualitatively  different  devices 
in  which  information  and  signals  are  intentionally  manipulated  in  an 
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explicitly  quantum-mechanical  fashion.  This  is  the  frontier  known 
as  quantum  computing.  In  a  quantum  computer,  the  basic  unit  of 
information  is  not  the  classical  bit  but  the  quantum  bit  or  qubit.  A 
qubit  can  exist  in  a  superposition  of  the  0  and  1  states,  with  a  well 
defined  phase,  e.g.,  To  +  is  a  different  state  than  To  —  Ti  or 
To  +  iTi.  Furthermore,  one  qubit  can  have  its  state  entangled  with 
another’s.  For  example,  T01+T10  describes  a  state  in  which  we  have 
two  bits,  neither  in  a  definite  0  or  1  state,  but  which  are  guaranteed 
to  add  up  to  1.  That  is,  if  one  is  true,  then  the  other  is  guaranteed 
to  be  false.  It  has  been  shown  that  some  problems  that  are  hard 
for  classical  computers  are  more  tractable  for  a  quantum  computer. 
For  example,  there  is  a  known  quantum  computing  algorithm  that 
is  capable  of  efficiently  factoring  large  integers,  and  when  this  is 
eventually  implemented  in  a  practical  device,  it  will  have  the  effect 
of  breaking  the  cryptographic  algorithms  that  you  currently  use  for 
online  privacy  and  security,  since  the  security  of  those  algorithms  is 
predicated  on  the  assumption  that  factorization  is  hard.  This  would 
be  a  disaster  for  online  economic  activity  and  could  have  effects  such 
as  unmasking  political  dissidents. 

A  different  application,  and  one  that  is  easier  to  explain,  is  that 
quantum  computing  makes  it  possible  in  theory  to  make  copy-proof 
information.  This  would  not  be  useful  to  Hollywood  studios  trying 
to  prevent  copying  of  their  movies,  since  the  images  have  to  pass 
through  classical  devices  anyway  in  order  to  be  displayed,  but  it 
means  that  one  might  be  able  to  send  private  information  through 
a  quantum  internet  in  such  a  way  that  it  could  not  be  copied  by 
snoops,  even  in  theory.  In  contrast,  current  classical  methods  of 
encryption  are  designed  to  allow  eavesdropping  on  an  information 
packet  as  it  hops  across  the  internet,  but  to  make  the  copy  useless 
to  prying  eyes  because  it  cannot  be  decoded. 

The  theoretical  key  to  this  application  of  quantum  computing  is 
the  counterintuitive  no-cloning  theorem,  which  states  that  it  is  not 
possible  to  make  a  copy  of  an  unknown  quantum  state.  To  see  why 
this  works,  suppose  that  we  implement  a  qubit  using  the  spin  1/2 
of  a  silver  atom,  with  the  convention  that  the  0  state  is  represented 
by  sx  =  —1/2  and  1  by  sx  =  +1/2.  If  you  provide  me  with  an 
atom  that  you  have  prepared,  then  it  might  seem  straightforward, 
at  least  in  principle,  for  me  to  copy  its  state.  I  can  shoot  it  through 
a  magnetic  spectrometer,  as  in  the  Stern-Gerlach  experiment,  and 
measure  its  sx.  Then  I  prepare  another  silver  atom  in  the  same 
state.  What’s  the  problem? 

The  problem  is  that  if  the  state  of  the  atom  is  truly  unknown  to 
me,  then  I  have  no  way  of  knowing  that  it  is  actually  in  one  of  the 
two  states  sx  =  —1/2  and  sx  =  +1/2.  It  could  instead  be  in  some 
superposition  of  these,  such  as  TSa,=_1/2/v/2  +  *T Saf=_J_1 /2/ V2,  with 
a  90-degree  phase  angle  between  the  two  components.  Then  when 
I  send  your  atom  through  the  spectrometer,  the  world  becomes  one 
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in  which  both  the  spectrometer  and  my  brain  are  in  a  superposition 
of  the  two  states.  In  one  of  these  worlds,  I  then  go  ahead  and  pre¬ 
pare  my  copy- atom  in  the  sx  =  —1/2  state,  and  in  the  other  one  I 
set  it  up  as  a  +1/2.  You  could  say  that  my  copy-atom  is,  like  the 
original,  in  a  superposition  of  the  two  sx  states,  but  there  is  no  rea¬ 
son  to  think  that  it  will  be  the  same  superposition,  with  the  same 
90-degree  phase  angle.  In  fact,  by  the  argument  on  p.  927  we  know 
that  it  is  not  possible  by  any  measurement  to  extract  this  phase 
information  and  convert  it  into  classical  information.  Furthermore, 
our  final  result  is  not  really  as  simple  as  a  copy-atom  in  some  un¬ 
known  superposition  of  the  two  sx  states.  It  is  a  silver  atom  whose 
spin  is  correlated  with  the  state  of  the  original,  but  also  correlated 
with  the  state  of  the  spectrometer  and  the  state  of  my  brain. 

The  impossibility  of  copying  an  unknown  quantum  state  is  en¬ 
forced  by  nature  in  full  generality,  not  just  by  the  specific  mecha¬ 
nisms  described  in  the  artificial  scenario  described  above.  To  see 
why,  consider  what  would  happen  to  the  state  of  the  “blank”  atom 
on  which  we  had  hoped  to  impose  the  copied  state.  Its  state  would 
have  been  overwritten,  but  this  would  imply  a  loss  of  information, 
which  is  forbidden  by  the  unitarity  postulate  of  quantum  mechanics 
(p.  913). 

The  no-cloning  theorem  would  seem  to  severely  limit  the  prac¬ 
ticability  of  a  quantum  computing.  When  you  run  a  program  on  a 
classical  computer,  the  very  first  step  to  be  performed  by  the  oper¬ 
ating  system  is  to  copy  the  program’s  code  and  data  from  storage 
into  random-access  memory.  If  a  quantum  computer  can’t  copy  any¬ 
thing,  then  how  do  we  perform  this  initial  step?  But  the  no-cloning 
theorem  doesn’t  actually  forbid  copying  any  quantum  state  —  it 
forbids  copying  an  unknown  state.  Going  back  to  the  example  of 
the  silver  atom,  imagine  that  rather  than  presenting  me  with  a  sil¬ 
ver  atom  in  a  completely  unknown  quantum  state,  you  give  me  a 
solemn  promise  that  it  will  be  either  in  the  state  sx  =  —1/2  or  the 
state  sx  =  +1/2  —  not  some  superposition  of  these.  Then  if  you 
trace  back  through  the  logic  of  the  scenario,  you  will  find  that  there 
is  absolutely  nothing  preventing  me  from  making  an  accurate  copy. 

Once  the  software  on  a  quantum  computer  starts  running,  its 
qubits  will  certainly  start  going  into  superpositions  of  the  0  state 
and  the  1  state.  By  the  no-cloning  theorem,  these  cannot  be  copied 
from  one  memory  location  to  another,  overwriting  the  previous  con¬ 
tents  of  the  target  location.  But  that  simply  isn’t  how  quantum 
computing  works.  Rather  than  attempting  to  copy,  erase,  and  over¬ 
write  bits  as  in  a  classical  computer,  the  software  is  designed  to  cre¬ 
ate  complicated  correlations  between  the  different  bits.  This  model 
of  computing  is  not  necessarily  better  or  worse  over  all  than  classi¬ 
cal  digital  computing,  but  it  differs  from  it  as  much  as  an  iPhone’s 
model  of  computing  differs  from  that  of  a  slide  rule. 
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Problems 


The  symbols  \/,  ^ ,  etc.  are  explained  on  page  943. 

1  If  a  radioactive  substance  has  a  half-life  of  one  year,  does  this 
mean  that  it  will  be  completely  decayed  after  two  years?  Explain. 


activity 
(millions 
of  de¬ 
cays  per 
second) 


time  (minutes) 


Problem  6. 


2  What  is  the  probability  of  rolling  a  pair  of  dice  and  getting 
“snake  eyes,”  i.e. ,  both  dice  come  up  with  ones? 

3  Problem  3  has  been  deleted.  ■ 

4  Problem  4  has  been  deleted.  ^ 

5  Refer  to  the  probability  distribution  for  people’s  heights  in 
figure  f  on  page  836. 

(a)  Show  that  the  graph  is  properly  normalized. 

(b)  Estimate  the  fraction  of  the  population  having  heights  between 

140  and  150  cm.  V 

6  (a)  A  nuclear  physicist  is  studying  a  nuclear  reaction  caused  in 
an  accelerator  experiment,  with  a  beam  of  ions  from  the  accelerator 
striking  a  thin  metal  foil  and  causing  nuclear  reactions  when  a  nu¬ 
cleus  from  one  of  the  beam  ions  happens  to  hit  one  of  the  nuclei  in 
the  target.  After  the  experiment  has  been  running  for  a  few  hours, 
a  few  billion  radioactive  atoms  have  been  produced,  embedded  in 
the  target.  She  does  not  know  what  nuclei  are  being  produced,  but 
she  suspects  they  are  an  isotope  of  some  heavy  element  such  as  Pb, 
Bi,  Fr  or  U.  Following  one  such  experiment,  she  takes  the  target  foil 
out  of  the  accelerator,  sticks  it  in  front  of  a  detector,  measures  the 
activity  every  5  min,  and  makes  a  graph  (figure).  The  isotopes  she 
thinks  may  have  been  produced  are: 


isotope 

half-life  (minutes) 

211Pb 

36.1 

214Pb 

26.8 

214Bi 

19.7 

223  pj. 

21.8 

239-q 

23.5 

Which  one  is  it? 

(b)  Having  decided  that  the  original  experimental  conditions  pro¬ 
duced  one  specific  isotope,  she  now  tries  using  beams  of  ions  travel¬ 
ing  at  several  different  speeds,  which  may  cause  different  reactions. 
The  following  table  gives  the  activity  of  the  target  10,  20  and  30  min¬ 
utes  after  the  end  of  the  experiment,  for  three  different  ion  speeds. 


first  ion  speed 
second  ion  speed 
third  ion  speed 


activity  (millions  of  decays/s)  after. . . 
10  min  20  min  30  min 

1.933  0.832  0.382 

1.200  0.545  0.248 

7.211  1.296  0.248 


Since  such  a  large  number  of  decays  is  being  counted,  assume  that 
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the  data  are  only  inaccurate  due  to  rounding  off  when  writing  down 
the  table.  Which  are  consistent  with  the  production  of  a  single 
isotope,  and  which  imply  that  more  than  one  isotope  was  being 
created?  ■ 

7  Devise  a  method  for  testing  experimentally  the  hypothesis  that 
a  gambler’s  chance  of  winning  at  craps  is  independent  of  her  previous 
record  of  wins  and  losses.  If  you  don’t  invoke  the  mathematical 
definition  of  statistical  independence,  then  you  haven’t  proposed  a 
test.  This  has  nothing  to  do  with  the  details  of  the  rules  of  craps, 
or  with  the  fact  that  it’s  a  game  played  using  dice. 

8  A  blindfolded  person  fires  a  gun  at  a  circular  target  of  radius 
b,  and  is  allowed  to  continue  bring  until  a  shot  actually  hits  it.  Any 
part  of  the  target  is  equally  likely  to  get  hit.  We  measure  the  random 
distance  r  from  the  center  of  the  circle  to  where  the  bullet  went  in. 

(a)  Show  that  the  probability  distribution  of  r  must  be  of  the  form 
D(r )  =  kr,  where  k  is  some  constant.  (Of  course  we  have  D(r )  =  0 
for  r  >  b.) 

(b)  Determine  k  by  requiring  D  to  be  properly  normalized. 

(c)  Find  the  average  value  of  r. 

(d)  Interpreting  your  result  from  part  c,  how  does  it  compare  with 

6/2?  Does  this  make  sense?  Explain.  ^ 

9  We  are  given  some  atoms  of  a  certain  radioactive  isotope,  with 
half-life  t\/ 2.  We  pick  one  atom  at  random,  and  observe  it  for  one 
half-life,  starting  at  time  zero.  If  it  decays  during  that  one-half¬ 
life  period,  we  record  the  time  t  at  which  the  decay  occurred.  If  it 
doesn’t,  we  reset  our  clock  to  zero  and  keep  trying  until  we  get  an 
atom  that  cooperates.  The  final  result  is  a  time  0  <  t  <  fi /2,  with  a 
distribution  that  looks  like  the  usual  exponential  decay  curve,  but 
with  its  tail  chopped  off. 

(a)  Find  the  distribution  D(t),  with  the  proper  normalization.  V 

(b)  Find  the  average  value  of  t.  V 

(c)  Interpreting  your  result  from  part  b,  how  does  it  compare  with 
f  1/2/2?  Does  this  make  sense?  Explain. 

10  The  speed,  v,  of  an  atom  in  an  ideal  gas  has  a  probability 
distribution  of  the  form  D(v)  =  bve~cv  ,  where  0  <  v  <  00,  c  relates 
to  the  temperature,  and  b  is  determined  by  normalization. 

(a)  Sketch  the  distribution. 

(b)  Find  b  in  terms  of  c.  V 

(c)  Find  the  average  speed  in  terms  of  c,  eliminating  b.  (Don’t  try 

to  do  the  indefinite  integral,  because  it  can’t  be  done  in  closed  form. 
The  relevant  definite  integral  can  be  found  in  tables  or  done  with 
computer  software.)  V  ■ 
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11  All  helium  on  earth  is  from  the  decay  of  naturally  occurring 
heavy  radioactive  elements  such  as  uranium.  Each  alpha  particle 
that  is  emitted  ends  up  claiming  two  electrons,  which  makes  it  a 
helium  atom.  If  the  original  238U  atom  is  in  solid  rock  (as  opposed 
to  the  earth’s  molten  regions),  the  He  atoms  are  unable  to  diffuse 
out  of  the  rock.  This  problem  involves  dating  a  rock  using  the 
known  decay  properties  of  uranium  238.  Suppose  a  geologist  finds 
a  sample  of  hardened  lava,  melts  it  in  a  furnace,  and  finds  that  it 
contains  1230  mg  of  uranium  and  2.3  mg  of  helium.  238U  decays  by 
alpha  emission,  with  a  half-life  of  4.5  x  109  years.  The  subsequent 
chain  of  alpha  and  electron  (beta)  decays  involves  much  shorter  half- 
lives,  and  terminates  in  the  stable  nucleus  206Pb.  Almost  all  natural 
uranium  is  238U,  and  the  chemical  composition  of  this  rock  indicates 
that  there  were  no  decay  chains  involved  other  than  that  of  238U. 

(a)  How  many  alphas  are  emitted  per  decay  chain?  [Hint:  Use 
conservation  of  mass.] 

(b)  How  many  electrons  are  emitted  per  decay  chain?  [Hint:  Use 
conservation  of  charge.] 

(c)  How  long  has  it  been  since  the  lava  originally  hardened?  V  - 

12  When  light  is  reflected  from  a  mirror,  perhaps  only  80%  of 

the  energy  comes  back.  One  could  try  to  explain  this  in  two  different 
ways:  (1)  80%  of  the  photons  are  reflected,  or  (2)  all  the  photons  are 
reflected,  but  each  loses  20%  of  its  energy.  Based  on  your  everyday 
knowledge  about  mirrors,  how  can  you  tell  which  interpretation  is 
correct?  [Based  on  a  problem  from  PSSC  Physics.]  - 

13  Suppose  we  want  to  build  an  electronic  light  sensor  using  an 
apparatus  like  the  one  described  in  subsection  13.2.2  on  p.  846.  How 
would  its  ability  to  detect  different  parts  of  the  spectrum  depend  on 
the  type  of  metal  used  in  the  capacitor  plates? 
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14  The  photoelectric  effect  can  occur  not  just  for  metal  cathodes 
but  for  any  substance,  including  living  tissue.  Ionization  of  DNA 
molecules  can  cause  cancer  or  birth  defects.  If  the  energy  required 
to  ionize  DNA  is  on  the  same  order  of  magnitude  as  the  energy 
required  to  produce  the  photoelectric  effect  in  a  metal,  which  of  the 
following  types  of  electromagnetic  waves  might  pose  such  a  hazard? 
Explain. 

60  Hz  waves  from  power  lines 

100  MHz  FM  radio 

microwaves  from  a  microwave  oven 

visible  light 

ultraviolet  light 

x-rays 


15  (a)  Rank-order  the  photons  according  to  their  wavelengths, 
frequencies,  and  energies.  If  two  are  equal,  say  so.  Explain  all  your 
answers. 

(b)  Photon  3  was  emitted  by  a  xenon  atom  going  from  its  second- 
lowest-energy  state  to  its  lowest-energy  state.  Which  of  photons  1, 
2,  and  4  are  capable  of  exciting  a  xenon  atom  from  its  lowest-energy 
state  to  its  second-lowest-energy  state?  Explain. 

16  Which  figure  could  be  an  electron  speeding  up  as  it  moves 
to  the  right?  Explain. 

17  The  beam  of  a  100-W  overhead  projector  covers  an  area 

of  1  m  X  1  m  when  it  hits  the  screen  3  m  away.  Estimate  the 
number  of  photons  that  are  in  flight  at  any  given  time.  (Since  this 
is  only  an  estimate,  we  can  ignore  the  fact  that  the  beam  is  not 
parallel.)  ■ 

18  In  the  photoelectric  effect,  electrons  are  observed  with  virtu¬ 
ally  no  time  delay  (~  10  ns),  even  when  the  light  source  is  very  weak. 
(A  weak  light  source  does  however  only  produce  a  small  number  of 
ejected  electrons.)  The  purpose  of  this  problem  is  to  show  that  the 
lack  of  a  significant  time  delay  contradicted  the  classical  wave  the¬ 
ory  of  light,  so  throughout  this  problem  you  should  put  yourself  in 
the  shoes  of  a  classical  physicist  and  pretend  you  don’t  know  about 
photons  at  all.  At  that  time,  it  was  thought  that  the  electron  might 
have  a  radius  on  the  order  of  ICE15  m.  (Recent  experiments  have 
shown  that  if  the  electron  has  any  finite  size  at  all,  it  is  far  smaller.) 

(a)  Estimate  the  power  that  would  be  soaked  up  by  a  single  electron 

in  a  beam  of  light  with  an  intensity  of  1  mW / m2 .  V 

(b)  The  energy,  Es,  required  for  the  electron  to  escape  through  the 

surface  of  the  cathode  is  on  the  order  of  10~ 19  J.  Find  how  long  it 
would  take  the  electron  to  absorb  this  amount  of  energy,  and  explain 
why  your  result  constitutes  strong  evidence  that  there  is  something 
wrong  with  the  classical  theory.  V  ^ 


Problem  15. 
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19  In  a  television,  suppose  the  electrons  are  accelerated  from  rest 
through  a  voltage  difference  of  104  V.  What  is  their  final  wavelength? 

V  ^ 

20  Use  the  Heisenberg  uncertainty  principle  to  estimate  the 

minimum  velocity  of  a  proton  or  neutron  in  a  208Pb  nucleus,  which 
has  a  diameter  of  about  13  fm  (1  fm=10-15  m).  Assume  that  the 
speed  is  nonrelativistic,  and  then  check  at  the  end  whether  this 
assumption  was  warranted.  V  ■ 

21  Find  the  energy  of  a  particle  in  a  one-dimensional  box  of 

length  L,  expressing  your  result  in  terms  of  L,  the  particle’s  mass 
m,  the  number  of  peaks  and  valleys  n  in  the  wavefunction,  and 
fundamental  constants.  V  ^ 

22  A  free  electron  that  contributes  to  the  current  in  an  ohmic 
material  typically  has  a  speed  of  105  m/s  (much  greater  than  the 
drift  velocity). 

(a)  Estimate  its  de  Broglie  wavelength,  in  nm.  v 

(b)  If  a  computer  memory  chip  contains  108  electric  circuits  in  a  1 
cm2  area,  estimate  the  linear  size,  in  nm,  of  one  such  circuit.  V 

(c)  Based  on  your  answers  from  parts  a  and  b,  does  an  electrical 
engineer  designing  such  a  chip  need  to  worry  about  wave  effects 
such  as  diffraction? 

(d)  Estimate  the  maximum  number  of  electric  circuits  that  can  fit  on 

a  1  cm2  computer  chip  before  quantum-mechanical  effects  become 
important.  ^ 

23  In  classical  mechanics,  an  interaction  energy  of  the  form 
U{x)  =  \kx 2  gives  a  harmonic  oscillator:  the  particle  moves  back 
and  forth  at  a  frequency  oj  =  \Jkfm.  This  form  for  U(x )  is  often 
a  good  approximation  for  an  individual  atom  in  a  solid,  which  can 
vibrate  around  its  equilibrium  position  at  x  =  0.  (For  simplicity,  we 
restrict  our  treatment  to  one  dimension,  and  we  treat  the  atom  as 
a  single  particle  rather  than  as  a  nucleus  surrounded  by  electrons). 
The  atom,  however,  should  be  treated  quantum-mechanically,  not 
clasically.  It  will  have  a  wave  function.  We  expect  this  wave  function 
to  have  one  or  more  peaks  in  the  classically  allowed  region,  and  we 
expect  it  to  tail  off  in  the  classically  forbidden  regions  to  the  right 
and  left.  Since  the  shape  of  U(x)  is  a  parabola,  not  a  series  of  flat 
steps  as  in  figure  n  on  page  877,  the  wavy  part  in  the  middle  will 
not  be  a  sine  wave,  and  the  tails  will  not  be  exponentials. 

(a)  Show  that  there  is  a  solution  to  the  Schrodinger  equation  of  the 
form 

tf(z)  =e~bx\ 

and  relate  b  to  k,  m,  and  h.  To  do  this,  calculate  the  second  deriva¬ 
tive,  plug  the  result  into  the  Schrodinger  equation,  and  then  find 
what  value  of  b  would  make  the  equation  valid  for  all  values  of  x. 
This  wavefunction  turns  out  to  be  the  ground  state.  Note  that  this 
wavefunction  is  not  properly  normalized  —  don’t  worry  about  that. 
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(b)  Sketch  a  graph  showing  what  this  wavefunction  looks  like. 

(c)  Let’s  interpret  b.  If  you  changed  b,  how  would  the  wavefunc¬ 
tion  look  different?  Demonstrate  by  sketching  two  graphs,  one  for 
a  smaller  value  of  b,  and  one  for  a  larger  value. 

(d)  Making  k  greater  means  making  the  atom  more  tightly  bound. 

Mathematically,  what  happens  to  the  value  of  b  in  your  result  from 
part  a  if  you  make  k  greater?  Does  this  make  sense  physically  when 
you  compare  with  part  c?  V  M 

24  (a)  A  distance  scale  is  shown  below  the  wavefunctions  and 
probability  densities  illustrated  in  figure  g  on  page  895.  Com¬ 
pare  this  with  the  order-of-magnitude  estimate  derived  in  subsec¬ 
tion  13.4.4  for  the  radius  r  at  which  the  wavefunction  begins  tailing 
off.  Was  the  estimate  on  the  right  order  of  magnitude? 

(b)  Although  we  normally  say  the  moon  orbits  the  earth,  actually 
they  both  orbit  around  their  common  center  of  mass,  which  is  below 
the  earth’s  surface  but  not  at  its  center.  The  same  is  true  of  the 
hydrogen  atom.  Does  the  center  of  mass  lie  inside  the  proton,  or 
outside  it? 

■ 

25  The  figure  shows  eight  of  the  possible  ways  in  which  an 
electron  in  a  hydrogen  atom  could  drop  from  a  higher  energy  state  to 
a  state  of  lower  energy,  releasing  the  difference  in  energy  as  a  photon. 
Of  these  eight  transitions,  only  D,  E,  and  F  produce  photons  with 
wavelengths  in  the  visible  spectrum. 

(a)  Which  of  the  visible  transitions  would  be  closest  to  the  violet 
end  of  the  spectrum,  and  which  would  be  closest  to  the  red  end? 
Explain. 

(b)  In  what  part  of  the  electromagnetic  spectrum  would  the  photons 
from  transitions  A,  B,  and  C  lie?  What  about  G  and  H?  Explain. 

(c)  Is  there  an  upper  limit  to  the  wavelengths  that  could  be  emitted 
by  a  hydrogen  atom  going  from  one  bound  state  to  another  bound 
state?  Is  there  a  lower  limit?  Explain. 

26  Find  an  equation  for  the  wavelength  of  the  photon  emit¬ 

ted  when  the  electron  in  a  hydrogen  atom  makes  a  transition  from 
energy  level  n\  to  level  n 2.  V  ^ 

27  Estimate  the  angular  momentum  of  a  spinning  basketball, 

in  units  of  h.  Explain  how  this  result  relates  to  the  correspondence 
principle.  ^ 

28  Assume  that  the  kinetic  energy  of  an  electron  in  the  n  =  1 
state  of  a  hydrogen  atom  is  on  the  same  order  of  magnitude  as  the 
absolute  value  of  its  total  energy,  and  estimate  a  typical  speed  at 
which  it  would  be  moving.  (It  cannot  really  have  a  single,  definite 
speed,  because  its  kinetic  and  interaction  energy  trade  off  at  different 
distances  from  the  proton,  but  this  is  just  a  rough  estimate  of  a 
typical  speed.)  Based  on  this  speed,  were  we  justified  in  assuming 
that  the  electron  could  be  described  nonrelativistically? 


n=l  T  T  T 
ABC 

Problem  25. 
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29  Before  the  quantum  theory,  experimentalists  noted  that  in 

many  cases,  they  would  find  three  lines  in  the  spectrum  of  the  same 
atom  that  satisfied  the  following  mysterious  rule:  1/Ai  =  I/A2  + 
I/A3.  Explain  why  this  would  occur.  Do  not  use  reasoning  that 
only  works  for  hydrogen  —  such  combinations  occur  in  the  spectra 
of  all  elements.  [Hint:  Restate  the  equation  in  terms  of  the  energies 
of  photons.]  ■ 

30  The  wavefunction  of  the  electron  in  the  ground  state  of  a 
hydrogen  atom  is 

T  =  7r-1/2a-3/2e-r/a; 

where  r  is  the  distance  from  the  proton,  and  a  =  5.3  x  lO^11  m  is  a 
constant  that  sets  the  size  of  the  wave. 

(a)  Calculate  symbolically,  without  plugging  in  numbers,  the  proba¬ 

bility  that  at  any  moment,  the  electron  is  inside  the  proton.  Assume 
the  proton  is  a  sphere  with  a  radius  of  b  =  0.5  fm.  Do  not  use  cal¬ 
culus.  [Hint:  Does  it  matter  if  you  plug  in  r  =  0  or  r  =  b  in  the 
equation  for  the  wavefunction?]  V 

(b)  Calculate  the  probability  numerically.  V 

(c)  Based  on  the  equation  for  the  wavefunction,  is  it  valid  to  think 
of  a  hydrogen  atom  as  having  a  finite  size?  Can  a  be  interpreted 
as  the  size  of  the  atom,  beyond  which  there  is  nothing?  Or  is  there 
any  limit  on  how  far  the  electron  can  be  from  the  proton? 

31  Use  physical  reasoning  to  explain  how  the  equation  for  the 
energy  levels  of  hydrogen, 

mk2e 4  1 

En  =  2hT  '  ^2’ 

should  be  generalized  to  the  case  of  an  atom  with  atomic  number  Z 
that  has  had  all  its  electrons  removed  except  for  one. 

32  A  muon  is  a  subatomic  particle  that  acts  exactly  like  an 
electron  except  that  its  mass  is  207  times  greater.  Muons  can  be 
created  by  cosmic  rays,  and  it  can  happen  that  one  of  an  atom’s 
electrons  is  displaced  by  a  muon,  forming  a  muonic  atom.  If  this 
happens  to  a  hydrogen  atom,  the  resulting  system  consists  simply 
of  a  proton  plus  a  muon. 

(a)  Based  on  the  results  of  section  13.4.4,  how  would  the  size  of  a 
muonic  hydrogen  atom  in  its  ground  state  compare  with  the  size  of 
the  normal  atom? 

(b)  If  you  were  searching  for  muonic  atoms  in  the  sun  or  in  the 
earth’s  atmosphere  by  spectroscopy,  in  what  part  of  the  electromag¬ 
netic  spectrum  would  you  expect  to  find  the  absorption  lines? 

33  An  electron  is  initially  at  rest.  A  photon  collides  with  the 
electron  and  rebounds  from  the  collision  at  180  degrees,  i.e. ,  going 
back  along  the  path  on  which  it  came.  The  rebounding  photon  has  a 
different  energy,  and  therefore  a  different  frequency  and  wavelength. 
Show  that,  based  on  conservation  of  energy  and  momentum,  the 
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difference  between  the  photon’s  initial  and  final  wavelengths  must 
be  2 h/mc,  where  m  is  the  mass  of  the  electron.  The  experimental 
verification  of  this  type  of  “pool-ball”  behavior  by  Arthur  Compton 
in  1923  was  taken  as  definitive  proof  of  the  particle  nature  of  light. 
Note  that  we’re  not  making  any  nonrelativistic  approximations.  To 
keep  the  algebra  simple,  you  should  use  natural  units  —  in  fact,  it’s 
a  good  idea  to  use  even-more-natural-than-natural  units,  in  which 
we  have  not  just  c  =  1  but  also  h  =  1,  and  m  =  1  for  the  mass 
of  the  electron.  You’ll  also  probably  want  to  use  the  relativistic 
relationship  E2  —  p2  =  m 2,  which  becomes  E2  —  p2  =  1  for  the 
energy  and  momentum  of  the  electron  in  these  units.  - 

34  Generalize  the  result  of  problem  33  to  the  case  where  the 

photon  bounces  off  at  an  angle  other  than  180°with  respect  to  its 
initial  direction  of  motion.  ^ 

35  On  page  877  we  derived  an  expression  for  the  probability 

that  a  particle  would  tunnel  through  a  rectangular  barrier,  i.e. ,  a 
region  in  which  the  interaction  energy  U ( x )  has  a  graph  that  looks 
like  a  rectangle.  Generalize  this  to  a  barrier  of  any  shape.  [Hints: 
First  try  generalizing  to  two  rectangular  barriers  in  a  row,  and  then 
use  a  series  of  rectangular  barriers  to  approximate  the  actual  curve 
of  an  arbitrary  function  U(x).  Note  that  the  width  and  height  of 
the  barrier  in  the  original  equation  occur  in  such  a  way  that  all  that 
matters  is  the  area  under  the  [/-versus- x  curve.  Show  that  this  is 
still  true  for  a  series  of  rectangular  barriers,  and  generalize  using  an 
integral.]  If  you  had  done  this  calculation  in  the  1930’s  you  could 
have  become  a  famous  physicist.  M 

36  Show  that  the  wavefunction  given  in  problem  30  is  properly 

normalized.  ^ 

37  Show  that  a  wavefunction  of  the  form  T  =  eby  s'max  is 
a  possible  solution  of  the  Schrodinger  equation  in  two  dimensions, 
with  a  constant  potential  U.  Can  we  tell  whether  it  would  apply  to 
a  classically  allowed  region,  or  a  classically  forbidden  one? 

38  Find  the  energy  levels  of  a  particle  in  a  three-dimensional 

rectangular  box  with  sides  of  length  a,  b,  and  c.  V  ^ 
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39  Americium-241  is  an  artificial  isotope  used  in  smoke  de¬ 
tectors.  It  undergoes  alpha  decay,  with  a  half-life  of  432  years.  As 
discussed  in  example  18  on  page  878,  alpha  decay  can  be  understood 
as  a  tunneling  process,  and  although  the  barrier  is  not  rectangular 
in  shape,  the  equation  for  the  tunneling  probability  on  page  878 
can  still  be  used  as  a  rough  guide  to  our  thinking.  For  americium- 
241,  the  tunneling  probability  is  about  1  x  1CT29.  Suppose  that 
this  nucleus  were  to  decay  by  emitting  a  tritium  (helium-3)  nucleus 
instead  of  an  alpha  particle  (helium-4).  Estimate  the  relevant  tun¬ 
neling  probability,  assuming  that  the  total  energy  E  remains  the 
same.  This  higher  probability  is  contrary  to  the  empirical  observa¬ 
tion  that  this  nucleus  is  not  observed  to  decay  by  tritium  emission 
with  any  significant  probability,  and  in  general  tritium  emission  is 
almost  unknown  in  nature;  this  is  mainly  because  the  tritium  nu¬ 
cleus  is  far  less  stable  than  the  helium-4  nucleus,  and  the  difference 
in  binding  energy  reduces  the  energy  available  for  the  decay. 

40  As  far  as  we  know,  the  mass  of  the  photon  is  zero.  However, 

it’s  not  possible  to  prove  by  experiments  that  anything  is  zero;  all 
we  can  do  is  put  an  upper  limit  on  the  number.  As  of  2008,  the 
best  experimental  upper  limit  on  the  mass  of  the  photon  is  about 
1  x  10“52  kg.  Suppose  that  the  photon’s  mass  really  isn’t  zero,  and 
that  the  value  is  at  the  top  of  the  range  that  is  consistent  with 
the  present  experimental  evidence.  In  this  case,  the  c  occurring  in 
relativity  would  no  longer  be  interpreted  as  the  speed  of  light.  As 
with  material  particles,  the  speed  v  of  a  photon  would  depend  on 
its  energy,  and  could  never  be  as  great  as  c.  Estimate  the  relative 
size  (c  —  v)/c  of  the  discrepancy  in  speed,  in  the  case  of  a  photon  of 
visible  light.  >  Answer,  p.  970  ■ 

41  Hydrogen  is  the  only  element  whose  energy  levels  can  be 

expressed  exactly  in  an  equation.  Calculate  the  ratio  Xe/Xf  of  the 
wavelengths  of  the  transitions  labeled  E  and  F  in  problem  25  on 
p.  937.  Express  your  answer  as  an  exact  fraction,  not  a  decimal 
approximation.  In  an  experiment  in  which  atomic  wavelengths  are 
being  measured,  this  ratio  provides  a  natural,  stringent  check  on  the 
precision  of  the  results.  v  ■ 

42  Give  a  numerical  comparison  of  the  number  of  photons 

per  second  emitted  by  a  hundred-watt  FM  radio  transmitter  and 
a  hundred- watt  lightbulb.  v  ■ 
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43  On  pp.  896-898  of  subsection  13.4.4,  we  used  simple  algebra  to 
derive  an  approximate  expression  for  the  energies  of  states  in  hydro¬ 
gen,  without  having  to  explicitly  solve  the  Schrodinger  equation.  As 
input  to  the  calculation,  we  used  the  the  proportionality  U  oc  r_1, 
which  is  a  characteristic  of  the  electrical  interaction.  The  result  for 
the  energy  of  the  rath  standing  wave  pattern  was  En  oc  n~2. 

There  are  other  systems  of  physical  interest  in  which  we  have 
U  oc  rk  for  values  of  k  besides  —1.  Problem  23  discusses  the  ground 
state  of  the  harmonic  oscillator,  with  k  =  2  (and  a  positive  con¬ 
stant  of  proportionality).  In  particle  physics,  systems  called  char- 
monium  and  bottomonium  are  made  out  of  pairs  of  subatomic  par¬ 
ticles  called  quarks,  which  interact  according  to  k  =  1,  i.e.,  a  force 
that  is  independent  of  distance.  (Here  we  have  a  positive  constant 
of  proportionality,  and  r  >  0  by  definition.  The  motion  turns  out 
not  to  be  too  relativistic,  so  the  Schrodinger  equation  is  a  reason¬ 
able  approximation.)  The  figure  shows  actual  energy  levels  for  these 
three  systems,  drawn  with  different  energy  scales  so  that  they  can 
all  be  shown  side  by  side.  The  sequence  of  energies  in  hydrogen 
approaches  a  limit,  which  is  the  energy  required  to  ionize  the  atom. 
In  charmonium,  only  the  first  three  levels  are  known.11 

Generalize  the  method  used  for  k  =  —1  to  any  value  of  k.  and 
find  the  exponent  j  in  the  resulting  proportionality  En  oc  n3 .  Com¬ 
pare  the  theoretical  calculation  with  the  behavior  of  the  actual  en¬ 
ergies  shown  in  the  figure.  Comment  on  the  limit  k  — >  oo.  V  ^ 

44  The  electron,  proton,  and  neutron  were  discovered,  respec¬ 

tively,  in  1897,  1919,  and  1932.  The  neutron  was  late  to  the  party, 
and  some  physicists  felt  that  it  was  unnecessary  to  consider  it  as 
fundamental.  Maybe  it  could  be  explained  as  simply  a  proton  with 
an  electron  trapped  inside  it.  The  charges  would  cancel  out,  giving 
the  composite  particle  the  correct  neutral  charge,  and  the  masses 
at  least  approximately  made  sense  (a  neutron  is  heavier  than  a  pro¬ 
ton).  (a)  Given  that  the  diameter  of  a  proton  is  on  the  order  of 
1(T15  m,  use  the  Heisenberg  uncertainty  principle  to  estimate  the 
trapped  electron’s  minimum  momentum.  V 

(b)  Find  the  electron’s  minimum  kinetic  energy.  V 

(c)  Show  via  E  =  me 2  that  the  proposed  explanation  fails,  because 
the  contribution  to  the  neutron’s  mass  from  the  electron’s  kinetic 
energy  would  be  many  orders  of  magnitude  too  large. 


hydrogen  charmonium  molecular 
vibrations 
of  HCI 

k=-1  k=1  k=2 

Problem  43. 


11See  Barnes  et  al.,  “The  XYZs  of  Charmonium  at  BES,”  arxiv.org/abs/ 
hep-ph/0608103.  To  avoid  complication,  the  levels  shown  are  only  those  in  the 
group  known  for  historical  reasons  as  the  T  and  J/\&. 
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45  Suppose  that  an  electron,  in  one  dimension,  is  confined  to  a 
certain  region  of  space  so  that  its  wavefunction  is  given  by 


0  if  x  <  0 

T  =  <  Asm(2nx/L)  if  0  <  x  <  L 
0  if  x  >  L 

Determine  the  constant  A  from  normalization.  V  ^ 

46  In  the  following,  x  and  y  are  variables,  while  u  and  v  are 
constants.  Compute  (a)  d(ux1n(vy))/dx,  (b)  d(ux\n(vy))/dy. 

v  m 

47  (a)  A  radio  transmitter  radiates  power  P  in  all  directions,  so 

that  the  energy  spreads  out  spherically.  Find  the  energy  density  at 
a  distance  r.  v 

(b)  Let  the  wavelength  be  A.  As  described  in  example  8  on  p.  850, 
find  the  number  of  photons  in  a  volume  A3  at  this  distance  r.  V 

(c)  For  a  1000  kHz  AM  radio  transmitting  station,  assuming  rea¬ 

sonable  values  of  P  and  r,  verify,  as  claimed  in  the  example,  that 
the  result  from  part  b  is  very  large.  ^ 

48  The  wavefunction  T  of  an  electron  is  a  complex  number. 

Make  up  an  example  of  a  value  for  the  wavefunction  that  is  not  a 
real  number,  and  consider  the  following  expressions:  T2,  I'Ll2,  |\Ef2|. 
Which  of  these  would  it  make  sense  to  interpret  as  a  probability 
density?  All  of  them?  Some?  Only  one?  >  Solution,  p.  982  - 

49  (a)  Consider  the  function  defined  by  f(x,y )  =  (x  —  y)2. 
Visualize  the  graph  of  this  function  as  a  surface.  (This  is  a  simple 
enough  example  that  you  should  not  have  to  resort  to  computer 
software.)  Use  this  visualization  to  determine  the  behavior  of  the 
sign  of  the  Laplacian,  as  in  example  20  on  p.  880. 

(b)  Consider  the  following  incorrect  calculation  of  this  Laplacian. 
We  take  the  first  derivatives  and  find 

df  df 
—  +  —  =  0. 
dx  dy 

Next  we  take  the  second  derivatives,  but  those  are  zero  as  well,  so 
the  Laplacian  is  zero.  Critique  this  calculation  in  two  ways:  (1)  by 
comparing  with  part  a;  (2)  by  comparing  with  a  correct  calculation. 

(c)  In  general,  if  we  have  a  function  /  of  two  variables,  the  quantity 
Q  =  df  /dx  +  df  /dy  can  never  be  of  physical  interest,  because  it  is 
not  rotationally  invariant  (sec.  3.4.2,  p.  190).  Prove  this  by  showing 
that  by  rotating  your  coordinate  system,  you  can  get  a  completely 
different  answer  than  the  one  calculated  in  part  b. 

>  Solution,  p.  983  - 


942 


Chapter  13  Quantum  Physics 


Problems  50-53  draw  on  optional  topics  from  section  13.5,  p.  905. 
50  In  problem  23,  you  showed  that  a  wavefunction  of  the  form 

*o(*)  =  e-2/2 

was  a  solution  of  the  Schrodinger  equation  for  the  quantum  har¬ 
monic  oscillator  in  one  dimensions.  (We  ignore  units,  and  the  factor 
of  1/2  in  the  exponent  is  just  a  convention.)  It  represents  the  ground 
state.  The  wavefunction  of  the  first  excited  state  is 

Ti(x)  =  xe~x 2/2, 


with  the  same  value  of  b. 

(a)  Show  that  these  states  are  orthogonal  in  the  sense  defined  on 
p.  924. 

(b)  What  is  an  observable  that  would  distinguish  them? 

51  Suppose  we  multiply  a  wavefunction  T  by  a  constant  c.  This 

operation,  (HT  =  cT,  is  linear,  which  suggests  that  it  could  also 
be  considered  an  observable,  abeit  a  trivial  one  that  provides  no 
information.  To  see  if  this  is  true,  we  also  need  to  check  that  it 
is  Hermitian,  using  one  of  our  two  equivalent  definitions,  (a)  Does 
it  have  real  eigenvalues?  (b)  Equivalently,  is  it  true  that  (Ou\v)  = 
(u\Ov)7  ■ 

52  (a)  When  an  excited  state  in  a  nucleus  undergoes  gamma  de¬ 
cay,  the  half-life  depends  on  a  variety  of  factors,  but  a  fairly  typical 
value  would  be  about  1  ns.  Find  the  uncertainty  in  energy  imposed 
by  the  energy-time  uncertainty  relation,  and  compare  with  a  typical 
excitation  energy  of  1  MeV. 

(b)  Some  very  neutron-rich  nuclei  are  unstable  with  respect  to  emis¬ 
sion  of  a  neutron,  and  in  these  cases  the  half-life  is  typically  on  the 
order  of  10~21  s.  Carry  out  an  estimate  as  in  part  a.  - 

53  As  you  might  have  guessed  from  the  equations  given  in  prob¬ 
lem  50,  the  mth  excited  state  of  the  one-dimensional  quantum  har¬ 
monic  oscillator  has  a  wavefunction  of  the  form 

Tm(x)  =  Hm(x)e~x2/2. 

Here  Hm  is  a  polynomial  of  order  m,  and  Hm  is  an  even  function  if 
m  is  even,  odd  if  m  is  odd.  Given  these  assumptions,  it  is  possible 
to  find  T2  simply  from  the  requirement  that  it  be  orthogonal  to  \ko 
and  Ti,  without  having  to  solve  the  Schrodinger  equation.  Find  ^2 
by  this  method.  (Ignore  normalization.)  Hint:  First  use  integration 
by  parts  to  relate  the  integral  ff°  xme~x  d.x  to  the  corresponding 
integral  for  m  —  2.  ^ 

Key  to  symbols: 

easy  -  typical  K  challenging  )i  difficult  ■  very  difficult 
V  An  answer  check  is  available  at  www.lightandmatter.com. 
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Exercises 

Exercise  13A:  Quantum  Versus  Classical  Randomness 

1.  Imagine  the  classical  version  of  the  particle  in  a  one-dimensional  box.  Suppose  you  insert 
the  particle  in  the  box  and  give  it  a  known,  predetermined  energy,  but  a  random  initial  position 
and  a  random  direction  of  motion.  You  then  pick  a  random  later  moment  in  time  to  see  where 
it  is.  Sketch  the  resulting  probability  distribution  by  shading  on  top  of  a  line  segment.  Does 
the  probability  distribution  depend  on  energy? 

2.  Do  similar  sketches  for  the  first  few  energy  levels  of  the  quantum  mechanical  particle  in  a 
box,  and  compare  with  1. 

3.  Do  the  same  thing  as  in  1,  but  for  a  classical  hydrogen  atom  in  two  dimensions,  which  acts 
just  like  a  miniature  solar  system.  Assume  you’re  always  starting  out  with  the  same  fixed  values 
of  energy  and  angular  momentum,  but  a  position  and  direction  of  motion  that  are  otherwise 
random.  Do  this  for  L  =  0,  and  compare  with  a  real  L  =  0  probability  distribution  for  the 
hydrogen  atom. 

4.  Repeat  3  for  a  nonzero  value  of  L,  say  L=h. 

5.  Summarize:  Are  the  classical  probability  distributions  accurate?  What  qualitative  features 
are  possessed  by  the  classical  diagrams  but  not  by  the  quantum  mechanical  ones,  or  vice-versa? 
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Three  Essential  Mathematical  Skills 


More  often  than  not  when  a  search-and-rescue  team  finds  a  hiker 
dead  in  the  wilderness,  it  turns  out  that  the  person  won  a  Darwin 
Award  by  not  carrying  some  item  from  a  short  list  of  essentials,  such 
as  water  and  a  map.  There  are  three  mathematical  essentials  in  this 
course. 

1.  Converting  units 

basic  technique:  subsection  0.1.9,  p.  29;  conversion  of  area,  vol¬ 
ume,  etc.:  subsection  0.2.1,  p.  35 

Examples: 


0.7  j^x 


103  g 
1 


700  g. 


To  check  that  we  have  the  conversion  factor  the  right  way  up  (103 
rather  then  1/103),  we  note  that  the  smaller  unit  of  grams  has  been 
compensated  for  by  making  the  number  larger. 

For  units  like  m2,  kg/m3,  etc.,  we  have  to  raise  the  conversion 
factor  to  the  appropriate  power: 


4  m3  x 


103  mm 
1  m 


4  x  109  x 


mm3 


=  4  X  109  mm3 


Examples  with  solutions  —  p.  48,  #6;  p.  52,  #31 

Problems  you  can  check  at  lightandmatter .  com/arealchecker . 
html  p.  48,  #5;  p.  48,  #4;  p.  48,  #7;  p.  52,  #22;  p.  53,  #40 

2.  Reasoning  about  ratios  and  proportionali¬ 
ties 

The  technique  is  introduced  in  subsection  0.2.2,  p.  36,  in  the 
context  of  area  and  volume,  but  it  applies  more  generally  to  any 
relationship  in  which  one  variable  depends  on  another  raised  to  some 
power. 

Example:  When  a  car  or  truck  travels  over  a  road,  there  is  wear 
and  tear  on  the  road  surface,  which  incurs  a  cost.  Studies  show  that 
the  cost  per  kilometer  of  travel  C  is  given  by 

C  =  kw 4, 

where  w  is  the  weight  per  axle  and  A;  is  a  constant.  The  weight  per 
axle  is  about  13  times  higher  for  a  semi-trailer  than  for  my  Honda 
Fit.  How  many  times  greater  is  the  cost  imposed  on  the  federal 
government  when  the  semi  travels  a  given  distance  on  an  interstate 
freeway? 
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>  First  we  convert  the  equation  into  a  proportionality  by  throwing 
out  k,  which  is  the  same  for  both  vehicles: 

Can;4 


Next  we  convert  this  proportionality  to  a  statement  about  ratios: 

L#A)#29,„00 

C2  \W2J 


Since  the  gas  taxes  paid  by  the  trucker  are  nowhere  near  29,000 
times  more  than  those  I  pay  to  drive  my  Fit  the  same  distance,  the 
federal  government  is  effectively  awarding  a  massive  subsidy  to  the 
trucking  company.  Plus  my  Fit  is  cuter. 


Examples  with  solutions  —  p.  52,  #32;  p.  52,  #33;  p.  53,  #38; 
p.  50,  #17 

Problems  you  can  check  at  lightandmatter .  com/arealchecker . 
html  —  p.  53,  #37;  p.  53,  #39;  p.  121,  #24;  p.  122,  #27;  p.  119, 
#9;  p.  288,  #3 

3.  Vector  addition 


subsection  3.4.3,  p.  199 

Example:  The  Ar  vector  from  San  Diego  to  Los  Angeles  has 
magnitude  190  km  and  direction  129° counterclockwise  from  east. 
The  one  from  LA  to  Las  Vegas  is  370  km  at  38°counterclockwise 
from  east.  Find  the  distance  and  direction  from  San  Diego  to  Las 
Vegas. 

>  Graphical  addition  is  discussed  on  p.  199.  Here  we  concentrate  on 
analytic  addition,  which  involves  adding  the  x  components  to  find 
the  total  x  component,  and  similarly  for  y.  The  trig  needed  in  order 
to  find  the  components  of  the  second  leg  (LA  to  Vegas)  is  laid  out 
in  figure  1  on  p.  197  and  explained  in  detail  in  example  60  on  p.  197: 

Ax' 2  =  (370  km)  cos  38°  =  292  km 
Ay 2  =  (370  km)  sin  38°  =  228  km 

(Since  these  are  intermediate  results,  we  keep  an  extra  sig  fig  to 
avoid  accumulating  too  much  rounding  error.)  Once  we  understand 
the  trig  for  one  example,  we  don’t  need  to  reinvent  the  wheel  every 
time.  The  pattern  is  completely  universal,  provided  that  we  first 
make  sure  to  get  the  angle  expressed  according  to  the  usual  trig 
convention,  counterclockwise  from  the  x  axis.  Applying  the  pattern 
to  the  first  leg,  we  have: 

Ax\  =  (190  km)  cos  129°  =  —120  km 
Ayi  =  (190  km)  sin  129°  =  148  km 
For  the  vector  directly  from  San  Diego  to  Las  Vegas,  we  have 
Ax  =  Ax’i  +  Ax2  =  172  km 
Ay  =  Ayi  +  Ay2  =  376  km. 
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The  distance  from  San  Diego  to  Las  Vegas  is  found  using  the  Pythagorean 
theorem, 

\J (172  km)2  +  (376  km)2  =  410  km 

(rounded  to  two  sig  figs  because  it’s  one  of  our  final  results).  The 
direction  is  one  of  the  two  possible  values  of  the  inverse  tangent 

tan_1(Ay/Ax)  =  {65°,  245°}. 

Consulting  a  sketch  shows  that  the  first  of  these  values  is  the  correct 
one. 

Examples  with  solutions  —  p.  229,  #62;  p.  229,  #63;  p.  226, 

#45 

Problems  you  can  check  at  lightandmatter .  com/arealchecker . 
html  p.  228,  #53;  p.  228,  #57;  p.  228,  #58;  p.  235,  #79;  p.  226, 

#46 


Exercises 


947 


Programming  With  Python  rsions  of  Python,  the  computer  will 

actually  print  out  >>>  as  a  prompt  to  tell  you 


The  purpose  of  this  tutorial  is  to  help  you 
get  familiar  with  a  computer  programming  lan¬ 
guage  called  Python,  which  Pve  chosen  be¬ 
cause  (a)  it’s  free,  and  (b)  it’s  easy  to  use  inter¬ 
actively.  I  won’t  assume  you  have  any  previous 
experience  with  computer  programming;  you 
won’t  need  to  learn  very  much  Python,  and 
what  little  you  do  need  to  learn  I’ll  explain 
explicitly.  If  you  really  want  to  learn  Python 
more  thoroughly,  there  are  a  couple  of  excel¬ 
lent  books  that  you  can  download  for  free  on 
the  Web: 

How  to  Think  Like  a  Computer  Sci¬ 
entist  (Python  Version),  Allen  B.  Dow¬ 
ney,  Jeffrey  Elkner,  Moshe  Zadka, 
http:/ / www. ibiblio.org/obp/ 

Dive  Into  Python,  Mark  Pilgrim, 
http:/ /diveintopython.net/ 


it’s  ready  to  type  something.) 

There  are  only  a  couple  of  things  to  watch 
out  for.  First,  Python  distinguishes  between 
integers  and  real  numbers,  so  the  following 
gives  an  unexpected  result: 

>>>  print (2/3) 

0 

To  get  it  to  treat  these  values  as  real  numbers, 
you  have  to  use  decimal  points: 

>>>  print(2./3.) 

0 . 6666666666666666666663 

Multiplication  is  represented  by 

»>  print (2.  *3.) 

6.0 


The  first  book  is  meant  for  people  who  have 
never  programmed  before,  while  the  second  is 
a  more  complete  introduction  aimed  at  vet¬ 
eran  programmers  who  know  a  different  lan¬ 
guage  already. 

Using  Python  as  a  calculator 

The  easiest  way  to  get  Python  going  is  to 
go  to  the  web  site  ideone .  com.  Under  “choose 
a  language,”  select  Python.  Inside  the  win¬ 
dow  where  it  says  “paste  your  source  code  or 
insert  template  or  sample,”  type  print  (2+2). 
Click  on  the  “submit”  button.  The  result,  4, 
is  shown  under  “output.”  In  other  words,  you 
can  use  Python  just  like  a  calculator. 

For  compactness,  I’ll  show  examples  in  the 
following  style: 

»>  print  (2+2) 

4 

Here  the  »>  is  not  something  you  would  type 
yourself;  it’s  just  a  marker  to  distinguish  your 
input  from  the  program’s  output.  (In  some 


Also,  Python  doesn’t  know  about  its  own  li¬ 
brary  of  math  functions  unless  you  tell  it  ex¬ 
plicitly  to  load  them  in: 

>>>  print  (sqrt(2.)) 

Traceback  (most  recent  call  last) : 

File  ‘‘<stdin>,,)  line  1,  in  ? 

NameError:  There  is  no  variable  named  ‘sqrt’ 

Here  are  the  steps  you  have  to  go  through  to 
calculate  the  square  root  of  2  successfully: 

>>>  import  math 

>>>  print (math. sqrt (2 .) ) 

1.4142135623730951 


The  first  line  is  just  something  you  can  make  a 
habit  of  doing  every  time  you  start  up  Python. 
In  the  second  line,  the  name  of  the  square 
root  function  had  to  be  prefixed  with  “math.” 
to  tell  Python  where  you  wanted  to  get  this 
“sqrt”  function  from.  (All  of  this  may  seem 
like  a  nuisance  if  you’re  just  using  Python  as  a 


948 


Chapter  13  Quantum  Physics 


calculator,  but  it’s  a  good  way  to  design  a  pro¬ 
gramming  language  so  that  names  of  functions 
never  conflict.) 

Try  it.  Experiment  and  figure  out  whether 
Python’s  trig  functions  assume  radians  or 
degrees. 

Variables 

Python  lets  you  define  variables  and  assign 
values  to  them  using  an  equals  sign: 

>>>  dwarf s=7 
>>>  print (dwarf s) 

>>>  print (dwarf s+3) 

7 

10 

Note  that  a  variable  in  computer  program¬ 
ming  isn’t  quite  like  a  variable  in  algebra.  In 
algebra,  if  a= 7  then  a= 7  always,  throughout  a 
particular  calculation.  But  in  a  programming 
language,  the  variable  name  really  represents 
a  place  in  memory  where  a  number  can  be 
stored,  so  you  can  change  its  value: 

>>>  dwarf s=7 
>>>  dwarfs=37 
>>>  print (dwarf s) 

37 

You  can  even  do  stuff  like  this, 

>>>  dwarfs=37 
>>>  dwarf s=dwarfs+l 
>>>  print (dwarf s) 

38 

In  algebra  it  would  be  nonsense  to  have  a  vari¬ 
able  equal  to  itself  plus  one,  but  in  a  com¬ 
puter  program,  it’s  not  an  assertion  that  the 
two  things  are  equal,  its  a  command  to  calcu¬ 
late  the  value  of  the  expression  on  the  right 
side  of  the  equals,  and  then  put  that  number 
into  the  memory  location  referred  to  by  the 
variable  name  on  the  left. 

Try  it.  What  happens  if  you  do  dwarfs+1  = 
dwarfs?  Do  you  understand  why? 


Functions 

Somebody  had  to  teach  Python  how  to  do 
functions  like  sqrt,  and  it’s  handy  to  be  able 
to  define  your  own  functions  in  the  same  way. 
Here’s  how  to  do  it: 

>>>  def  double (x): 

>>>  return  2.*x 
>>>  print (double (5 .) ) 

10.0 

Note  that  the  indentation  is  mandatory.  The 
first  and  second  lines  define  a  function  called 
double.  The  final  line  evaluates  that  function 
with  an  input  of  5. 

Loops 

Suppose  we  want  to  add  up  all  the  numbers 
from  0  to  99. 

Automating  this  kind  of  thing  is  exactly 
what  computers  are  best  at,  and  Python  pro¬ 
vides  a  mechanism  for  this  called  a  loop: 

>>>  sum=0 

>>>  for  j  in  range(lOO): 

>>>  sum=sum+j 
>>>  print (sum) 

4950 

The  stuff  that  gets  repeated  —  the  inside  of 
the  loop  —  has  to  be  indented,  just  like  in 
a  function  definition.  Python  always  counts 
loops  starting  from  0,  so  for  j  in  range  (100) 
actually  causes  j  to  range  from  0  to  99,  not 
from  1  to  100. 


Exercises 
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Unphysical  “hovering”  solutions  to  conservation  of  energy 

On  page  83,  I  gave  the  following  derivation  for  the  acceleration  of  an  object  under  the  influence 
of  gravity: 

/ chA  _  /  du  \  /ch K\  /dCA  /dy\ 

Vdt )  ~  U K)  VdC7y  U i) 

=  (— )  (-1  )(mg)(v) 

\  rnv  J 

=  ~9 

There  is  a  loophole  in  this  argument,  however.  When  I  say  dv/  d K  =  l/(mv),  that  only  works 
when  the  object  is  moving.  If  it’s  at  rest,  v  is  nondifferentiable  as  a  function  of  K  (or  we  could 
say  that  the  derivative  is  infinite).  Energy  can  in  fact  be  conserved  by  an  object  that  simply 
hovers  above  the  ground:  its  kinetic  energy  is  constant,  and  its  gravitational  energy  is  also 
constant.  Why,  then,  do  we  never  observe  such  behavior,  except  in  Coyote  and  Roadrunner 
cartoons  when  the  Coyote  runs  off  the  edge  of  a  cliff  without  noticing  it  at  first? 

Suppose  we  toss  a  baseball  straight  up,  and  pick  a  coordinate  system  in  which  upward 
velocities  are  positive.  The  ball’s  velocity  is  a  continuous  function  of  time,  and  it  changes 
from  being  positive  to  being  negative,  so  there  must  be  some  instant  at  which  it  equals  zero. 
Conservation  of  energy  would  be  satisfied  if  the  velocity  were  to  remain  at  zero  for  a  minute 
or  an  hour  before  the  ball  finally  made  the  decision  to  fall.  One  thing  that  seems  odd  about 
all  this  is  that  there’s  no  obvious  way  for  the  ball  to  “decide”  when  it  was  time  to  go  ahead 
and  fall  back  down  again.  It  violates  the  principle  that  the  laws  of  physics  are  supposed  to  be 
deterministic. 

One  reason  that  we  could  never  hope  to  observe  such  behavior  in  reality  is  that  the  ball 
would  have  to  spend  some  time  being  exactly  at  rest,  and  yet  no  object  can  ever  stay  exactly 
at  rest  for  any  finite  amount  of  time.  Objects  in  the  real  world  are  buffeted  by  air  currents, 
for  example.  At  the  atomic  level,  the  interaction  of  these  air  currents  with  the  ball  consists  of 
discrete  collisions  with  whizzing  air  molecules,  and  a  quick  back-of-the-envelope  estimate  shows 
that  for  an  object  this  size,  the  typical  time  between  collisions  is  on  the  order  of  ICC27  s,  which 
would  limit  the  duration  of  the  hovering  to  a  time  far  too  short  to  allow  it  to  be  observed. 
Nevertheless  this  is  not  a  completely  satisfying  explanation.  It  makes  us  wonder  whether  we 
ought  to  apply  to  the  government  for  a  research  grant  to  do  an  experiment  in  which  a  baseball 
would  be  shot  upward  in  a  chamber  that  had  been  pumped  out  to  an  ultra-high  vacuum! 

A  somewhat  better  approach  is  to  consider  that  motion  is  relative,  so  the  ball’s  velocity  can 
only  be  zero  in  one  particular  frame  of  reference.  It  wouldn’t  make  sense  for  the  ball  to  exhibit 
qualitatively  different  behavior  when  it  was  at  rest,  because  different  observers  don’t  even  agree 
when  the  ball  is  at  rest.  But  this  argument  also  fails  to  resolve  the  issue  completely,  because  this 
is  a  ball  interacting  with  the  planet  Earth  via  gravitational  forces,  so  it  could  make  a  difference 
whether  the  ball  was  at  rest  relative  to  the  earth.  Suppose  we  go  into  a  frame  of  reference  defined 
by  an  observer  watching  the  ball  as  she  descends  in  a  glass  elevator.  At  the  moment  when  the 


ball  is  at  rest  relative  to  the  earth,  she  sees  both  the  ball  and  the  earth  as  moving  upward  at  the 
same  speed.  It  would  be  perfectly  consistent  with  conservation  of  energy  if  she  were  to  see  them 
maintain  this  distance  from  one  another  for  several  minutes.  In  her  frame,  their  kinetic  energies 
would  be  nonzero,  but  constant,  and  the  gravitational  energy  only  depends  on  the  separation 
between  the  ball  and  the  earth,  so  it  would  be  constant  as  well. 

Now  that  we’re  thinking  of  the  ball  and  the  earth  as  two  objects  interacting  with  one  another, 
it  becomes  natural  to  think  of  them  on  the  same  footing.  What  about  the  motion  of  the  Earth? 
The  earth  feels  a  gravitational  attraction  from  the  ball,  just  as  the  ball  feels  one  from  the 
Earth.  To  make  this  symmetry  more  evident,  let’s  imagine  two  planets  of  equal  mass,  Foo  and 
Bar,  initially  at  rest  with  respect  to  one  another.  The  Fooites  and  Barians  realize  that  the 
gravitational  interaction  between  their  planets  will  cause  them  to  drop  together  and  collide. 
It  seems  that  they  should  get  ready  for  the  end  of  the  world.  And  yet  before  they  riot,  get 
drunk,  or  tell  their  spouses  that  yes,  they  really  do  look  fat  in  that  dress,  maybe  they  should 
consider  the  possibility  that  the  two  planets  will  simply  hover  in  place  for  some  amount  of 
time,  because  that  would  satisfy  conservation  of  energy.  Now  the  physical  implausibility  of  the 
hovering  solution  becomes  even  more  apparent.  Not  only  does  one  planet  have  to  “decide”  at 
precisely  what  microsecond  to  go  ahead  and  fall,  but  the  other  planet  has  to  make  the  same 
decision  at  the  same  instant,  or  else  conservation  of  energy  will  be  violated.  There  is  no  physical 
process  or  interaction  between  the  two  planets  that  could  perfectly  synchronize  their  “decisions” 
like  this.  (The  mechanism  can’t  be  gravity,  because  nothing  about  the  gravitational  interaction 
provides  any  kind  of  a  count-down  that  would  pick  out  one  particular  time  as  the  one  at  which 
the  planets  should  start  moving.) 

The  key  to  making  sense  of  all  this  is  to  realize  that  each  planet  can  only  “feel”  the  gravita¬ 
tional  field  in  its  own  region  of  space.  Its  acceleration  can  only  depend  on  the  field,  and  not  on 
the  detailed  arrangement  of  masses  elsewhere  in  the  universe  that  caused  that  field.  Granting 
this  kind  of  “real”  status  to  fields  can  be  considered  as  a  logically  necessary  supplement  to 
conservation  of  energy. 

Automated  search  for  the  brachistochrone 

See  page  95. 
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d= .  01 
cl=. 61905 
c2=-. 94427 
a  =  1. 
b  =  1. 

for  i  in  range(lOO): 
bestt  =  99. 
for  j  in  range (3): 
for  k  in  range(3): 
try_cl  =  cl+(j-l)*d 
try_c2  =  c2+(k-l)*d 
t  =  timeb(a,b,try_cl ,try_c2 , 100000) 
if  t<bestt  : 

bestcl  =  try_cl 
bestc2  =  try_c2 
best j  =  j 
bestk  =  k 
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18  bestt  =  t 

19  cl  =  bestcl 

20  c2  =  bestc2 

21  c3  =  (b-cl*a-c2*a**2) / (a**3) 

22  print (cl,  c2,  c3,  bestt) 

23  if  (bestj  ==  1)  and  (bestk  ==  1)  : 

24  d  =  d* .  5 

Derivation  of  the  steady  state  for  damped,  driven  oscillations 

Using  the  trig  identities  for  the  sine  of  a  sum  and  cosine  of  a  sum,  we  can  change  equation  [2] 
on  page  177  into  the  form 

[(— mu;2  +  k )  cos  5  —  bco  sin  <5  —  Fm/A\  sin  cot 
+  [(  —mu2  +  k )  sin  6  +  bco  cos  <5]  cos  cot  =  0. 

Both  the  quantities  in  square  brackets  must  equal  zero,  which  gives  us  two  equations  we  can 
use  to  determine  the  unknowns  A  and  5.  The  results  are 

r  .  -l  buj 

o  =  tan  - t. - - 

mujz  —  k 

-  tan”1  UUo 

“tan  QW-ufi) 

and 


Proofs  relating  to  angular  momentum 

Uniqueness  of  the  cross  product 

The  vector  cross  product  as  we  have  defined  it  has  the  following  properties: 

(1)  It  does  not  violate  rotational  invariance. 

(2)  It  has  the  property  Ax(B  +  C)  =  AxB  +  AxC. 

(3)  It  has  the  property  A  x  (AB)  =  k{ A  x  B),  where  A:  is  a  scalar. 

Theorem:  The  definition  we  have  given  is  the  only  possible  method  of  multiplying  two  vectors 
to  make  a  third  vector  which  has  these  properties,  with  the  exception  of  trivial  redefinitions 
which  just  involve  multiplying  all  the  results  by  the  same  constant  or  swapping  the  names 
of  the  axes.  (Specifically,  using  a  left-hand  rule  rather  than  a  right-hand  rule  corresponds  to 
multiplying  all  the  results  by  —1.) 

Proof:  We  prove  only  the  uniqueness  of  the  definition,  without  explicitly  proving  that  it  has 
properties  (1)  through  (3). 

Using  properties  (2)  and  (3),  we  can  break  down  any  vector  multiplication  (Axx  +  Ay y  + 
Az z)  x  (Bxic  +  By y  +  Bz z)  into  terms  involving  cross  products  of  unit  vectors. 

A  “self-term”  like  x  x  x  must  either  be  zero  or  lie  along  the  x  axis,  since  any  other  direction 
would  violate  property  (1).  If  was  not  zero,  then  (— x)  x  (— x)  would  have  to  lie  in  the  opposite 
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direction  to  avoid  breaking  rotational  invariance,  but  property  (3)  says  that  (— x)  x  (— x)  is  the 
same  as  x  x  x,  which  is  a  contradiction.  Therefore  the  self-terms  must  be  zero. 

An  “other-term”  like  x  x  y  could  conceivably  have  components  in  the  x-y  plane  and  along 
the  2  axis.  If  there  was  a  nonzero  component  in  the  x-y  plane,  symmetry  would  require  that  it 
lie  along  the  diagonal  between  the  x  and  y  axes,  and  similarly  the  in-the-plane  component  of 
(— x)  x  y  would  have  to  be  along  the  other  diagonal  in  the  x-y  plane.  Property  (3),  however, 
requires  that  (— x)  X  y  equal  —  (x  X  y),  which  would  be  along  the  original  diagonal.  The  only 
way  it  can  lie  along  both  diagonals  is  if  it  is  zero. 

We  now  know  that  x  x  y  must  lie  along  the  z  axis.  Since  we  are  not  interested  in  trivial 
differences  in  definitions,  we  can  fix  x  x  y  =  z,  ignoring  peurile  possibilities  such  as  x  x  y  =  7z 
or  the  left-handed  definition  x  X  y  =  —  z.  Given  x  x  y  =  z,  the  symmetry  of  space  requires  that 
similar  relations  hold  for  yxz  and  z  x  x,  with  at  most  a  difference  in  sign.  A  difference  in  sign 
could  always  be  eliminated  by  swapping  the  names  of  some  of  the  axes,  so  ignoring  possible  trivial 
differences  in  definitions  we  can  assume  that  the  cyclically  related  set  of  relations  x  x  y  =  z, 
y  x  z  =  x,  and  z  x  x  =  y  holds.  Since  the  arbitrary  cross-product  with  which  we  started  can 
be  broken  down  into  these  simpler  ones,  the  cross  product  is  uniquely  defined. 

The  choice  of  axis  theorem 

Theorem:  Suppose  a  closed  system  of  material  particles  conserves  angular  momentum  in  one 
frame  of  reference,  with  the  axis  taken  to  be  at  the  origin.  Then  conservation  of  angular 
momentum  is  unaffected  if  the  origin  is  relocated  or  if  we  change  to  a  frame  of  reference  that 
is  in  constant- velocity  motion  with  respect  to  the  first  one.  The  theorem  also  holds  in  the  case 
where  the  system  is  not  closed,  but  the  total  external  force  is  zero. 

Proof:  In  the  original  frame  of  reference,  angular  momentum  is  conserved,  so  we  have  dL/  dt=0. 
From  example  28  on  page  284,  this  derivative  can  be  rewritten  as 


dL 

df 


i  x  Fj, 


where  Fj  is  the  total  force  acting  on  particle  i.  In  other  words,  we’re  adding  up  all  the  torques 
on  all  the  particles. 

By  changing  to  the  new  frame  of  reference,  we  have  changed  the  position  vector  of  each 
particle  according  to  r — >  r j  +  k  —  u t,  where  k  is  a  constant  vector  that  indicates  the  relative 
position  of  the  new  origin  at  t  =  0,  and  u  is  the  velocity  of  the  new  frame  with  respect  to  the 
old  one.  The  forces  are  all  the  same  in  the  new  frame  of  reference,  however.  In  the  new  frame, 
the  rate  of  change  of  the  angular  momentum  is 

dL  v — v  , 

—  =  2^(ri  +  k  -  ut)  x  Fj 

i 

=  ^rjX  Fj  +  (k  -  u  t)  F<- 

i  i 


The  first  term  is  the  expression  for  the  rate  of  change  of  the  angular  momentum  in  the  original 
frame  of  reference,  which  is  zero  by  assumption.  The  second  term  vanishes  by  Newton’s  third 
law;  since  the  system  is  closed,  every  force  Fj  cancels  with  some  force  Fy.  (If  external  forces 
act,  but  they  add  up  to  zero,  then  the  sum  can  be  broken  up  into  a  sum  of  internal  forces  and 
a  sum  of  external  forces,  each  of  which  is  zero.)  The  rate  of  change  of  the  angular  momentum 
is  therefore  zero  in  the  new  frame  of  reference. 
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The  spin  theorem 

Theorem:  An  object’s  angular  momentum  with  respect  to  some  outside  axis  A  can  be  found 
by  adding  up  two  parts: 

(1)  The  first  part  is  the  object’s  angular  momentum  found  by  using  its  own  center  of  mass  as 
the  axis,  i.e.  the  angular  momentum  the  object  has  because  it  is  spinning. 

(2)  The  other  part  equals  the  angular  momentum  that  the  object  would  have  with  respect  to 
the  axis  A  if  it  had  all  its  mass  concentrated  at  and  moving  with  its  center  of  mass. 

Proof:  Let  rcm  be  the  position  of  the  center  of  mass.  The  total  angular  momentum  is 

L  =  J^ri  x  p?, 

i 

which  can  be  rewritten  as 

L  =  ^2  (rcm  +  Ti  -  rcm)  x  p,j, 

i 

where  iy  —  rcm  is  particle  i’s  position  relative  to  the  center  of  mass.  We  then  have 

L  =  rcm  x  ^  p.j  +  ^  (r*  -  rcm)  x  P* 

i  i 

=  rCm  X  ptotal  +  (ri  -  rcm)  X  Pi 
i 

=  rcm  x  mtotaivcm  +  ^  (r*  -  rcm)  x  p_i. 

i 

The  first  and  second  terms  in  this  expression  correspond  to  the  quantities  (2)  and  (1),  respec¬ 
tively. 

Different  Forms  of  Maxwell’s  Equations 

First  we  reproduce  Maxwell’s  equations  as  stated  on  page  701,  in  integral  form,  using  the  SI 
(meter-kilogram-second)  system  of  units,  with  the  coupling  constants  written  in  terms  of  k  and 
c: 


<&E  =  4 nkqin 
&B  =  0 


r  E 

c2Tb 


d&B 

~~dt~ 

d&E 

—  b  47 Tkl through 


Homework  problem  39  on  page  732  deals  with  rewriting  these  in  terms  of  eQ  =  1/47 xk  and 
Ho  =  Airk/c2  rather  than  k  and  c. 

For  the  reader  who  has  been  studying  the  optional  sections  giving  Maxwell’s  equations  in 
differential  form,  here  is  a  summary: 
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div  E  =  Ankp 
div  B  =  0 


,  „  3B 

curl  E  =  — — 
dt 

o  3E 

c  curl  B  =  — — b  47rfcj 
at 


Although  all  engineering  and  most  scientific  work  these  days  is  done  in  the  SI  (inks)  system, 
one  may  still  encounter  the  older  cgs  (centimeter-gram-second)  system,  especially  in  astronomy 
and  particle  physics.  The  mechanical  units  in  this  system  include  the  dyne  (g-crn/s2)  for  force, 
and  the  erg  (g-cm2/s2)  for  energy.  The  system  is  extended  to  electrical  units  by  taking  k  =  1 
as  a  matter  of  definition,  so  the  Coulomb  force  law  is  F  =  qicfr/r2.  This  equation  indirectly 
defines  a  unit  of  charge  called  the  elestrostatic  unit,  with  1  C  =  2.998  x  109  esu,  the  factor  of 
2.998  arising  from  the  speed  of  light.  The  unit  of  voltage  is  the  statvolt,  1  statvolt  =  299.8  V. 
In  this  system,  the  electric  and  magnetic  fields  have  the  same  units,  dynes/esu,  but  to  avoid 
confusion,  magnetic  fields  are  normally  written  using  the  equivalent  unit  of  gauss,  1  gauss=l 
dyne/esu=10-4  T.  The  force  on  a  charged  particle  is  F  =  qE  +  x  B,  which  differs  from  the 
inks  version  by  the  1  jc  factor  in  the  magnetic  term.  Maxwell’s  equations  are: 


$E  =  47T qin 
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Appendix  4:  Hints  and  Solutions 


Hints 

Hints  for  Chapter  2 

Page  120,  problem  16:  You  can  use  either  the  chain-rule  technique  from  page  83  or  the 
technique  prescribed  in  problem  15  on  p.  120.  The  positions  and  velocities  of  the  two  masses  are 
related  to  each  other,  and  you’ll  need  to  use  this  relationship  to  eliminate  one  mass’s  position  and 
velocity  and  get  everything  in  terms  of  the  other  mass’s  position  and  velocity.  The  relationship 
between  the  two  positions  will  involve  some  extraneous  variables  like  the  length  of  the  string, 
which  won’t  have  any  effect  on  your  final  result. 

Page  120,  problem  17:  This  is  similar  to  problem  16,  but  you’re  trying  to  find  the  combination 
of  masses  that  will  result  in  zero  acceleration.  In  this  problem,  the  distance  dropped  by  one 
weight  is  different  from,  but  still  related  to,  the  distance  by  which  the  other  weight  rises.  Try 
relating  the  heights  of  the  two  weights  to  each  other,  so  you  can  get  the  total  gravitational 
energy  in  terms  of  only  one  of  these  heights. 

Page  120,  problem  18:  This  is  similar  to  problem  17,  in  that  you’re  looking  for  a  setup  that 
will  give  zero  acceleration,  and  the  distance  the  middle  weight  rises  or  falls  is  not  the  same  as 
the  distance  the  other  two  weights  fall  or  rise.  The  simplest  approach  is  to  get  the  three  heights 
in  terms  of  6 ,  so  that  you  can  write  the  gravitational  energy  in  terms  of  8. 

Page  120,  problem  19:  This  is  very  similar  to  problems  16  and  17. 

Page  120,  problem  20:  The  first  two  parts  can  be  done  more  easily  by  setting  a  =  1,  since 
the  value  of  a  only  changes  the  distance  scale. 

Page  121,  problem  22:  The  condition  for  a  circular  orbit  contains  three  unknowns,  v,  g,  and 
r,  so  you  can’t  just  solve  it  for  r.  You’ll  need  more  equations  to  make  three  equations  in  three 
unknowns.  You’ll  need  a  relationship  between  g  and  r,  and  also  a  relationship  between  v  and  r 
that  uses  the  given  fact  that  it’s  supposed  to  take  24  hours  for  an  orbit. 

Page  121,  problem  25:  What  does  the  total  energy  have  to  be  if  the  projectile’s  velocity  is 
exactly  escape  velocity?  Write  down  conservation  of  energy,  change  v  to  dr/ df,  separate  the 
variables,  and  integrate. 

Page  121,  problem  26:  The  analytic  approach  is  a  little  cumbersome,  although  it  can  be 
done  by  using  approximations  like  l/\/l  +  e  ~  1  —  (l/2)e.  A  more  straightforward,  brute- force 
method  is  simply  to  write  a  computer  program  that  calculates  U/m  for  a  given  point  in  spherical 
coordinates.  By  trial  and  error,  you  can  fairly  rapidly  find  the  r  that  gives  a  desired  value  of 
U/m. 

Page  123,  problem  33:  Use  calculus  to  find  the  minimum  of  U . 

Page  123,  problem  35:  The  spring  constant  of  this  spring,  k ,  is  not  the  quantity  you  need 
in  the  equation  for  the  period.  What  you  need  in  that  equation  is  the  second  derivative  of  the 
spring’s  energy  with  respect  to  the  position  of  the  thing  that’s  oscillating.  You  need  to  start 
by  finding  the  energy  stored  in  the  spring  as  a  function  of  the  vertical  position,  y,  of  the  mass. 


This  is  similar  to  example  23  on  page  116. 

Page  124,  problem  37:  The  variables  x\  and  X2  will  adjust  themselves  to  reach  an  equilibrium. 
Write  down  the  total  energy  in  terms  of  x\  and  X2,  then  eliminate  one  variable,  and  find  the 
equilibrium  value  of  the  other.  Finally,  eliminate  both  x\  and  X2  from  the  total  energy,  getting 
it  just  in  terms  of  b. 

Hints  for  Chapter  3 

Page  221,  problem  20:  Write  down  two  equations,  one  for  Newton’s  second  law  applied  to 
each  object.  Solve  these  for  the  two  unknowns  T  and  a. 

Page  225,  problem  41:  The  whole  expression  for  the  amplitude  has  maxima  where  the  stuff 
inside  the  square  root  is  at  a  minimum,  and  vice  versa,  so  you  can  save  yourself  a  lot  of  work 
by  just  working  on  the  stuff  inside  the  square  root.  For  normal,  large  values  of  Q,  the  there 
are  two  extrema,  one  at  u>  =  0  and  one  at  resonance;  one  of  these  is  a  maximum  and  one  is  a 
minimum.  You  want  to  find  out  at  what  value  of  Q  the  zero-frequency  extremum  switches  over 
from  being  a  maximum  to  being  a  minimum. 

Page  230,  problem  69:  You  can  use  the  geometric  interpretation  of  the  dot  product. 

Page  230,  problem  70:  The  easiest  way  to  do  this  problem  is  to  use  two  different  coordinate 
systems:  one  that’s  tilted  to  coincide  with  the  upper  slope,  and  one  that’s  tilted  to  coincide 
with  the  lower  one. 

Hints  for  Chapter  4 

Page  288,  problem  8:  The  choice  of  axis  theorem  only  applies  to  a  closed  system,  or  to  a 
system  acted  on  by  a  total  force  of  zero.  Even  if  the  box  is  not  going  to  rotate,  its  center  of 
mass  is  going  to  accelerate,  and  this  can  still  cause  a  change  in  its  angular  momentum,  unless 
the  right  axis  is  chosen.  For  example,  if  the  axis  is  chosen  at  the  bottom  right  corner,  then  the 
box  will  start  accumulating  clockwise  angular  momentum,  even  if  it  is  just  accelerating  to  the 
right  without  rotating.  Only  by  choosing  the  axis  at  the  center  of  mass  (or  at  some  other  point 
on  the  same  horizontal  line)  do  we  get  a  constant,  zero  angular  momentum. 

Page  289,  problem  11:  There  are  four  forces  on  the  wheel  at  first,  but  only  three  when  it  lifts 
off.  Normal  forces  are  always  perpendicular  to  the  surface  of  contact.  Note  that  the  corner  of 
the  step  cannot  be  perfectly  sharp,  so  the  surface  of  contact  for  this  force  really  coincides  with 
the  surface  of  the  wheel. 

Page  289,  problem  12:  How  is  this  different  from  the  case  where  you  whirl  a  rock  in  a  circle 
on  a  string  and  gradually  pull  in  the  string? 

Page  292,  problem  24:  The  acceleration  and  the  tension  in  the  string  are  unknown.  Use 
r  =  d L/  df  and  F  =  ma  to  determine  these  two  unknowns. 

Page  294,  problem  35:  You’ll  need  the  result  of  problem  19  in  order  to  relate  the  energy  and 
angular  momentum  of  a  rigidly  rotating  body.  Since  this  relationship  involves  a  variable  raised 
to  a  power,  you  can’t  just  graph  the  data  and  get  the  moment  of  inertia  directly.  One  way 
to  get  around  this  is  to  manipulate  one  of  the  variables  to  make  the  graph  linear.  Here  is  an 
example  of  this  technique  from  another  context.  Suppose  you  were  given  a  table  of  the  masses, 
m,  of  cubical  pieces  of  wood,  whose  sides  had  various  lengths,  b.  You  want  to  find  a  best-fit 
value  for  the  density  of  the  wood.  The  relationship  is  m  =  pb 3.  The  graph  of  m  versus  b  would 
be  a  curve,  and  you  would  not  have  any  easy  way  to  get  the  density  from  such  a  graph.  But 
by  graphing  m  versus  63,  you  can  produce  a  graph  that  is  linear,  and  whose  slope  equals  the 
density. 
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Hints  for  Chapter  6 

Page  381,  problem  4:  How  could  you  change  the  values  of  x  and  t  so  that  the  value  of  y 
would  remain  the  same?  What  would  this  represent  physically? 

Page  383,  problem  12:  The  answers  to  the  two  parts  are  not  the  same. 

Page  382,  problem  8:  (a)  The  most  straightforward  approach  is  to  apply  the  equation 

d2y/dt 2  =  (T / y)d2y / dx2 .  Although  this  equation  was  developed  in  the  main  text  in  the  context 
of  a  straight  string  with  a  curvy  wave  on  it,  it  works  just  as  well  for  a  circular  loop;  the  left-hand 
side  is  simply  the  inward  acceleration  of  any  point  on  the  rope.  Note,  however,  that  we’ve  been 
assuming  the  string  was  (at  least  approximately)  parallel  to  the  x  axis,  which  will  only  be  true 
if  you  choose  a  specific  value  of  x.  You  need  to  get  an  equation  for  y  in  terms  of  x  in  order  to 
evaluate  the  right-hand  side. 

Hints  for  Chapter  7 

Page  449,  problem  28:  Apply  the  equivalence  principle. 

Hints  for  Chapter  10 

Page  640,  problem  15:  Use  the  approximation  (1  +  e)p  ~  1  +  pe,  which  is  valid  for  small  e. 

Page  644,  problem  31:  The  math  is  messy  if  you  put  the  origin  of  your  polar  coordinates  at 

the  center  of  the  disk.  It  comes  out  much  simpler  if  you  put  the  origin  at  the  edge,  right  on  top 
of  the  point  at  which  we’re  trying  to  compute  the  voltage. 

Page  643,  problem  26:  Since  we  have  (<r,  the  volume  of  the  membrane  is  essentially  the 
same  as  if  it  was  unrolled  and  flattened  out,  and  the  field’s  magnitude  is  nearly  constant. 

Page  643,  problem  25:  First  find  the  energy  stored  in  a  spherical  shell  extending  from  r  to 
r  +  dr,  then  integrate  to  find  the  total  energy. 

Page  648,  problem  59:  We  have  D  =  ql  and  Fx  =  qbl  =  DdEx/dx ,  which,  as  claimed,  is 
consistent  with  the  result  of  example  6  on  p.  573  and  depends  on  q  and  i  only  via  D. 

Hints  for  Chapter  11 

Page  729,  problem  24:  A  stable  system  has  low  energy;  energy  would  have  to  be  added  to 
change  its  configuration. 

Page  733,  problem  41:  We’re  ignoring  the  fact  that  the  light  consists  of  little  wavepackets, 
and  imagining  it  as  a  simple  sine  wave.  But  wait,  there’s  more  good  news!  The  energy  density 
depends  on  the  squares  of  the  fields,  which  means  the  squares  of  some  sine  waves.  Well,  when 
you  square  a  sine  wave  that  varies  from  —1  to  +1,  you  get  a  sine  wave  that  goes  from  0  to  +1, 
and  the  average  value  of  that  sine  wave  is  1/2.  That  means  you  don’t  have  to  do  an  integral 
like  U  =  f  (dU/  dU)  dV.  All  you  have  to  do  is  throw  in  the  appropriate  factor  of  1/2,  and  you 
can  pretend  that  the  fields  have  their  constant  values  E  and  B  everywhere. 

Page  733,  problem  42:  Use  Faraday’s  law,  and  choose  an  Amperian  surface  that  is  a  disk  of 
radius  R  sandwiched  between  the  plates. 

Page  735,  problem  51:  (a)  Magnetic  fields  are  created  by  currents,  so  once  you’ve  decided 
how  currents  behave  under  time-reversal,  you  can  figure  out  how  magnetic  fields  behave. 

Hints  for  Chapter  12 

Page  818,  problem  60:  Expand  sin#  in  a  Taylor  series  around  9  =  90  °. 

Answers  to  Self-Checks 

Answers  to  Self-Checks  for  Chapter  0 
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Page  15:  If  only  he  has  the  special  powers,  then  his  results  can  never  be  reproduced. 

Page  17:  They  would  have  had  to  weigh  the  rays,  or  check  for  a  loss  of  weight  in  the  object 
from  which  they  were  have  emitted.  (For  technical  reasons,  this  was  not  a  measurement  they 
could  actually  do,  hence  the  opportunity  for  disagreement.) 

Page  25:  A  dictionary  might  define  “strong”  as  “possessing  powerful  muscles,”  but  that’s  not 
an  operational  definition,  because  it  doesn’t  say  how  to  measure  strength  numerically.  One 
possible  operational  definition  would  be  the  number  of  pounds  a  person  can  bench  press. 

Page  28:  A  microsecond  is  1000  times  longer  than  a  nanosecond,  so  it  would  seem  like  1000 
seconds,  or  about  20  minutes. 

Page  29:  Exponents  have  to  do  with  multiplication,  not  addition.  The  first  line  should  be  100 
times  longer  than  the  second,  not  just  twice  as  long. 

Page  32:  The  various  estimates  differ  by  5  to  10  million.  The  CIA’s  estimate  includes  a 
ridiculous  number  of  gratuitous  significant  figures.  Does  the  CIA  understand  that  every  day, 
people  in  are  born  in,  die  in,  immigrate  to,  and  emigrate  from  Nigeria? 

Page  33:  (1)  4;  (2)  2;  (3)  2 

Page  36:  1  yd2  x  (3  ft/1  yd)2  =  9  ft2 
1  yd3  x  (3  ft/1  yd)3  =  27  ft3 

Page  42:  Ci/C2  =  (wi/w2)a 

Answers  to  Self-Checks  for  Chapter  1 

Page  58:  The  stream  has  to  spread  out.  When  the  velocity  becomes  zero,  it  seems  like  the 
cross-sectional  area  has  to  become  infinite.  In  reality,  this  is  the  point  where  the  water  turns 
around  and  comes  back  down.  The  infinity  isn’t  real;  it  occurs  mathematically  because  we  used 
a  simplified  model  of  the  the  stream  of  water,  assuming,  for  instance,  that  the  water’s  velocity 
is  always  straight  up. 

Page  60:  A  positive  Ax  means  the  object  is  moving  in  the  same  direction  as  the  positive  x 
axis.  A  negative  Ax  means  it’s  going  the  opposite  direction. 

Page  66:  (1)  The  effect  only  occurs  during  blastoff,  when  their  velocity  is  changing.  Once  the 
rocket  engines  stop  firing,  their  velocity  stops  changing,  and  they  no  longer  feel  any  effect.  (2) 
It  is  only  an  observable  effect  of  your  motion  relative  to  the  air. 

Page  68:  Galilean  relativity  says  that  experiments  can’t  come  out  differently  just  because 
they’re  performed  while  in  motion.  The  tilting  of  the  surface  tells  us  the  train  is  accelerating, 
but  it  doesn’t  tell  us  anything  about  the  train’s  velocity  at  that  instant.  The  person  in  the  train 
might  say  the  bottle’s  velocity  was  zero  (but  changing),  whereas  a  person  working  in  a  reference 
frame  attached  to  the  dirt  outside  says  it’s  moving;  they  don’t  agree  on  velocities.  They  do 
agree  on  accelerations.  The  person  in  the  train  has  to  agree  that  the  train  is  accelerating,  since 
otherwise  there’s  no  reason  for  the  funny  tilting  effect. 

Page  69:  Yes.  In  U.S.  currency,  for  instance,  the  quantum  of  money  is  one  cent. 


Answers  to  Self-Checks  for  Chapter  2 

Page  90:  There  are  two  reasonable  possibilities  we  could  imagine  —  neither  of  which  ends  up 
making  much  sense  —  if  we  insist  on  the  straight-line  trajectory.  (1)  If  the  car  has  constant 
speed  along  the  line,  then  in  the  *  frame  we  see  it  going  straight  down  at  constant  speed.  It 
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makes  sense  that  it  goes  straight  down  in  the  *  frame  of  reference,  since  in  that  frame  it  was 
never  moving  horizontally,  and  there’s  no  reason  for  it  to  start.  However,  it  doesn’t  make  sense 
that  it  goes  down  with  constant  speed,  since  falling  objects  are  supposed  to  speed  up  the  whole 
time  they  fall.  This  violates  both  Galilean  relativity  and  conservation  of  energy.  (2)  If  it’s 
speeding  up  and  moving  along  a  diagonal  line  in  the  original  frame,  then  it  might  be  conserving 
energy  in  one  frame  or  the  other.  But  if  it’s  speeding  up  along  a  line,  then  as  seen  in  the  original 
frame,  both  its  vertical  motion  and  its  horizontal  motion  must  be  speeding  up.  If  its  horizontal 
velocity  is  increasing  in  the  original  frame,  then  it  can’t  be  zero  and  remain  zero  in  the  *  frame. 
This  violates  Galilean  relativity,  since  in  the  *  frame  the  car  apparently  starts  moving  sideways 
for  no  reason. 

Answers  to  Self-Checks  for  Chapter  3 

Page  143:  By  shifting  his  weight  around,  he  can  cause  the  center  of  mass  not  to  coincide  with 
the  geometric  center  of  whe  wheel. 

Page  151:  (1)  This  is  motion,  not  force.  (2)  This  is  a  description  of  how  the  sub  is  able  to  get 
the  water  to  produce  a  forward  force  on  it.  (3)  The  sub  runs  out  of  energy,  not  force. 

Page  153:  (1)  It’s  kinetic  friction,  because  her  uniform  is  sliding  over  the  dirt.  (2)  It’s  static 
friction,  because  even  though  the  two  surfaces  are  moving  relative  to  the  landscape,  they’re  not 
slipping  over  each  other.  (3)  Only  kinetic  friction  creates  heat,  as  when  you  rub  your  hands 
together.  If  you  move  your  hands  up  and  down  together  without  sliding  them  across  each  other, 
no  heat  is  produced  by  the  static  friction. 

Page  152:  Frictionless  (or  nearly  frictionless)  ice  can  certainly  make  a  normal  force,  since 
otherwise  a  hockey  puck  would  sink  into  the  ice.  Friction  is  not  possible  without  a  normal  force, 
however:  we  can  see  this  from  the  equation,  or  from  common  sense,  e.g.  while  sliding  down  a 
rope  you  don’t  get  any  friction  unless  you  grip  the  rope. 

Page  154:  (1)  Normal  forces  are  always  perpendicular  to  the  surface  of  contact,  which  means 
right  or  left  in  this  figure.  Normal  forces  are  repulsive,  so  the  cliff’s  force  on  the  feet  is  to  the 
right,  i.e. ,  away  from  the  cliff.  (2)  Frictional  forces  are  always  parallel  to  the  surface  of  contact, 
which  means  right  or  left  in  this  figure.  Static  frictional  forces  are  in  the  direction  that  would 
tend  to  keep  the  surfaces  from  slipping  over  each  other.  If  the  wheel  was  going  to  slip,  its  surface 
would  be  moving  to  the  left,  so  the  static  frictional  force  on  the  wheel  must  be  in  the  direction 
that  would  prevent  this,  i.e.,  to  the  right.  This  makes  sense,  because  it  is  the  static  frictional 
force  that  accelerates  the  dragster.  (3)  Normal  forces  are  always  perpendicular  to  the  surface  of 
contact.  In  this  diagram,  that  means  either  up  and  to  the  left  or  down  and  to  the  right.  Normal 
forces  are  repulsive,  so  the  ball  is  pushing  the  bat  away  from  itself.  Therefore  the  ball’s  force  is 
down  and  to  the  right  on  this  diagram. 

Page  195: 


Page  171:  The  dashed  lines  on  the  graph  are  about  twice  as  far  apart  in  the  second  cycle 
compared  to  the  first,  so  the  amplitude  has  doubled.  For  sufficiently  small  oscillations  around 
an  equilibrium  with  x  =  0  and  17(0)  =  0,  it’s  always  a  good  approximation  to  take  U  oc  x2, 
so  the  energy  is  proportional  to  the  square  of  the  amplitude;  this  is  a  general  fact  about  all 
oscillations,  provided  that  the  amplitude  is  small.  Since  the  amplitude  doubled,  the  energy 
quadrupled. 
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Page  174:  The  two  graphs  start  off  with  the  same  amplitude,  but  the  solid  curve  loses  amplitude 
more  rapidly.  For  a  given  time,  t,  the  quantity  e~ct  is  apparently  smaller  for  the  solid  curve, 
meaning  that  ct  is  greater.  The  solid  curve  has  the  higher  value  of  c. 

Page  176:  A  decaying  exponential  never  dies  out  to  zero  in  any  finite  amount  of  time. 

Page  180:  In  the  expression 

A  ==  -  m 

my  {uj2  —  uj2)2  +  uj2uj2Q~2 

from  page  952,  substituting  uj  =  ujq  makes  the  first  term  inside  the  square  root  vanish,  which 
should  make  the  denominator  pretty  small,  thereby  producing  a  pretty  big  amplitude.  In  the 
limit  of  Q  =  oo,  Q~2  =  0,  so  the  second  term  vanishes,  and  uj  =  oj0  actually  produces  an  infinite 
amplitude.  For  values  of  Q  that  are  large  but  finite,  we  still  expect  to  get  resonance  pretty  close 
to  uj  =  ojq.  Setting  uj  =  uj0  in  the  finite-Q  case,  the  first  term  vanishes,  we  can  simplify  the 
square  root,  and  the  result  ends  up  being  A  oc  1  / \J Q~ 2  oc  Q.  This  is  only  an  approximation, 
because  we  had  to  assume  early  on  that  Q  was  large. 

Page  194:  F  =  ma 

Page  ??: 


Answers  to  Self-Checks  for  Chapter  4 

Page  256:  Torques  1,  2,  and  4  all  have  the  same  sign,  because  they  are  trying  to  twist  the 
wrench  clockwise.  The  sign  of  3  is  opposite  to  the  signs  of  1,  2,  and  4.  The  magnitude  of  3  is  the 
greatest,  since  it  has  a  large  r  and  the  force  is  nearly  all  perpendicular  to  the  wrench.  Torques 
1  and  2  are  the  same  because  they  have  the  same  values  of  r  and  F±.  Torque  4  is  the  smallest, 
due  to  its  small  r. 

Page  265:  One  person’s  6-t  graph  would  simply  be  shifted  up  or  down  relative  to  the  others. 
The  derivative  equals  the  slope  of  the  tangent  line,  and  this  slope  isn’t  changed  when  you  shift 
the  graph,  so  both  people  would  agree  on  the  angular  velocity. 

Page  267:  Reversing  the  direction  of  oj  also  reverses  the  direction  of  motion,  and  this  is  reflected 
by  the  relationship  between  the  plus  and  minus  signs.  In  the  equation  for  the  radial  acceleration, 
oo  is  squared,  so  even  if  oj  is  negative,  the  result  is  positive.  This  makes  sense  because  the 
acceleration  is  always  inward  in  circular  motion,  never  outward. 

Page  279:  All  the  rotations  around  the  x  axis  give  uj  vectors  along  the  positive  x  axis  (thumb 
pointing  along  positive  x),  and  all  the  rotations  about  the  y  axis  have  uj  vectors  with  positive  y 
components. 

Page  282:  For  example,  if  we  take  (A  x  B)x  =  AyBz  —  ByAz  and  reverse  the  A’s  and  B’s,  we 
get  (B  x  A)x  =  ByAz  —  AyBz,  which  is  just  the  negative  of  the  original  expression. 

Answers  to  Self-Checks  for  Chapter  5 

Page  303:  Solids  can  exert  shear  forces.  A  solid  could  be  in  an  equilibrium  in  which  the  shear 
forces  were  canceling  the  forces  due  to  sideways  pressure  gradients.  For  example,  if  I  push  on 
a  brick  wall,  it  will  give  by  perhaps  a  millionth  of  an  inch,  but  it  will  reach  an  equilibrium,  in 
which  the  shear  forces  cancel  out  the  effect  of  the  pressure  gradient. 
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Page  303:  (1)  Not  valid.  The  equation  only  applies  to  fluids.  (2)  Valid.  The  density  of  the  air 
is  nearly  constant  between  the  top  and  bottom  of  the  building.  (3)  Not  valid.  There  is  a  large 
difference  is  the  density  of  the  air  between  the  top  and  the  bottom  of  the  mountain.  (4)  Not 
valid,  because  g  isn’t  constant  throughout  the  interior  of  the  earth.  (5)  Not  valid,  because  the 
air  is  flowing  around  the  wing.  The  air  is  accelerating,  so  it  is  not  in  equilibrium. 

Page  324:  Heating  the  gas  at  constant  pressure  requires  adding  heat  to  it,  which  increases  its 
entropy.  To  increase  the  gas’s  pressure  while  keeping  its  temperature  constant,  we  would  have 
to  compress  it,  which  would  give  it  a  smaller  volume  to  inhabit,  and  therefore  fewer  possible 
positions  for  each  atom.  The  whole  thing  has  to  be  proportional  to  n  because  entropy  is  additive. 

Answers  to  Self-Checks  for  Chapter  6 

Page  346:  The  leading  edge  is  moving  up,  the  trailing  edge  is  moving  down,  and  the  top  of  the 
hump  is  motionless  for  one  instant. 

Page  365:  The  energy  of  a  wave  is  usually  proportional  to  the  square  of  the  amplitude.  Squaring 
a  negative  number  gives  a  positive  result,  so  the  energy  is  the  same 

Page  366:  A  substance  is  invisible  to  sonar  if  the  speed  of  sound  waves  in  it  is  the  same  as  in 
water.  Reflections  occur  only  at  boundaries  between  media  in  which  the  wave  speed  is  different. 

Page  367:  No.  A  material  object  that  loses  kinetic  energy  slows  down,  but  a  wave  is  not  a 
material  object.  The  velocity  of  a  wave  ordinarily  only  depends  on  the  medium,  not  on  the 
amplitude.  The  speed  of  soft  sound,  for  example,  is  the  same  as  the  speed  of  loud  sound. 

Page  374:  No.  To  get  the  best  possible  interference,  the  thickness  of  the  coating  must  be  such 
that  the  second  reflected  wave  train  lags  behind  the  first  by  an  integer  number  of  wavelengths. 
Optimal  performance  can  therefore  only  be  produced  for  one  specific  color  of  light.  The  typical 
greenish  color  of  the  coatings  shows  that  it  does  the  worst  job  for  green  light. 

Page  375:  The  period  is  the  time  required  to  travel  a  distance  2 L  at  speed  v,  T  =  2L/v.  The 
frequency  is  /  =  1/T  =  v/2 L. 

Page  379:  The  wave  pattern  will  look  like  this:  0(  Three  quarters  of  a  wavelength  fit  in  the 
tube,  so  the  wavelength  is  three  times  shorter  than  that  of  the  lowest-frequency  mode,  in  which 
one  quarter  of  a  wave  fits.  Since  the  wavelength  is  smaller  by  a  factor  of  three,  the  frequency  is 
three  times  higher.  Instead  of  /0,  2/0,  3/0,  4/0,  . . .,  the  pattern  of  wave  frequencies  of  this  air 
column  goes  /G,  3 fQ,  5 fQ,  7/0,  . . . 

Answers  to  Self-Checks  for  Chapter  7 

Page  394:  At  v  =  0,  we  get  7  =  1,  so  t  =  T.  There  is  no  time  distortion  unless  the  two  frames 
of  reference  are  in  relative  motion. 

Page  416:  The  total  momentum  is  zero  before  the  collision.  After  the  collision,  the  two  momenta 
have  reversed  their  directions,  but  they  still  cancel.  Neither  object  has  changed  its  kinetic  energy, 
so  the  total  energy  before  and  after  the  collision  is  also  the  same. 

Page  414:  Both  the  time  axis  and  the  position  axis  have  been  turned  around.  Flipping  the 
time  axis  means  that  the  roles  of  transmitter  and  receiver  have  been  swapped,  and  it  also  means 
that  Alice  and  Betty  are  approaching  one  another  rather  that  receding.  The  time  experienced 
by  the  receiving  observer  is  now  the  longer  one,  so  the  Doppler-shift  factor  has  been  inverted: 
the  receiver  now  measures  a  Doppler  shift  of  1/2  rather  than  2  in  frequency. 

Page  423:  At  v  =  0,  we  have  7  =  1,  so  the  mass-energy  is  me2  as  claimed.  As  v  approaches  c, 
7  approaches  infinity,  so  the  mass  energy  becomes  infinite  as  well. 


964 


Chapter  Appendix  4:  Hints  and  Solutions 


Answers  to  Self-Checks  for  Chapter  8 

Page  463:  Either  type  can  be  involved  in  either  an  attraction  or  a  repulsion.  A  positive  charge 
could  be  involved  in  either  an  attraction  (with  a  negative  charge)  or  a  repulsion  (with  another 
positive),  and  a  negative  could  participate  in  either  an  attraction  (with  a  positive)  or  a  repulsion 
(with  a  negative). 

Page  464:  It  wouldn’t  make  any  difference.  The  roles  of  the  positive  and  negative  charges  in 
the  paper  would  be  reversed,  but  there  would  still  be  a  net  attraction. 

Page  474:  Yes.  In  U.S.  currency,  the  quantum  of  money  is  the  penny. 

Page  494:  Thomson  was  accelerating  electrons,  which  are  negatively  charged.  This  apparatus 
is  supposed  to  accelerated  atoms  with  one  electron  stripped  off,  which  have  positive  net  charge. 
In  both  cases,  a  particle  that  is  between  the  plates  should  be  attracted  by  the  forward  plate  and 
repelled  by  the  plate  behind  it. 

Page  502:  The  hydrogen- 1  nucleus  is  simple  a  proton.  The  binding  energy  is  the  energy  required 
to  tear  a  nucleus  apart,  but  for  a  nucleus  this  simple  there  is  nothing  to  tear  apart. 

Answers  to  Self-Checks  for  Chapter  9 

Page  531:  The  large  amount  of  power  means  a  high  rate  of  conversion  of  the  battery’s  chemical 
energy  into  heat.  The  battery  will  quickly  use  up  all  its  energy,  i.e.  “burn  out.” 

Answers  to  Self-Checks  for  Chapter  10 

Page  569:  The  reasoning  is  exactly  analogous  to  that  used  in  example  1  on  page  567  to  derive 
an  equation  for  the  gravitational  field  of  the  earth.  The  field  is  F/qt  =  ( kQqt/r2)/qt  =  kQ/r2. 

Page  576: 


Er 


_dP 

dx 

fkQ 
d  x  \  r 
kQ 


Page  577:  (a)  The  voltage  (height)  increases  as  you  move  to  the  east  or  north.  If  we  let  the 
positive  x  direction  be  east,  and  choose  positive  y  to  be  north,  then  dV/  dx  and  dV /  dy  are  both 
positive.  This  means  that  Ex  and  Ey  are  both  negative,  which  makes  sense,  since  the  water  is 
flowing  in  the  negative  x  and  y  directions  (south  and  west). 

(b)  The  electric  fields  are  all  pointing  away  from  the  higher  ground.  If  this  was  an  electrical 
map,  there  would  have  to  be  a  large  concentration  of  charge  all  along  the  top  of  the  ridge,  and 
especially  at  the  mountain  peak  near  the  south  end. 

Page  589:  (a)  The  energy  density  depends  on  E  •  E,  which  equals  E2  +  E2  +  E2. 

(b)  Since  Ex  is  squared,  reversing  its  sign  has  no  effect  on  the  energy  density.  This  makes  sense, 
because  otherwise  we’d  be  saying  that  the  positive  and  negative  x  axes  in  space  were  somehow 
physically  different  in  their  behavior,  which  would  violate  the  symmetry  of  space. 
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Page  589: 


N-1m~2C2V2m-2m2m  =  N_1m“1C2V2 

=  N_1m_1J2 
=  J”^2 
=  J 


Page  598:  Yes.  The  mass  has  the  same  kinetic  energy  regardless  of  which  direction  it’s  moving. 
Friction  coverts  mechanical  energy  into  heat  at  the  same  rate  whether  the  mass  is  sliding  to  the 
right  or  to  the  left.  The  spring  has  an  equilibrium  length,  and  energy  can  be  stored  in  it  either 
by  compressing  it  (x  <  0)  or  stretching  it  (x  >  0). 

Page  599:  Velocity,  v,  is  the  derivative  of  position,  x,  with  respect  to  time.  This  is  exactly 
analogous  to  /  =  dq/  dt. 

Page  614:  The  impedance  depends  on  the  frequency  at  which  the  capacitor  is  being  driven.  It 
isn’t  just  a  single  value  for  a  particular  capacitor. 

Page  609:  Say  we’re  looking  for  u  =  y/z,  i.e. ,  we  want  a  number  u  that,  multiplied  by  itself, 
equals  z.  Multiplication  multiplies  the  magnitudes,  so  the  magnitude  of  u  can  be  found  by 
taking  the  square  root  of  the  magnitude  of  z.  Since  multiplication  also  adds  the  arguments 
of  the  numbers,  squaring  a  number  doubles  its  argument.  Therefore  we  can  simply  divide  the 
argument  of  z  by  two  to  find  the  argument  of  u.  This  results  in  one  of  the  square  roots  of  z. 
There  is  another  one,  which  is  —u,  since  (— u)2  is  the  same  as  it2.  This  may  seem  a  little  odd: 
if  u  was  chosen  so  that  doubling  its  argument  gave  the  argument  of  z,  then  how  can  the  same 
be  true  for  —  u?  Well  for  example,  suppose  the  argument  of  z  is  4°.  Then  arg  u  =  2°,  and 
arg(— u)  =  182°.  Doubling  182  gives  364,  which  is  actually  a  synonym  for  4  degrees. 

Page  613:  Only  cos(6f  —  4)  can  be  represented  by  a  complex  number.  Although  the  graph  of 
cos2  t  does  have  a  sinusoidal  shape,  it  varies  between  0  and  1,  rather  than  —1  and  1,  and  there 
is  no  way  to  represent  that  using  complex  numbers.  The  function  tan  t  doesn’t  even  have  a 
sinusoidal  shape. 

Page  626:  The  quantity  4irkqin  is  now  negative,  so  we’d  better  get  a  negative  flux  on  the  other 
side  of  Gauss’  theorem.  We  do,  because  each  field  vector  E j  is  inward,  while  the  corresponding 
area  vector,  A  j,  is  outward.  Vectors  in  opposite  directions  make  negative  dot  products. 

Answers  to  Self-Checks  for  Chapter  11 

Page  664:  For  instance,  imagine  a  small  sphere  around  the  negative  charge,  which  we  would 
sketch  on  the  two-dimensional  paper  as  a  circle.  The  field  points  inward  at  every  point  on 
the  sphere,  so  all  the  contributions  to  the  flux  are  negative.  There  is  no  cancellation,  and  the 
total  flux  is  negative,  which  is  consistent  with  Gauss’  law,  since  the  sphere  encloses  a  negative 
charge.  Copying  the  same  surface  onto  the  field  of  the  bar  magnet,  however,  we  find  that  there  is 
inward  flux  on  the  top  and  outward  flux  on  the  bottom,  where  the  surface  is  inside  the  magnet. 
According  to  Gauss’  law  for  magnetism,  these  cancel  exactly,  which  is  plausible  based  on  the 
figure. 

Page  662:  From  the  top  panel  of  the  figure,  where  the  magnetic  field  is  turned  off,  we  can 
see  that  the  beam  leaves  the  cathode  traveling  upward,  so  in  the  bottom  figure  the  electrons 
must  be  circlng  in  the  counterclockwise  direction.  To  produce  circular  motion,  the  force  must  be 
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towards  the  center  of  the  circle.  We  can  arbitrarily  pick  a  point  on  the  circle  at  which  to  analyze 
the  vectors  —  let’s  pick  the  right-hand  side.  At  this  point,  the  velocity  vector  of  the  electrons 
is  upward.  Since  the  electrons  are  negatively  charged,  the  force  qv  x  B  is  given  by  —  v  x  B, 
not  +v  x  B.  Circular  orbits  are  produced  when  the  motion  is  in  the  plane  perpendicular  to 
the  field,  so  the  held  must  be  either  into  or  out  of  the  page.  If  the  held  was  into  the  page,  the 
right-hand  rule  would  give  v  x  B  to  the  left,  which  is  towards  the  center,  but  the  force  would 
be  in  the  direction  of  —  v  x  B,  which  would  be  outwards.  The  held  must  be  out  of  the  page. 

Page  668:  Plugging  z  =  0  into  the  equation  gives  Bz  =  Akl/c2h.  This  is  simply  twice  the  held 
of  a  single  wire  at  a  distance  h.  At  this  location,  the  helds  contributed  by  the  two  wires  are 
parallel,  so  vector  addition  simply  gives  a  vector  twice  as  strong. 

Page  680:  The  circulation  around  the  Amperian  surface  we  used  was  counterclockwise,  since 
the  held  on  the  bottom  was  to  the  right.  Applying  the  right-hand  rule,  the  current  Ithrough  must 
have  been  out  of  the  page  at  the  top  of  the  solenoid,  and  into  the  page  at  the  bottom. 

Page  680:  The  quantity  £  came  in  because  we  set  q  =  NI/£.  Based  on  that,  it’s  clear  that  i 
represents  the  length  of  the  solenoid,  not  the  length  of  the  wire. 

Page  681:  Doubling  the  radius  of  the  solenoid  would  mean  that  every  distance  in  the  problem 
would  be  doubled,  which  would  tend  to  make  the  fields  weaker,  since  fields  fall  off  with  distance. 
However,  doubling  the  radius  would  also  mean  that  we  had  twice  as  much  wire,  and  therefore 
twice  as  many  moving  charges  to  create  magnetic  helds.  Since  the  magnetic  held  of  a  wire  falls 
off  like  1/r,  it’s  not  surprising  that  the  first  effect  amounts  to  exactly  a  factor  of  1/2,  which  is 
exactly  enough  to  cancel  out  the  factor  of  2  from  the  second  effect. 

Page  693:  Unless  the  engine  is  already  turning  over,  the  permanent  magnet  isn’t  spinning,  so 
there  is  no  change  in  the  magnetic  held.  Only  a  changing  magnetic  held  creates  an  induced 
electric  held. 

Page  698:  Let’s  get  all  the  electrical  units  in  terms  of  Teslas.  Electric  held  units  can  be 
expressed  as  T  •  m/s.  The  circulation  of  the  electric  held  has  units  of  electric  held  multiplied  by 
distance,  or  T  •  m2/s.  On  the  right  side,  the  derivative  <9B  jdt  has  units  of  T/s,  and  multiplying 
this  my  area  gives  units  of  T  •  m2/s,  just  like  on  the  left  side. 

Page  716:  An  (idealized)  battery  is  a  circuit  element  that  always  maintains  the  same  voltage 
difference  across  itself,  so  by  the  loop  rule,  the  voltage  difference  across  the  capacitor  must 
remain  unchanged,  even  while  the  dielectric  is  being  withdrawn.  The  bound  charges  on  the 
surfaces  of  the  dielectric  have  been  attracting  the  free  charges  in  the  plates,  causing  them  to 
charge  up  more  than  they  ordinarily  would  have.  As  the  dielectric  is  withdrawn,  the  capacitor 
will  be  partially  discharged,  and  we  will  observe  a  current  in  the  ammeter.  Since  the  dielectric 
is  attracted  to  the  plates,  positive  work  is  done  in  extracting  it,  indicating  that  there  must  be  an 
increase  in  the  electrical  energy  stored  in  the  capacitor.  This  may  seem  paradoxical,  since  the 
energy  stored  in  a  capacitor  is  (1/2)CU2,  and  we  are  decreasing  the  capacitance.  However,  the 
energy  (1/2)CV2  is  calculated  in  terms  of  the  work  required  to  deposit  the  free  charge  on  the 
plates.  In  addition  to  this  energy,  there  is  also  energy  stored  in  the  dielectric  itself.  By  moving 
its  bound  charges  farther  away  from  the  free  charges  in  the  plates,  to  which  they  are  attracted, 
we  have  increased  their  electrical  energy.  This  energy  of  the  bound  charges  is  inaccessible  to 
the  electric  circuit. 

Answers  to  Self-Checks  for  Chapter  12 

Page  752:  Only  1  is  correct.  If  you  draw  the  normal  that  bisects  the  solid  ray,  it  also  bisects 
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the  dashed  ray. 

Page  756:  You  should  have  found  from  your  ray  diagram  that  an  image  is  still  formed,  and  it 
has  simply  moved  down  the  same  distance  as  the  real  face.  However,  this  new  image  would  only 
be  visible  from  high  up,  and  the  person  can  no  longer  see  his  own  image. 

Page  761:  Increasing  the  distance  from  the  face  to  the  mirror  has  decreased  the  distance  from 
the  image  to  the  mirror.  This  is  the  opposite  of  what  happened  with  the  virtual  image. 

Page  771:  At  the  top  of  the  graph,  di  approaches  infinity  when  d0  approaches  /.  Interpretation: 
the  rays  just  barely  converge  to  the  right  of  the  mirror. 

On  the  far  right,  di  approaches  /  as  dQ  approaches  infinity;  this  is  the  definition  of  the  focal 
length. 

At  the  bottom,  di  approaches  negative  infinity  when  d0  approaches  /  from  the  other  side. 
Interpretation:  the  rays  don’t  quite  converge  on  the  right  side  of  the  mirror,  so  they  appear  to 
have  come  from  a  virtual  image  point  very  far  to  the  left  of  the  mirror. 

Page  780:  (1)  If  m  and  ri2  are  equal,  Snell’s  law  becomes  sin^i  =  sin  62,  which  implies  6 1  =  62, 
since  both  angles  are  between  0  and  90°.  The  graph  would  be  a  straight  line  along  the  diagonal 
of  the  graph.  (2)  The  graph  is  farthest  from  the  diagonal  when  the  angles  are  large,  i.e.,  when 
the  ray  strikes  the  interface  at  a  grazing  angle. 

Page  785:  (1)  In  1,  the  rays  cross  the  image,  so  it’s  real.  In  2,  the  rays  only  appear  to  have 
come  from  the  image  point,  so  the  image  is  virtual.  (2)  A  rays  is  always  closer  to  the  normal  in 
the  medium  with  the  higher  index  of  refraction.  The  first  left  turn  makes  the  ray  closer  to  the 
normal,  which  is  what  should  happen  in  glass.  The  second  left  turn  makes  the  ray  farther  from 
the  normal,  and  that’s  what  should  happen  in  air.  (3)  Take  the  topmost  ray  as  an  example.  It 
will  still  take  two  right  turns,  but  since  it’s  entering  the  lens  at  a  steeper  angle,  it  will  also  leave 
at  a  steeper  angle.  Tracing  backward  to  the  image,  the  steeper  lines  will  meet  closer  to  the  lens. 

Page  793:  It  would  have  to  have  a  wavelength  on  the  order  of  centimeters  or  meters,  the  same 
distance  scale  as  that  of  your  body.  These  would  be  microwaves  or  radio  waves.  (This  effect  can 
easily  be  noticed  when  a  person  affects  a  TV’s  reception  by  standing  near  the  antenna.)  None 
of  this  contradicts  the  correspondence  principle,  which  only  states  that  the  wave  model  must 
agree  with  the  ray  model  when  the  ray  model  is  applicable.  The  ray  model  is  not  applicable 
here  because  X/d  is  on  the  order  of  1. 

Page  795:  At  this  point,  both  waves  would  have  traveled  nine  and  a  half  wavelengths.  They 
would  both  be  at  a  negative  extreme,  so  there  would  be  constructive  interference. 

Page  799:  Judging  by  the  distance  from  one  bright  wave  crest  to  the  next,  the  wavelength 
appears  to  be  about  2/3  or  3/4  as  great  as  the  width  of  the  slit. 

Page  800:  Since  the  wavelengths  of  radio  waves  are  thousands  of  times  longer,  diffraction  causes 
the  resolution  of  a  radio  telescope  to  be  thousands  of  times  worse,  all  other  things  being  equal. 
(To  compensate  for  the  wavelength,  it’s  desirable  to  make  the  telescope  very  large,  as  in  figure 
z  on  page  800.) 

Answers  to  Self-Checks  for  Chapter  13 

Page  834:  (1)  Most  people  would  think  they  were  positively  correlated,  but  it’s  possible  that 
they’re  independent.  (2)  These  must  be  independent,  since  there  is  no  possible  physical  mecha¬ 
nism  that  could  make  one  have  any  effect  on  the  other.  (3)  These  cannot  be  independent,  since 
dying  today  guarantees  that  you  won’t  die  tomorrow. 
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Page  836:  The  area  under  the  curve  from  130  to  135  cm  is  about  3/4  of  a  rectangle.  The  area 
from  135  to  140  cm  is  about  1.5  rectangles.  The  number  of  people  in  the  second  range  is  about 
twice  as  much.  We  could  have  converted  these  to  actual  probabilities  (1  rectangle  =  5  cm  x 
0.005  cnr1  =  0.025),  but  that  would  have  been  pointless  because  we  were  just  going  to  compare 
the  two  areas. 

Page  841:  On  the  left-hand  side,  dlV  is  a  unitless  count,  and  df  is  an  infinitesimal  amount  of 
time,  with  units  of  seconds,  so  the  units  are  s-1  as  claimed.  On  the  right,  both  1V(0)  and  the 
exponential  factor  are  unitless,  so  the  only  units  come  from  the  factor  of  1/r,  which  again  has 
units  of  s-1. 

Page  850:  The  axes  of  the  graph  are  frequency  and  photon  energy,  so  its  slope  is  Planck’s 
constant.  It  doesn’t  matter  if  you  graph  eAV  rather  than  W  +  eAV,  because  that  only  changes 
the  y-intercept,  not  the  slope. 

Page  863:  Wavelength  is  inversely  proportional  to  momentum,  so  to  produce  a  large  wavelength 
we  would  need  to  use  electrons  with  very  small  momenta  and  energies.  (In  practical  terms,  this 
isn’t  very  easy  to  do,  since  ripping  an  electron  out  of  an  object  is  a  violent  process,  and  it’s  not 
so  easy  to  calm  the  electrons  down  afterward.) 

Page  872:  Under  the  ordinary  circumstances  of  life,  the  accuracy  with  which  we  can  measure 
position  and  momentum  of  an  object  doesn’t  result  in  a  value  of  ApAx  that  is  anywhere  near 
the  tiny  order  of  magnitude  of  Planck’s  constant.  We  run  up  against  the  ordinary  limitations 
on  the  accuracy  of  our  measuring  techniques  long  before  the  uncertainty  principle  becomes  an 
issue. 

Page  876:  No.  The  equation  K  =  pi2 /2m.  is  nonrelativistic,  so  it  can’t  be  applied  to  an  electron 
moving  at  relativistic  speeds.  Photons  always  move  at  relativistic  speeds,  so  it  can’t  be  applied 
to  them  either. 

Page  878:  Dividing  by  Planck’s  constant,  a  small  number,  gives  a  large  negative  result  inside 
the  exponential,  so  the  probability  will  be  very  small. 

Page  899:  The  original  argument  was  that  a  kink  would  have  a  zero  wavelength,  which  would 
correspond  to  an  infinite  momentum  and  an  infinite  kinetic  energy,  and  that  would  violate 
conservation  of  energy.  But  the  kink  in  this  example  occurs  at  r  =  0,  which  is  right  on  top 
of  the  proton,  where  the  electrical  energy  —  ke2 /r  is  infinite  and  negative.  Since  the  electrical 
energy  is  negative  and  infinite,  we’re  actually  required  to  have  an  infinite  positive  kinetic  energy 
in  order  to  come  up  with  a  total  that  conserves  energy. 

Page  890:  If  you  trace  a  circle  going  around  the  center,  you  run  into  a  series  of  eight  complete 
wavelengths.  Its  angular  momentum  is  8 h. 

Page  894:  n  =  3,  £  =  0,  lz  =  0:  one  state;  n  =  3,  i  =  1,  iz  =  —1,  0,  or  1:  three  states;  n  =  3, 
t  =  2,  £z  =  —2,  —1,  0,  1,  or  2:  five  states 

Answers 

Answers  for  Chapter  2 


Page  124,  problem  37:  K  =  k\k2/{k\  +  h?)  =  1/(1  /k\  +  l/fo) 

Answers  for  Chapter  3 

Page  218,  problem  5:  After  the  collision  it  is  moving  at  1/3  of  its  initial  speed,  in  the  same 
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direction  it  was  initially  going  (it  “follows  through”). 

Page  225,  problem  41:  Q  =  \j\f2 

Page  225,  problem  43:  (a)  7  x  10-8  radians,  or  about  4  x  10-6  degrees. 

Page  227,  problem  51:  (a)  R  =  (2v2 / g)  sin  0  cos  0  (c)  45  ° 

Page  227,  problem  52:  (a)  The  optimal  angle  is  about  40°,  and  the  resulting  range  is  about 
124  meters,  which  is  about  the  length  of  a  home  run.  (b)  It  goes  about  9  meters  farther.  For 
comparison  with  reality,  the  stadium’s  web  site  claims  a  home  run  goes  about  11  meters  farther 
there  than  in  a  sea-level  stadium. 

Answers  for  Chapter  5 

Page  339,  problem  7:  (c)  n  «  16 

Page  339,  problem  9:  (a)  ~  2  —  10%  (b)  5%  (c)  The  high  end  for  the  body’s  actual  efficiency 
is  higher  than  the  limit  imposed  by  the  laws  of  thermodynamics.  However,  the  high  end  of 
the  1-5  watt  range  quoted  in  the  problem  probably  includes  large  people  who  aren’t  just  lying 
around.  Still,  it’s  impressive  that  the  human  body  comes  so  close  to  the  thermodynamic  limit. 

Page  340,  problem  10:  (a)  Looking  up  the  relevant  density  for  air,  and  converting  everything 
to  mks,  we  get  a  frequency  of  730  Hz.  This  is  on  the  right  order  of  magnitude,  which  is 
promising,  considering  the  crudeness  of  the  approximation,  (b)  This  brings  the  result  down  to 
400  Hz,  which  is  amazingly  close  to  the  observed  frequency  of  300  Hz. 

Answers  for  Chapter  6 

Page  382,  problem  9:  (b)  g/2 

Page  383,  problem  13:  Check:  The  actual  length  of  a  flute  is  about  66  cm  from  the  tip  of 
the  mouthpiece  to  the  end  of  the  bell. 

Page  382,  problem  8:  (a)  T  =  goj2r2 

Page  384,  problem  17:  (a)  /  =  4a/(l  +  a)2  (b)  v 2  =  \fv1V3 

Answers  for  Chapter  10 

Page  642,  problem  22:  (a)  E  =  2k\/R. 

Answers  for  Chapter  11 

Page  724,  problem  5:  (a)  I  =  Xv. 

Page  725,  problem  6:  (b)  2kI\l2L/c2R. 

Answers  for  Chapter  12 

Page  818,  problem  61:  f/e 

Answers  for  Chapter  13 

Page  940,  problem  40:  about  10-34 

Solutions 

Solutions  for  Chapter  0 

Page  48,  problem  6: 


134  mg  x 


10~3  g 
1  mg 


10  3  kg 
1  g 


=  1.34  x  10“4  kg 
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Page  48,  problem  8:  (a)  Let’s  do  10.0  g  and  1000  g.  The  arithmetic  mean  is  505  grams.  It 
comes  out  to  be  0.505  kg,  which  is  consistent,  (b)  The  geometric  mean  comes  out  to  be  100 
g  or  0.1  kg,  which  is  consistent,  (c)  If  we  multiply  meters  by  meters,  we  get  square  meters. 
Multiplying  grams  by  grams  should  give  square  grams!  This  sounds  strange,  but  it  makes  sense. 
Taking  the  square  root  of  square  grams  (g2)  gives  grams  again,  (d)  No.  The  superduper  mean 
of  two  quantities  with  units  of  grams  wouldn’t  even  be  something  with  units  of  grams!  Related 
to  this  shortcoming  is  the  fact  that  the  superduper  mean  would  fail  the  kind  of  consistency  test 
carried  out  in  the  first  two  parts  of  the  problem. 

Page  49,  problem  12:  (a)  They’re  all  defined  in  terms  of  the  ratio  of  side  of  a  triangle  to 

another.  For  instance,  the  tangent  is  the  length  of  the  opposite  side  over  the  length  of  the 
adjacent  side.  Dividing  meters  by  meters  gives  a  unitless  result,  so  the  tangent,  as  well  as  the 
other  trig  functions,  is  unitless,  (b)  The  tangent  function  gives  a  unitless  result,  so  the  units  on 
the  right-hand  side  had  better  cancel  out.  They  do,  because  the  top  of  the  fraction  has  units  of 
meters  squared,  and  so  does  the  bottom. 

Page  50,  problem  17:  The  problem  requires  us  to  relate  a  and  t,  for  a  fixed  value  of  the 
distance  Ax.  To  find  a  relationship  among  these  three  variables,  we  start  with  cPx/  dt2  =  a, 
and  integrate  twice  to  find  Ax  =  ^ at 2.  This  tells  us  that  for  a  fixed  value  of  Ax,  we  have 
t  oc  1  / \Ja.  Decreasing  a  by  a  factor  of  3  means  that  t  will  increase  by  a  factor  of  \/3  =  1.7. 
(The  given  piece  of  data,  3,  only  has  one  sig  fig,  but  rounding  the  final  result  off  to  one  sig  fig, 
giving  2  rather  then  1.7,  would  be  a  little  too  severe.  As  discussed  in  section  0.1.10,  sig  figs  are 
only  a  rule  of  thumb,  and  when  in  doubt,  you  can  change  the  input  data  to  see  how  much  the 
output  would  have  changed.  The  ratio  of  the  gravitational  fields  on  Earth  and  Mars  must  be  in 
the  range  from  2.5  to  3.5,  since  otherwise  the  given  data  would  not  have  been  rounded  off  to  3. 
Using  this  range  of  inputs,  the  possible  range  of  values  for  the  final  result  becomes  1.6  to  1.9. 
The  final  digit  in  the  1.7  is  therefore  a  little  uncertain,  but  it’s  not  complete  garbage.  It  carries 
useful  information,  and  should  not  be  thrown  out.) 

Page  51,  problem  19:  (a)  Solving  for  Ax  =  \at2  for  a,  we  find  a  =  2Ax/t 2  =  5.51  m/s2, 

(b)  v  =  \/2aAx  =  66.6  m/s.  (c)  The  actual  car’s  final  velocity  is  less  than  that  of  the  idealized 
constant-acceleration  car.  If  the  real  car  and  the  idealized  car  covered  the  quarter  mile  in  the 
same  time  but  the  real  car  was  moving  more  slowly  at  the  end  than  the  idealized  one,  the  real 
car  must  have  been  going  faster  than  the  idealized  car  at  the  beginning  of  the  race.  The  real  car 
apparently  has  a  greater  acceleration  at  the  beginning,  and  less  acceleration  at  the  end.  This 
make  sense,  because  every  car  has  some  maximum  speed,  which  is  the  speed  beyond  which  it 
cannot  accelerate. 

Page  52,  problem  31: 

2  /  1  cm  \ 2  2  2 

1  nun  x  -  =  10  cm“ 

\  10  mm  / 

Page  52,  problem  32:  The  bigger  scope  has  a  diameter  that’s  ten  times  greater.  Area  scales 
as  the  square  of  the  linear  dimensions,  so  its  light-gathering  power  is  a  hundred  times  greater 
(10  x  10). 

Page  52,  problem  33:  Since  they  differ  by  two  steps  on  the  Richter  scale,  the  energy  of  the 
bigger  quake  is  10000  times  greater.  The  wave  forms  a  hemisphere,  and  the  surface  area  of  the 
hemisphere  over  which  the  energy  is  spread  is  proportional  to  the  square  of  its  radius.  If  the 
amount  of  vibration  was  the  same,  then  the  surface  areas  much  be  in  the  ratio  of  10000:1,  which 
means  that  the  ratio  of  the  radii  is  100:1. 
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Page  53,  problem  38:  The  cone  of  mixed  gin  and  vermouth  is  the  same  shape  as  the  cone 
of  vermouth,  but  its  linear  dimensions  are  doubled,  so  its  volume  is  8  times  greater.  The  ratio 
of  gin  to  vermouth  is  7  to  1. 

Page  53,  problem  40:  Scaling  down  the  linear  dimensions  by  a  factor  of  1/10  reduces  the 
volume  by  a  factor  of  (1/10)3  =  1/1000,  so  if  the  whole  cube  is  a  liter,  each  small  one  is  one 
milliliter. 

Page  54,  problem  41:  Let’s  estimate  the  Great  Wall’s  volume,  and  then  figure  out  how  many 
bricks  that  would  represent.  The  wall  is  famous  because  it  covers  pretty  much  all  of  China’s 
northern  border,  so  let’s  say  it’s  1000  km  long.  From  pictures,  it  looks  like  it’s  about  10  nr  high 
and  10  nr  wide,  so  the  total  volume  would  be  106  nr  x  10  nr  x  10  nr  =  108  nr3.  If  a  single  brick 
has  a  volume  of  1  liter,  or  10~3  nr3,  then  this  represents  about  1011  bricks.  If  one  person  can 
lay  10  bricks  in  an  hour  (taking  into  account  all  the  preparation,  etc.),  then  this  would  be  1010 
man-hours. 

Page  54,  problem  44:  Directly  guessing  the  number  of  jelly  beans  would  be  like  guessing 
volume  directly.  That  would  be  a  mistake.  Instead,  we  start  by  estimating  the  linear  dimensions, 
in  units  of  beans.  The  contents  of  the  jar  look  like  they’re  about  10  beans  deep.  Although  the 
jar  is  a  cylinder,  its  exact  geometrical  shape  doesn’t  really  matter  for  the  purposes  of  our  order- 
of-nragnitude  estimate.  Let’s  pretend  it’s  a  rectangular  jar.  The  horizontal  dimensions  are  also 
something  like  10  beans,  so  it  looks  like  the  jar  has  about  10  x  10  x  10  or  ~  103  beans  inside. 

Solutions  for  Chapter  1 

Page  71,  problem  12:  To  the  person  riding  the  moving  bike,  bug  A  is  simply  going  in  circles. 
The  only  difference  between  the  motions  of  the  two  wheels  is  that  one  is  traveling  through  space, 
but  motion  is  relative,  so  this  doesn’t  have  any  effect  on  the  bugs.  It’s  equally  hard  for  each  of 
them. 

Solutions  for  Chapter  2 

Page  118,  problem  1:  (a)  The  energy  stored  in  the  gasoline  is  being  changed  into  heat  via 

frictional  heating,  and  also  probably  into  sound  and  into  energy  of  water  waves.  Note  that  the 
kinetic  energy  of  the  propeller  and  the  boat  are  not  changing,  so  they  are  not  involved  in  the 
energy  transformation,  (b)  The  crusing  speed  would  be  greater  by  a  factor  of  the  cube  root  of 
2,  or  about  a  26%  increase. 

Page  118,  problem  2:  We  don’t  have  actual  masses  and  velocities  to  plug  in  to  the  equa¬ 
tion,  but  that’s  OK.  We  just  have  to  reason  in  terms  of  ratios  and  proportionalities.  Ki¬ 
netic  energy  is  proportional  to  mass  and  to  the  square  of  velocity,  so  B’s  kinetic  energy  equals 
(13.4  J)  (3.77) / (2.34)2  =  9.23  J. 

Page  118,  problem  3:  Room  temperature  is  about  20°C.  The  fraction  of  the  power  that 
actually  goes  into  heating  the  water  is 


(250  g)/(0.24  J/g°C)  x  (100°C-20°C)/126  s 
1.25  x  103  J/s 


So  roughly  half  of  the  energy  is  wasted.  The  wasted  energy  might  be  in  several  forms:  heating 
of  the  cup,  heating  of  the  oven  itself,  or  leakage  of  microwaves  from  the  oven. 
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Page  118,  problem  5: 


E total.i 

Ui  +  heati 

1  2 

-mv 


v 


Solutions  for  Chapter  3 

Page  218,  problem  4:  A  conservation  law  is  about  addition:  it  says  that  when  you  add  up 
a  certain  thing,  the  total  always  stays  the  same.  Funkosity  would  violate  the  additive  nature  of 
conservation  laws,  because  a  two-kilogram  mass  would  have  twice  as  much  funkosity  as  a  pair 
of  one- kilogram  masses  moving  at  the  same  speed. 

Page  219,  problem  12:  Momentum  is  a  vector.  The  total  momentum  of  the  molecules 
is  always  zero,  since  the  momenta  in  different  directions  cancal  out  on  the  average.  Cooling 
changes  individual  molecular  momenta,  but  not  the  total. 

Page  220,  problem  15:  a  =  An/A t,  and  also  a  =  F/m ,  so 


a 

m  An 

F 


(1000  kg)(50  m/s  —  20  m/s) 

““  3000  N 

=  10  s 

Page  221,  problem  23:  (a)  This  is  a  measure  of  the  box’s  resistance  to  a  change  in  its  state 
of  motion,  so  it  measures  the  box’s  mass.  The  experiment  would  come  out  the  same  in  lunar 
gravity. 

(b)  This  is  a  measure  of  how  much  gravitational  force  it  feels,  so  it’s  a  measure  of  weight.  In 
lunar  gravity,  the  box  would  make  a  softer  sound  when  it  hit. 

(c)  As  in  part  a,  this  is  a  measure  of  its  resistance  to  a  change  in  its  state  of  motion:  its  mass. 
Gravity  isn’t  involved  at  all. 

Page  223,  problem  34:  (a)  The  swimmer’s  acceleration  is  caused  by  the  water’s  force  on  the 
swimmer,  and  the  swimmer  makes  a  backward  force  on  the  water,  which  accelerates  the  water 
backward,  (b)  The  club’s  normal  force  on  the  ball  accelerates  the  ball,  and  the  ball  makes  a 
backward  normal  force  on  the  club,  which  decelerates  the  club,  (c)  The  bowstring’s  normal  force 
accelerates  the  arrow,  and  the  arrow  also  makes  a  backward  normal  force  on  the  string.  This 
force  on  the  string  causes  the  string  to  accelerate  less  rapidly  than  it  would  if  the  bow’s  force 
was  the  only  one  acting  on  it.  (d)  The  tracks’  backward  frictional  force  slows  the  locomotive 
down.  The  locomotive’s  forward  frictional  force  causes  the  whole  planet  earth  to  accelerate  by 
a  tiny  amount,  which  is  too  small  to  measure  because  the  earth’s  mass  is  so  great. 


E total, f 

=  Uf  +  heat  f  +  Kf 
=  Ui  —  Uf  +  heati  —  heatf 
=  —A  U  —  A  heat 


=  6.4  m/s 
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Page  224,  problem  37:  (a)  Spring  constants  in  parallel  add,  so  the  spring  constant  has  to 

be  proportional  to  the  cross-sectional  area.  Two  springs  in  series  give  half  the  spring  constant, 
three  springs  in  series  give  1/3,  and  so  on,  so  the  spring  constant  has  to  be  inversely  proportional 
to  the  length.  Summarizing,  we  have  k  oc  A/L. 

(b)  With  the  Young’s  modulus,  we  have  k  =  (A/L)E. The  spring  constant  has  units  of  N/m,  so 
the  units  of  E  would  have  to  be  N/ nr2 . 

Page  226,  problem  44:  By  conservation  of  momentum,  the  total  momenta  of  the  pieces  after 
the  explosion  is  the  same  as  the  momentum  of  the  firework  before  the  explosion.  However,  there 
is  no  law  of  conservation  of  kinetic  energy,  only  a  law  of  conservation  of  energy.  The  chemical 
energy  in  the  gunpowder  is  converted  into  heat  and  kinetic  energy  when  it  explodes.  All  we  can 
say  about  the  kinetic  energy  of  the  pieces  is  that  their  total  is  greater  than  the  kinetic  energy 
before  the  explosion. 

Page  226,  problem  45:  Let  rn  be  the  mass  of  the  little  puck  and  M  =  2.3m  be  the  mass 
of  the  big  one.  All  we  need  to  do  is  find  the  direction  of  the  total  momentum  vector  before 
the  collision,  because  the  total  momentum  vector  is  the  same  after  the  collision.  Given  the  two 
components  of  the  momentum  vector  px  =  mv  and  py  =  Mv,  the  direction  of  the  vector  is 
tan ~1{py/px)  =  23°  counterclockwise  from  the  big  puck’s  original  direction  of  motion. 

Page  229,  problem  62:  We  want  to  find  out  about  the  velocity  vector  vbg  °f  the  bullet 
relative  to  the  ground,  so  we  need  to  add  Annie’s  velocity  relative  to  the  ground  vag  to  the 
bullet’s  velocity  vector  vba  relative  to  her.  Letting  the  positive  x  axis  be  east  and  y  north,  we 
have 


vba,x  =  (140  mi/hr)  cos  45° 

=  100  mi/hr 

vsA,y  =  (140  mi/hr)  sin  45° 

=  100  mi/hr 

and 

vag,x  =  0 
VAG,y  =  30  mi/hr. 

The  bullet’s  velocity  relative  to  the  ground  therefore  has  components 

vbg,x  =  100  mi/hr 


and 


vBG,y  =  130  mi/hr. 


Its  speed  on  impact  with  the  animal  is  the  magnitude  of  this  vector 

|v£g|  =  \J (100  mi/hr)2  +  (130  mi/hr)2 
=  160  mi/hr 

(rounded  off  to  two  significant  figures). 
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Page  229,  problem  63:  Since  its  velocity  vector  is  constant,  it  has  zero  acceleration,  and  the 
sum  of  the  force  vectors  acting  on  it  must  be  zero.  There  are  three  forces  acting  on  the  plane: 
thrust,  lift,  and  gravity.  We  are  given  the  first  two,  and  if  we  can  find  the  third  we  can  infer  the 
plane’s  mass.  The  sum  of  the  y  components  of  the  forces  is  zero,  so 


0  -  Fth/rust,y  T  Fjiff  y  T  Fgy 

=  \Fthrust\ sin  #  +  \F nft  \  cos  9  -  mg. 


The  mass  is 


m  =  {\Fthrust\ sin#  +  \FHft\  cos 0)/g 
=  7.0  x  104  kg. 

Page  229,  problem  64:  (a)  Since  the  wagon  has  no  acceleration,  the  total  forces  in  both  the 
x  and  y  directions  must  be  zero.  There  are  three  forces  acting  on  the  wagon:  T,  Fg,  and  the 
normal  force  from  the  ground,  Fn.  If  we  pick  a  coordinate  system  with  x  being  horizontal  and  y 
vertical,  then  the  angles  of  these  forces  measured  counterclockwise  from  the  x  axis  are  90°  —  </>, 
270°,  and  90°  +  #,  respectively.  We  have 

FXy total  =  F  cos(90°  -  4>)  +  Fg  cos(270°)  +  Fncos( 90°  +  0) 

Fy total  =  T  sin(90°  -  (f>)  +  Fg  sin(270°)  +  Fnsin( 90°  +  6), 


which  simplifies  to 


0  =  T  sin  (j>  —  Fn  sin  9 
0  =  T  cos  4>  —  Fg  +  Fn  cos  9. 


The  normal  force  is  a  quantity  that  we  are  not  given  and  do  not  wish  to  find,  so  we  should 
choose  it  to  eliminate.  Solving  the  first  equation  for  Fn  =  (sin  cf>/  sin  9)T,  we  eliminate  Fn  from 
the  second  equation, 

0  =  T  cos  4>  —  Fg  +  T  sin  (f>  cos  9/  sin  9 

and  solve  for  T,  finding 

T? 

rp  _  _ f_g _ 

cos  <f>  +  sin  cj)  cos  9/  sin  9 

Multiplying  both  the  top  and  the  bottom  of  the  fraction  by  sin#,  and  using  the  trig  identity  for 
sin(#  +  (j>)  gives  the  desired  result, 


T  = 


sin  # 

sin(#  +  (/>)  gS 


(b)  The  case  of  4>  =  0,  i.e.  pulling  straight  up  on  the  wagon,  results  in  T  =  Fg:  we  simply 
support  the  wagon  and  it  glides  up  the  slope  like  a  chair-lift  on  a  ski  slope.  In  the  case  of 
4>  =  180°  —  #,  T  becomes  infinite.  Physically  this  is  because  we  are  pulling  directly  into  the 
ground,  so  no  amount  of  force  will  suffice. 

Page  230,  problem  65:  (a)  If  there  was  no  friction,  the  angle  of  repose  would  be  zero,  so  the 
coefficient  of  static  friction,  gs,  will  definitely  matter.  We  also  make  up  symbols  #,  m  and  g  for 
the  angle  of  the  slope,  the  mass  of  the  object,  and  the  acceleration  of  gravity.  The  forces  form  a 
triangle  just  like  the  one  in  example  68  on  page  203,  but  instead  of  a  force  applied  by  an  external 
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object,  we  have  static  friction,  which  is  less  than  fisFn.  As  in  that  example,  Fs  =  mg  sin  0,  and 
Fs  ^  [J’sFm  so 

mg  sin  0  <  gsFn. 

From  the  same  triangle,  we  have  Fn  =  mgcosd ,  so 

mg  sin  6  <  g,smg  cos  8 . 


Rearranging, 


9  <  tan  1  fis. 


(b)  Both  m  and  g  canceled  out,  so  the  angle  of  repose  would  be  the  same  on  an  asteroid. 

Solutions  for  Chapter  4 

Page  288,  problem  1:  The  pliers  are  not  moving,  so  their  angular  momentum  remains 
constant  at  zero,  and  the  total  torque  on  them  must  be  zero.  Not  only  that,  but  each  half  of 
the  pliers  must  have  zero  total  torque  on  it.  This  tells  us  that  the  magnitude  of  the  torque  at 
one  end  must  be  the  same  as  that  at  the  other  end.  The  distance  from  the  axis  to  the  nut  is 
about  2.5  cm,  and  the  distance  from  the  axis  to  the  centers  of  the  palm  and  fingers  are  about  8 
cm.  The  angles  are  close  enough  to  90°that  we  can  pretend  they’re  90  degrees,  considering  the 
rough  nature  of  the  other  assumptions  and  measurements.  The  result  is  (300  N)(2.5  cm)=(i?)(8 
cm),  or  F=90  N. 

Page  294,  problem  37:  The  foot  of  the  rod  is  moving  in  a  circle  relative  to  the  center  of  the 
rod,  with  speed  v  =  irb/T,  and  acceleration  v2/(b/ 2)  =  (ir2/8)g.  This  acceleration  is  initially 
upward,  and  is  greater  in  magnitude  than  g,  so  the  foot  of  the  rod  will  lift  off  without  dragging. 
We  could  also  worry  about  whether  the  foot  of  the  rod  would  make  contact  with  the  floor  again 
before  the  rod  finishes  up  flat  on  its  back.  This  is  a  question  that  can  be  settled  by  graphing, 
or  simply  by  inspection  of  figure  i  on  page  276.  The  key  here  is  that  the  two  parts  of  the 
acceleration  are  both  independent  of  m  and  b,  so  the  result  is  univeral,  and  it  does  suffice  to 
check  a  graph  in  a  single  example.  In  practical  terms,  this  tells  us  something  about  how  difficult 
the  trick  is  to  do.  Because  7r2/8  =  1.23  isn’t  much  greater  than  unity,  a  hit  that  is  just  a  little 
too  weak  (by  a  factor  of  1.231//2  =  1.11)  will  cause  a  fairly  obvious  qualitative  change  in  the 
results.  This  is  easily  observed  if  you  try  it  a  few  times  with  a  pencil. 

Page  296,  problem  45:  The  moment  of  inertia  is  I  =  f  r 2  dm.  Let  the  ring  have  total  mass 
M  and  radius  b.  The  proportionality 

gives  a  change  of  variable  that  results  in 

I  ■ 

2tt  Jo 


M  dm 
27 r  d0 

M  /'27r  . 


If  we  measure  8  from  the  axis  of  rotation,  then  r  =  6sin0,  so  this  becomes 


I  = 


Mb2 

2ir 


sin2  9  dd. 


The  integrand  averages  to  1/2  over  the  27t  range  of  integration,  so  the  integral  equals  n.  We 
therefore  have  /  =  | Mb 2 .  This  is,  as  claimed,  half  the  value  for  rotation  about  the  symmetry 
axis. 
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Solutions  for  Chapter  5 


Page  340,  problem  11:  (a)  We  have 


dP  =  pg  d  y 


and  since  we’re  taking  water  to  be  incompressible,  and  g  doesn’t  change  very  much  over  11  km 
of  height,  we  can  treat  p  and  g  as  constants  and  take  them  outside  the  integral. 

A  P  =  pgAy 

=  (1.0  g/cm3)(9.8  m/s2)(11.0  km) 

=  (1.0  x  103  kg/m3) (9.8  m/s2)(1.10  x  104  nr) 

=  1.0  x  108  Pa 
=  1.0  x  103  atm. 


The  precision  of  the  result  is  limited  to  a  few  percent,  due  to  the  compressibility  of  the  water, 
so  we  have  at  most  two  significant  figures.  If  the  change  in  pressure  were  exactly  a  thousand 
atmospheres,  then  the  pressure  at  the  bottom  would  be  1001  atmospheres;  however,  this  dis¬ 
tinction  is  not  relevant  at  the  level  of  approximation  we’re  attempting  here. 

(b)  Since  the  air  in  the  bubble  is  in  thermal  contact  with  the  water,  it’s  reasonable  to  assume 
that  it  keeps  the  same  temperature  the  whole  time.  The  ideal  gas  law  is  PV  =  nkT ,  and 
rewriting  this  as  a  proportionality  gives 

V  (xP-1, 


or 


Yl 

Vi 


103. 


Since  the  volume  is  proportional  to  the  cube  of  the  linear  dimensions,  the  growth  in  radius  is 
about  a  factor  of  10. 


Page  340,  problem  12:  (a)  Roughly  speaking,  the  thermal  energy  is  ~  ksT  (where  ks  is  the 
Boltzmann  constant),  and  we  need  this  to  be  on  the  same  order  of  magnitude  as  ke2 /r  (where 
k  is  the  Coulomb  constant).  For  this  type  of  rough  estimate  it’s  not  especially  crucial  to  get 
all  the  factors  of  two  right,  but  let’s  do  so  anyway.  Each  proton’s  average  kinetic  energy  due  to 
motion  along  a  particular  axis  is  (1/2 )ksT.  If  two  protons  are  colliding  along  a  certain  line  in 
the  center-of-mass  frame,  then  their  average  combined  kinetic  energy  due  to  motion  along  that 
axis  is  2(1/2 )ksT  =  ksT.  So  in  fact  the  factors  of  2  cancel.  We  have  T  =  ke2 /ksr. 

(b)  The  units  are  K  =  (J-m/C2)(C2)/((J/K)-m),  which  does  work  out. 

(c)  The  numerical  result  is  ~  1010  K,  which  as  suggested  is  much  higher  than  the  temperature 
at  the  core  of  the  sun. 


Page  340,  problem  13:  If  the  full-sized  brick  A  undergoes  some  process,  such  as  heating  it 
with  a  blowtorch,  then  we  want  to  be  able  to  apply  the  equation  AS  =  Q/T  to  either  the  whole 
brick  or  half  of  it,  which  would  be  identical  to  B.  When  we  redefine  the  boundary  of  the  system 
to  contain  only  half  of  the  brick,  the  quantities  AS  and  Q  are  each  half  as  big,  because  entropy 
and  energy  are  additive  quantities.  T,  meanwhile,  stays  the  same,  because  temperature  isn’t 
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additive  —  two  cups  of  coffee  aren’t  twice  as  hot  as  one.  These  changes  to  the  variables  leave 
the  equation  consistent,  since  each  side  has  been  divided  by  2. 

Page  341,  problem  14:  (a)  If  the  expression  1  +  by  is  to  make  sense,  then  by  has  to  be 

unitless,  so  b  has  units  of  m_1.  The  input  to  the  exponential  function  also  has  to  be  unitless, 
so  k  also  has  of  m_1.  The  only  factor  with  units  on  the  right-hand  side  is  PQ,  so  PQ  must  have 
units  of  pressure,  or  Pa. 

(b) 


d  P  =  pg  d  y 

Id  P 

P  =  --T- 
9  d  y 

=  Ege-ky 

g 


(—k  —  kby  +  b) 


(c)  The  three  terms  inside  the  parentheses  on  the  right  all  have  units  of  m-1,  so  it  makes  sense 
to  add  them,  and  the  factor  in  parentheses  has  those  units.  The  units  of  the  result  from  b  then 
look  like 


kg 

m3 


Pa  _i 

- 1T1 

m/s2 
N/m2 
m2/s2 
kg-rn'1  -s~2 
m2/s2  ’ 


which  checks  out. 


Solutions  for  Chapter  7 

Page  447,  problem  17:  (a)  Plugging  in,  we  find 


1  —  w 
1  +  w 

(b)  First  let’s  simplify  by  squaring  both  sides. 


1  —  u 
1  +  u 


1  —  V 

l+v 


1  —  w  1  —  u  1  —  V 
1  +  rc  1  +  u  l  +  v 

For  convenience,  let’s  write  A  for  the  right-hand  side  of  this  equation.  We  then  have 


1  +  w 

1  —  w  =  A  +  Aid. 


Solving  for  w, 

1  —  A 


(1  +  u)(l  +  v)  -  (1  -  it)(l  -  v) 
(1  +  u)(l  +  v)  +  (1  -  u)(l  -  v) 
2  (u  +  v ) 

2(1  +  uv) 
u  +  v 
1  +  uv 
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(c)  This  is  all  in  units  where  c  =  1.  The  correspondence  principle  says  that  we  should  get 
w  ~  u  +  v  when  both  u  and  v  are  small  compared  to  1.  Under  those  circumstances,  uv  is  the 
product  of  two  very  small  numbers,  which  makes  it  very,  very  small.  Neglecting  this  term  in 
the  denominator,  we  recover  the  nonrelativistic  result. 

Page  447,  problem  18:  Among  the  spacelike  vectors,  a  and  e  are  clearly  congruent,  because 
they’re  the  same  except  for  a  rotation  in  space;  this  is  the  same  as  the  definition  of  congruence 
in  ordinary  Euclidean  geometry,  where  rotation  doesn’t  matter.  Vector  b  is  also  congruent  to 
these,  since  it  represents  an  interval  32  —  52  =  — 42,  just  like  the  other  two. 

The  lightlike  vectors  c  and  d  both  represent  intervals  of  zero,  so  they’re  congruent,  even 
though  c  is  a  double-scale  version  of  d. 

The  timelike  vectors  f  and  g  are  not  congruent  to  each  other  or  to  any  of  the  others;  f 
represents  an  interval  of  22,  while  g’s  interval  is  42. 

Page  448,  problem  22:  At  the  center  of  each  of  the  large  triangle’s  sides,  the  angles  add  up 
to  180°  because  they  form  a  straight  line.  Therefore  4s  =  5  +  3  x  180°,  so  5  —  180°  =  4(s  —  180°). 

Page  449,  problem  28:  By  the  equivalence  principle,  we  can  adopt  a  frame  tied  to  the  tossed 
clock,  B,  and  in  this  frame  there  is  no  gravitational  field.  We  see  a  desk  and  clock  A  go  by. 
The  desk  applies  a  force  to  clock  A,  decelerating  it  and  then  reaccelerating  it  so  that  it  comes 
back.  We’ve  already  established  that  the  effect  of  motion  is  to  slow  down  time,  so  clock  A  reads 
a  smaller  time  interval. 

Hints  for  Chapter  8 

Page  512,  problem  15:  The  force  on  the  lithium  ion  is  the  vector  sum  of  all  the  forces  of 
all  the  quadrillions  of  sodium  and  chlorine  atoms,  which  would  obviously  be  too  laborious  to 
calculate.  Nearly  all  of  these  forces,  however,  are  canceled  by  a  force  from  an  ion  on  the  opposite 
side  of  the  lithium. 

Solutions  for  Chapter  9 

Page  550,  problem  1:  At  =  A q/I  =  e/I  =  0.16  //s 

Page  551,  problem  12:  In  series,  they  give  11  kfi.  In  parallel,  they  give  (1/1  kfl  + 
1/10  Ml)”1  =  0.9  kfi. 

Page  554,  problem  25:  The  actual  shape  is  irrelevant;  all  we  care  about  is  what’s  connected  to 
what.  Therefore,  we  can  draw  the  circuit  flattened  into  a  plane.  Every  vertex  of  the  tetrahedron 
is  adjacent  to  every  other  vertex,  so  any  two  vertices  to  which  we  connect  will  give  the  same 
resistance.  Picking  two  arbitrarily,  we  have  this: 


This  is  unfortunately  a  circuit  that  cannot  be  converted  into  parallel  and  series  parts,  and 
that’s  what  makes  this  a  hard  problem!  However,  we  can  recognize  that  by  symmetry,  there 
is  zero  current  in  the  resistor  marked  with  an  asterisk.  Eliminating  this  one,  we  recognize  the 
whole  arrangement  as  a  triple  parallel  circuit  consisting  of  resistances  R,  2 R,  and  2 R.  The 
resulting  resistance  is  R/2. 
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Page  555,  problem  29:  (a)  Conservation  of  energy  gives 


UA  =  UB  +  Kb 
Kb  =  U a-  Ub 


-mv2  =  eAV 


v  = 


2eAV 


m 


(b)  Plugging  in  numbers,  we  get  5.9  x  10'  m/s.  This  is  about  20%  of  the  speed  of 
nonrelativistic  assumption  was  good  to  at  least  a  rough  approximation. 

Page  556,  problem  32:  It’s  much  more  practical  to  measure  voltage  differences, 
a  current,  you  have  to  break  the  circuit  somewhere  and  insert  the  meter  there, 
possible  to  disconnect  the  circuits  sealed  inside  the  board. 

Solutions  for  Chapter  10 

Page  640,  problem  16:  By  symmetry,  the  field  is  always  directly  toward  or  away  from  the 
center.  We  can  therefore  calculate  it  along  the  x  axis,  where  r  =  x,  and  the  result  will  be  valid 
for  any  location  at  that  distance  from  the  center. 


light,  so  the 

To  measure 
but  it’s  not 


dV 

dx 

d 


dx  v 
=  x~2e~x 


-v*) 

+  x~1e~x 


At  small  x,  near  the  proton,  the  first  term  dominates,  and  the  exponential  is  essentially  1,  so  we 
have  E  oc  x~2,  as  we  expect  from  the  Coulomb  force  law.  At  large  x,  the  second  term  dominates, 
and  the  held  approaches  zero  faster  than  an  exponential. 

Page  648,  problem  56: 

sin  (a  +  b)  =  (V(°+b)  -  e~^a+b^  /2 i 

=  (eiaeib  -  e~iae-ib^j  /2i 

=  [(cos  a  +  i  sin  a)  (cos  b  +  isinb)  —  (cos  a  —  i  sin  a)  (cos  b  —  isin&)]  /2  i 
=  cos  a  sin  b  +  sin  a  cos  b 

By  a  similar  computation,  we  find  cos(a  +  b)  =  cos  a  cos  b  —  sin  a  sin  b. 

Page  648,  problem  57:  If  z3  =  1,  then  we  know  that  \z\  =  1,  since  cubing  z  cubes  its 
magnitude.  Cubing  z  triples  its  argument,  so  the  argument  of  z  must  be  a  number  that,  when 
tripled,  is  equivalent  to  an  angle  of  zero.  There  are  three  possibilities:  0x3  =  0,  (27r/3)  x  3  =  27t, 
and  (47r/3)  x  3  =  47T.  (Other  possibilities,  such  as  (327t/3),  are  equivalent  to  one  of  these.)  The 
solutions  are: 

z  =  l,  e2™/3,  e4™/3 

Page  648,  problem  58:  We  can  think  of  this  as  a  polynomial  in  x  or  a  polynomial  in  y 
their  roles  are  symmetric.  Let’s  call  x  the  variable.  By  the  fundamental  theorem  of  algebra,  it 
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must  be  possible  to  factor  it  into  a  product  of  three  linear  factors,  if  the  coefficients  are  allowed 
to  be  complex.  Each  of  these  factors  causes  the  product  to  be  zero  for  a  certain  value  of  x.  But 
the  condition  for  the  expression  to  be  zero  is  x 3  =  y3,  which  basically  means  that  the  ratio  of  x 
to  y  must  be  a  third  root  of  1.  The  problem,  then,  boils  down  to  finding  the  three  third  roots 
of  1,  as  in  problem  57.  Using  the  result  of  that  problem,  we  find  that  there  are  zeroes  when  x/y 
equals  1,  e27™/3,  and  e47U/3.  This  tells  us  that  the  factorization  is  (x  —  y)(x  —  e2m/3y)(x  —  e4m/3y) . 

The  second  part  of  the  problem  asks  us  to  factorize  as  much  as  possible  using  real  coefficients. 
Our  only  hope  of  doing  this  is  to  multiply  out  the  two  factors  that  involve  complex  coefficients, 
and  see  if  they  produce  something  real.  In  fact,  we  can  anticipate  that  it  will  work,  because 
the  coefficients  are  complex  conjugates  of  one  another,  and  when  a  quadratic  has  two  complex 
roots,  they  are  conjugates.  The  result  is  (x  —  y)(x2  +  xy  +  y2). 

Solutions  for  Chapter  11 

Page  735,  problem  51:  (a)  For  a  material  object,  p  =  mv.  The  velocity  vector  reverses 

itself,  but  mass  is  still  positive,  so  the  momentum  vector  is  reversed. 

(b)  In  the  forward-time  universe,  conservation  of  momentum  is  pgi  +  p2,i  =  pi,/  +  P2,/-  In  the 
backward-time  universe,  all  the  momenta  are  reversed,  but  that  just  negates  both  sides  of  the 
equation,  so  momentum  is  still  conserved. 

Page  737,  problem  54:  Note  that  in  the  Biot-Savart  law,  the  variable  r  is  defined  as  a  vector 
that  points  from  the  current  to  the  point  at  which  the  field  is  being  calculated,  whereas  in  the 
polar  coordinates  used  to  express  the  equation  of  the  spiral,  the  vector  more  naturally  points 
the  opposite  way.  This  requires  some  fiddling  with  signs,  which  I’ll  suppress,  and  simply  identify 
d t  with  dr. 


kl  f  d£  x  r 

c2  J  r 3 


The  vector  dr  has  components  dx  =  w(cos9  —  9  sin  9)  and  d  y  =  w(sin9  +  9  cos  9).  Evaluating 
the  vector  cross  product,  and  substituting  9/w  for  r,  we  find 


kl  f  6  (cos  9  sin  9 
C2W  J 

ki  r  de 

c2w  J  9 
kl  .  9-2 

~2  n  7T 
czw  9 1 

kl  ,  b 
—  In  - 
czw  a 


9  sin2  9  —  cos  9  sin  9  —  9  cos2  9)  d# 
¥ 


Solutions  for  Chapter  12 

Page  806,  problem  16:  See  the  ray  diagram  below.  Decreasing  90  decreases  9{,  so  the 
equation  9f  =  ±9t  +  ±90  must  have  opposite  signs  on  the  right.  Since  90  is  bigger  than  the 
only  way  to  get  a  positive  9f  is  if  the  signs  are  9 /  =  —9i  +  90.  This  gives  1//  =  —1/di  +  1  /dQ. 
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Page  806,  problem  19:  (a)  The  object  distance  is  less  than  the  focal  length,  so  the  image  is 
virtual:  because  the  object  is  so  close,  the  cone  of  rays  is  diverging  too  strongly  for  the  mirror  to 
bring  it  back  to  a  focus,  (b)  At  an  object  distance  of  30  cm,  it’s  clearly  going  to  be  real.  With 
the  object  distance  of  20  cm,  we’re  right  at  the  crossing-point  between  real  and  virtual.  For 
this  object  position,  the  reflected  rays  will  be  parallel.  We  could  consider  this  to  be  an  image 
at  infinity.  (c),(d)  A  diverging  mirror  can  only  make  virtual  images. 

Page  810,  problem  39:  Since  d0  is  much  greater  than  di,  the  lens-film  distance  di  is  es¬ 
sentially  the  same  as  /.  (a)  Splitting  the  triangle  inside  the  camera  into  two  right  triangles, 
straightforward  trigonometry  gives 

9  =  2  tan-1 

for  the  field  of  view.  This  comes  out  to  be  39°  and  64°  for  the  two  lenses,  (b)  For  small  angles, 
the  tangent  is  approximately  the  same  as  the  angle  itself,  provided  we  measure  everything  in 
radians.  The  equation  above  then  simplifies  to 


The  results  for  the  two  lenses  are  .70  rad  =  40°,  and  1.25  rad  =  72°.  This  is  a  decent  approxi¬ 
mation. 

(c)  With  the  28-nrm  lens,  which  is  closer  to  the  film,  the  entire  field  of  view  we  had  with  the 
50-rnrn  lens  is  now  confined  to  a  small  part  of  the  film.  Using  our  small-angle  approximation 
9  =  w/f,  the  amount  of  light  contained  within  the  same  angular  width  9  is  now  striking  a  piece 
of  the  film  whose  linear  dimensions  are  smaller  by  the  ratio  28/50.  Area  depends  on  the  square 
of  the  linear  dimensions,  so  all  other  things  being  equal,  the  film  would  now  be  overexposed  by 
a  factor  of  (50/28)2  =  3.2.  To  compensate,  we  need  to  shorten  the  exposure  by  a  factor  of  3.2. 

Page  818,  problem  59:  One  surface  is  curved  outward  and  one  inward.  Therefore  the  minus 
sign  applies  in  the  lensmaker’s  equation.  Since  the  radii  of  curvature  are  equal,  the  quantity 
1/ri  —  l/r2  equals  zero,  and  the  resulting  focal  length  is  infinite.  A  big  focal  length  indicates  a 
weak  lens.  An  infinite  focal  length  tells  us  that  the  lens  is  infinitely  weak  —  it  doesn’t  focus  or 
defocus  rays  at  all. 

Solutions  for  Chapter  13 

Page  942,  problem  48:  The  expressions  |T|2  and  |T2|  are  identical,  because  the  magnitude  of 
a  product  is  the  product  of  the  magnitudes.  These  expressions  give  positive  real  numbers  as  their 
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results,  which  makes  sense  for  a  probability  density.  The  expression  'k2  need  not  be  real,  and  if 
it  is  real,  it  may  be  negative.  It  cannot  be  interpreted  as  a  probability  density.  As  a  concrete 
example,  suppose  that  \k  =  hi .  where  b  is  a  real  number  with  units.  Then  (T)2  =  |T2|  =  ft2, 
which  is  real  and  positive,  but  T2  =  —  b2,  which  clearly  can’t  be  interpreted  probabilistically, 
because  it’s  negative. 

Page  942,  problem  49:  (a)  The  quantity  x  —  y  vanishes  along  the  line  y  =  x  lying  in  the 

first  quadrant  at  a  45-degree  angle  between  the  axes.  Squaring  produces  a  trough  parallel  to 
this  line,  with  a  parabolic  cross-section.  Geometrically,  the  Laplacian  can  be  interpreted  as  a 
measure  of  how  much  the  value  of  /  at  a  point  differs  from  its  average  value  on  a  small  circle 
centered  on  that  point.  The  trough  is  concave  up,  so  we  can  predict  that  the  Laplacian  will  be 
positive  everywhere. 

(b)  The  zero  result  is  clearly  wrong  because  it  disagrees  with  our  conclusion  from  part  a  that 
the  Laplacian  is  positive.  A  correct  calculation  gives  d2(x  —  y)2 /dx2  +  d2(x  —  y)2 /dy2  =  4. 

(c)  If  we  rotate  our  coordinate  axes  counterclockwise  by  45  degrees,  then  we  have  a  parabolic 
trough  oriented  along  the  x  axis.  In  terms  of  these  new  coordinates,  d f  /dx  =  0,  while  df  /dy  is 
nonzero  almost  everywhere. 

Remark:  The  mistake  described  in  the  question  is  a  common  one,  and  is  apparently  based  on 
the  idea  that  the  notation  V2  must  mean  applying  an  operator  V  twice.  For  those  with  some 
exposure  to  vector  calculus,  it  may  be  of  interest  to  note  that  the  Laplacian  is  equivalent  to  the 
divergence  of  the  gradient,  which  can  be  notated  either  div(grad/)  or  V  •  (V/).  The  important 
thing  to  recognize  is  that  the  gradient,  notated  grad  /  or  V/,  outputs  a  vector ,  not  a  scalar  like 
the  quantity  Q  defined  in  this  problem. 
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.0.6  Notation  and  terminology,  compared  with  other  books 

Almost  all  the  notation  and  terminology  in  Simple  Nature  is  standard,  but  there  are  some 
cases  where  there  is  no  universal  standard,  and  a  very  few  cases  where  I’ve  intentionally  deviated 
from  a  universal  standard.  The  notation  used  by  physicists  is  also  different  from  that  used  by 
electrical  and  mechanical  engineers;  I  use  physics  terminology  and  notation  (notably  y/—l  = 
i,  not  j,  and  “torque”  rather  than  “moment”),  but  employ  the  SI  system  of  units  used  in 
engineering,  rather  than  the  cgs  units  favored  by  some  physicists. 

Nonstandard  terminology: 

Potential  energy  is  referred  to  in  this  book  as  interaction  energy ,  or  according  to  its  type: 
gravitational  energy ,  electrical  energy ,  etc. 

The  potential,  in  an  electrical  context,  is  referred  to  as  voltage,  e.g.  I  say  that  V  =  kq/r  is 
the  voltage  surrounding  a  point  charge. 

Heat  and  thermal  energy  are  both  referred  to  as  heat.  This  is  in  keeping  with  casual  usage 
among  scientists,  but  formal  written  usage  dictates  the  use  of  “thermal  energy”  to  mean 
the  kinetic  energy  an  object  has  because  of  its  molecules’  random  motion,  while  “heat”  is 
the  transfer  of  thermal  energy. 

Notation  for  which  there  is  no  universal  standard: 

Kinetic  energy  is  written  K.  Standard  notation  is  K,  T,  or  KE. 

Interaction  energy  is  written  U.  Standard  notation  is  U,  V,  or  PE. 

The  unit  vectors  are  x,  y,  z.  Standard  notation  is  either  x,  y,z  or  i,j,k. 

Distance  from  an  axis  in  cylindrical  coordinates  is  R.  A  more  common  notation  in  math 
books  is  p,  but  this  would  conflict  with  the  standard  physics  notation  for  the  charge 
density. 

Vibrations  do  not  have  very  well  standardized  terminology  or  notation.  I  use  “frequency”  to 
refer  to  both  /  and  a depending  on  the  context  to  make  it  clear  which  is  meant.  The 
frequency  of  free,  damped  oscillations  is  ojf,  which  is  only  approximately  the  same  as 
oj0  =  \fk/m.  The  full  width  at  half-maximum  of  the  resonance  peak  (on  a  plot  of  energy 
versus  frequency)  is  A u. 

The  coupling  constants  for  electricity  and  magnetism  are  written  as  k  and  k/c2.  This  is 
standard  notation,  but  it  would  be  more  common  in  SI  calculations  to  see  everything 
expressed  in  terms  of  eQ  =  l/4irk  and  pQ  =  Airk/c2.  Numerically,  we  have  k  =  8.99  x 
109  N-m2/C2  and  /c/c2  =  10~'  N/A2,  the  latter  being  an  exact  relation. 


.0.7  Notation  and  units 


quantity 

unit 

symbol 

distance 

meter,  m 

x,  Ax 

time 

second,  s 

t ,  At 

mass 

kilogram,  kg 

m 

density 

kg/m3 

P 

force 

newton,  1  N=1  kg-rn/s2 

F 

velocity 

m/s 

V 

acceleration 

m/s2 

a 

gravitational  field 

J/kg-m  or  m/s2 

9 

energy 

joule,  J 

E  (also  electric  field) 

momentum 

kg-rn/s 

P 

angular  momentum 

kg-m2/s  or  J-s 

L  (also  inductance) 

power 

watt,  1  W  =  1  J/s 

P  (also  pressure) 

pressure 

1  Pa=l  N/m2 

P  (also  power) 

temperature 

K 

T  (also  period) 

period 

s 

T  (also  temperature) 

wavelength 

nr 

A 

frequency 

s_1  or  Hz 

/ 

charge 

coulomb,  C 

q 

voltage 

volt,  1  V  =  1  J/C 

V 

current 

ampere,  1  A  =  1  C/s 

i 

resistance 

ohm,  HI  =  1  V/A 

R 

capacitance 

farad,  1  F  =  1  C/V 

c 

inductance 

henry,  1  H  =  1  V-s/A 

L  (also  angular  momentum) 

electric  field 

V/m  or  N/C 

E  (also  energy) 

magnetic  field 

tesla,  1  T  =  1  N-s/C-m 

B 

focal  length 

nr 

f 

magnification 

unitless 

M 

index  of  refraction 

unitless 

n 

electron  wavefunction 

m-3/2 

'P 

.0.8  Fundamental  constants 


gravitational  constant 

G  =  6.67  x  10  11  J-m/kg2 

Boltzmann  constant 

k  =  1.38  x  10”23  J/K 

Coulomb  constant 

k  =  8.99  x  109  J-nr/C2  or  N-m2/C2 

quantum  of  charge 

e  =  1.60  x  10”19  C 

speed  of  light 

c  =  3.00  x  108  nr/s 

Planck’s  constant 

h  =  6.63  x  10“34  J-s 

Note  the  use  of  the  same  notation,  k,  for  both  the  Boltzmann  constant  and  the  Coulomb 
constant. 
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.0.9  Metric  prefixes 


M- 

mega- 

~W 

k- 

kilo- 

103 

m- 

rnilli- 

10~3 

fi-  (Greek  mu) 

micro- 

1(T6 

n- 

nano- 

10  9 

p- 

pico- 

10"12 

f- 

fernto- 

10”15 

Note  that  the  exponents  go  in  steps  of  three.  The  exception  is  centi-,  10  2,  which  is  used  only 
in  the  centimeter,  and  this  doesn’t  require  memorization,  because  a  cent  is  10~2  dollars. 

.0.10  Nonmetric  units 

Nonmetric  units  in  terms  of  metric  ones: _ 


1  inch 

=  25.4  mm  (by  definition) 

1  pound  (lb) 

=  4.5  newtons  of  force 

1  scientific  calorie 

=  4.18  J 

1  nutritional  calorie 

=  4.18  x  103  J 

1  gallon 

=  3.78  x  103  cm3 

1  horsepower 

=  746  W 

The  pound  is  a  unit  of  force,  so  it  converts  to  newtons,  not  kilograms.  A  one-kilogram  mass  at 
the  earth’s  surface  experiences  a  gravitational  force  of  (1  kg) (9.8  m/s2)  =  9.8  N  =  2.2  lb.  The 
nutritional  information  on  food  packaging  typically  gives  energies  in  units  of  calories,  but  those 
so-called  calories  are  really  kilocalories. 

Relationships  among  U.S.  units: 


1  foot  (ft) 

=  12  inches 

1  yard  (yd) 

=  3  feet 

1  mile  (mi) 

=  5280  feet 

1  ounce  (oz) 

=  1/16  pound 

.0.1 1  The  Greek  alphabet 


a 

A 

alpha 

L 

i 

iota 

P 

p 

rho 

P 

B 

beta 

K 

K 

kappa 

a 

X 

sigma 

7 

T 

gamma 

A 

A 

lambda 

T 

T 

tau 

5 

A 

delta 

A 

M 

mu 

V 

Y 

upsilon 

€ 

E 

epsilon 

V 

N 

nu 

<t> 

$ 

phi 

c 

Z 

zeta 

i 

xi 

X 

X 

chi 

V 

H 

eta 

o 

O 

omicron 

Y> 

'k 

psi 

e 

0 

theta 

7 r 

n 

Pi 

UJ 

n 

omega 

0.12  Subatomic  particles 


particle 

mass  (kg) 

charge 

radius  (fm) 

electron 

9.109  x  10”31 

— e 

<0.01 

proton 

1.673  x  10~27 

+e 

~  1.1 

neutron 

1.675  x  10-27 

0 

~  1.1 

neutrino 

~  10-39  kg  ? 

0 

? 

The  radii  of  protons  and  neutrons  can  only  be  given  approximately,  since  they  have  fuzzy 
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surfaces.  For  comparison,  a  typical  atom  is  about  a  million  fm  in  radius. 


.0.13  Earth,  moon,  and  sun 


body 

mass  (kg) 

radius  (km) 

radius  of  orbit  (km) 

earth 

moon 

sun 

5.97  x  1024 
7.35  x  1022 
1.99  x  1030 

6.4  x  103 

1.7  x  103 

7.0  x  105 

1.49  x  10s 

3.84  x  105 

.0.14  The  periodic  table 


1 

H 

2 

He 

3 

Li 

4 

Be 

5 

B 

6 

c 

7 

N 

8 

O 

9 

F 

10 

Ne 

11 

Na 

12 

Mg 

13 

Al 

14 

Si 

15 

P 

16 

s 

17 

Cl 

18 

Ar 

19 

K 

20 

Ca 

21 

Sc 

* 

** 

22 

Ti 

23 

V 

24 

Cr 

25 

Mn 

26 

Fe 

27 

Co 

28 

Ni 

29 

Cu 

30 

Zn 

31 

Ga 

32 

Ge 

33 

As 

34 

Se 

35 

Br 

36 

Kr 

37 

Rb 

38 

Sr 

39 

Y 

40 

Zr 

41 

Nb 

42 

Mo 

43 

Tc 

44 

Ru 

45 

Rh 

46 

Pd 

47 

Ag 

48 

Cd 

49 

In 

50 

Sn 

51 

Sb 

52 

Te 

53 

1 

54 

Xe 

55 

Cs 

56 

Ba 

57 

La 

72 

Hf 

73 

Ta 

74 

W 

75 

Re 

76 

Os 

77 

lr 

78 

Pt 

79 

Au 

80 

Hg 

81 

TI 

82 

Pb 

83 

Bi 

84 

Po 

85 

At 

86 

Rn 

87 

Fr 

88 

Ra 

89 

Ac 

104 

Rf 

105 

Ha 

106 

107 

108 

109 

110 

in 

112 

113 

114 

115 

116 

117 

118 

Ce 

59 

60 

61 

62 

63 

64 

65 

66 

67 

68 

69 

70 

71 

* 

Pr 

Nd 

Pm 

Sm 

Eu 

Gd 

Tb 

Dy 

Ho 

Er 

Tm 

Yb 

Lu 

90 

91 

92 

93 

94 

95 

96 

97 

98 

99 

100 

101 

102 

103 

** 

Th 

Pa 

U 

Np 

Pu 

Am 

Cm 

Bk 

Cf 

Es 

Fm 

Md 

No 

Lr 

.0.15  Atomic  masses 

These  atomic  masses  are  given  in  atomic  mass  units  (u),  where  by  definition  the  mass  of 
an  atom  of  the  isotope  carbon-12  equals  12  u.  One  atomic  mass  unit  is  the  same  as  about 
1.66  x  ICO2'  kg.  Data  are  only  given  for  naturally  occurring  elements. 


Ag 

107.9 

Eu 

152.0 

Mo 

95.9 

Sc 

45.0 

Al 

27.0 

F 

19.0 

N 

14.0 

Se 

79.0 

Ar 

39.9 

Fe 

55.8 

Na 

23.0 

Si 

28.1 

As 

74.9 

Ga 

69.7 

Nb 

92.9 

Sn 

118.7 

Au 

197.0 

Gd 

157.2 

Nd 

144.2 

Sr 

87.6 

B 

10.8 

Ge 

72.6 

Ne 

20.2 

Ta 

180.9 

Ba 

137.3 

H 

1.0 

Ni 

58.7 

Tb 

158.9 

Be 

9.0 

He 

4.0 

O 

16.0 

Te 

127.6 

Bi 

209.0 

Hf 

178.5 

Os 

190.2 

Ti 

47.9 

Br 

79.9 

Hg 

200.6 

P 

31.0 

TI 

204.4 

C 

12.0 

Ho 

164.9 

Pb 

207.2 

Tm 

168.9 

Ca 

40.1 

In 

114.8 

Pd 

106.4 

U 

238 

Ce 

140.1 

lr 

192.2 

Pt 

195.1 

V 

50.9 

Cl 

35.5 

K 

39.1 

Pr 

140.9 

W 

183.8 

Co 

58.9 

Kr 

83.8 

Rb 

85.5 

Xe 

131.3 

Cr 

52.0 

La 

138.9 

Re 

186.2 

Y 

88.9 

Cs 

132.9 

Li 

6.9 

Rh 

102.9 

Yb 

173.0 

Cu 

63.5 

Lu 

175.0 

Ru 

101.1 

Zn 

65.4 

Dy 

162.5 

Mg 

24.3 

S 

32.1 

Zr 

91.2 

Er 

167.3 

Mn 

54.9 

Sb 

121.8 
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Notation  and  units  are  summarized  on  page  985. 

Chapter  0,  Introduction  and  Review,  page  13 

Physics  is  the  use  of  the  scientific  method  to  study  the  behavior  of  light  and  matter.  The 
scientific  method  requires  a  cycle  of  theory  and  experiment,  theories  with  both  predictive  and 
explanatory  value,  and  reproducible  experiments. 

The  metric  system  is  a  simple,  consistent  framework  for  measurement  built  out  of  the  meter, 
the  kilogram,  and  the  second  plus  a  set  of  prefixes  denoting  powers  of  ten.  The  most  systematic 
method  for  doing  conversions  is  shown  in  the  following  example: 


370  ms  x 


10-3  s 
1  ms 


=  0.37  s 


Mass  is  a  measure  of  the  amount  of  a  substance.  Mass  can  be  defined  gravitationally,  by 
comparing  an  object  to  a  standard  mass  on  a  double-pan  balance,  or  in  terms  of  inertia,  by 
comparing  the  effect  of  a  force  on  an  object  to  the  effect  of  the  same  force  on  a  standard  mass. 
The  two  definitions  are  found  experimentally  to  be  proportional  to  each  other  to  a  high  degree 
of  precision,  so  we  usually  refer  simply  to  “mass,”  without  bothering  to  specify  which  type. 

A  force  is  that  which  can  change  the  motion  of  an  object.  The  metric  unit  of  force  is  the 
Newton,  defined  as  the  force  required  to  accelerate  a  standard  1-kg  mass  from  rest  to  a  speed 
of  1  m/s  in  1  s. 

Scientific  notation  means,  for  example,  writing  3.2  x  10'5  rather  than  320000. 

Writing  numbers  with  the  correct  number  of  significant  figures  correctly  communicates  how 
accurate  they  are.  As  a  rule  of  thumb,  the  final  result  of  a  calculation  is  no  more  accurate  than, 
and  should  have  no  more  significant  figures  than,  the  least  accurate  piece  of  data. 

Nature  behaves  differently  on  large  and  small  scales.  Galileo  showed  that  this  results  fun¬ 
damentally  from  the  way  area  and  volume  scale.  Area  scales  as  the  second  power  of  length, 
A  oc  L2,  while  volume  scales  as  length  to  the  third  power,  V  oc  L3. 

An  order  of  magnitude  estimate  is  one  in  which  we  do  not  attempt  or  expect  an  exact  answer. 
The  main  reason  why  the  uninitiated  have  trouble  with  order-of-magnitude  estimates  is  that 
the  human  brain  does  not  intuitively  make  accurate  estimates  of  area  and  volume.  Estimates  of 
area  and  volume  should  be  approached  by  first  estimating  linear  dimensions,  which  one’s  brain 
has  a  feel  for. 

Velocity,  dx/  dt,  measures  how  fast  an  object  is  moving.  Acceleration,  cPx/  df2,  measures 
how  quickly  its  velocity  is  changing.  For  motion  with  constant  acceleration,  we  have  these  useful 


relations: 


Av 

a  =  vr- 

At 

1  2 

x  =  -at  +  vQt  +  xQ 
vj  =  v2  +  2aAx 

Chapter  1,  Conservation  of  Mass,  page  55 

Conservation  laws  are  the  foundation  of  physics.  A  conservation  law  states  that  a  certain 
quantity  can  be  neither  created  nor  destroyed;  the  total  amount  of  it  remains  the  same. 

Mass  is  a  conserved  quantity  in  classical  physics,  i.e.  physics  before  Einstein.  This  is 
plausible,  since  we  know  that  matter  is  composed  of  subatomic  particles;  if  the  particles  are 
neither  created  or  destroyed,  then  it  makes  sense  that  the  total  mass  will  remain  the  same. 
There  are  two  ways  of  defining  mass. 

Gravitational  mass  is  defined  by  measuring  the  effect  of  gravity  on  a  particular  object,  and 
comparing  with  some  standard  object,  taking  care  to  test  both  objects  at  a  location  where  the 
strength  of  gravity  is  the  same. 

Inertial  mass  is  defined  by  measuring  how  much  a  particular  object  resists  a  change  in  its 
state  of  motion.  For  instance,  an  object  placed  on  the  end  of  a  spring  will  oscillate  if  the  spring 
is  initially  compressed,  and  a  more  massive  object  will  take  longer  to  complete  one  oscillation. 

Inertial  and  gravitational  mass  are  equivalent:  experiments  show  to  a  very  high  degree  of 
precision  that  any  two  objects  with  the  same  inertial  mass  have  the  same  gravitational  mass  as 
well. 

The  definition  of  inertial  mass  depends  on  a  correct  but  counterintuitive  assumption:  that 
an  object  resists  a  change  in  its  state  of  motion.  Most  people  intuitively  believe  that  motion 
has  a  natural  tendency  to  slow  down.  This  cannot  be  correct  as  a  general  statement,  because 
“to  slow  down”  is  not  a  well-defined  concept  unless  we  specify  what  we  are  measuring  motion 
relative  to.  This  insight  is  credited  to  Galileo,  and  the  general  principle  of  Galilean  relativity 
states  that  the  laws  of  physics  are  the  same  in  all  inertial  frames  of  reference.  In  other  words, 
there  is  no  way  to  distinguish  a  moving  frame  of  reference  from  one  that  is  at  rest.  To  establish 
which  frames  of  reference  are  inertial,  we  first  must  find  one  inertial  frame  in  which  objects 
appear  to  obey  Galilean  relativity.  The  surface  of  the  earth  is  an  inertial  frame  to  a  reasonably 
good  approximation,  and  the  frame  of  reference  of  the  stars  is  an  even  better  one.  Once  we  have 
found  one  inertial  frame  of  reference,  any  other  frame  is  inertial  which  is  moving  in  a  straight 
line  at  constant  velocity  relative  to  the  first  one.  For  instance,  if  the  surface  of  the  earth  is  an 
approximately  inertial  frame,  then  a  train  traveling  in  a  straight  line  at  constant  speed  is  also 
approximately  an  inertial  frame. 

The  unit  of  mass  is  the  kilogram,  which,  along  with  the  meter  and  the  second,  forms  the 
basis  for  the  SI  system  of  units  (also  known  as  the  mks  system).  A  fundamental  skill  in  science 
is  to  know  the  definitions  of  the  most  common  metric  prefixes,  which  are  summarized  on  page 
986,  and  to  be  able  to  convert  among  them. 

One  consequence  of  Einstein’s  theory  of  special  relativity  is  that  mass  can  be  converted  to 
energy  and  energy  to  mass.  This  prediction  has  been  verified  amply  by  experiment.  Thus  the 
conserved  quantity  is  not  really  mass  but  rather  the  total  “mass-energy,”  m.  +  E/c2,  where  c  is 
the  speed  of  light.  Since  the  speed  of  light  is  a  large  number,  the  F/c2  term  is  ordinarily  small 
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in  everyday  life,  which  is  why  we  can  usually  neglect  it. 

Chapter  2,  Conservation  of  Energy,  page  73 

We  observe  that  certain  processes  are  physically  impossible.  For  example,  there  is  no  process 
that  can  heat  up  an  object  without  using  up  fuel  or  having  some  other  side  effect  such  as  cooling 
a  different  object.  We  find  that  we  can  neatly  separate  the  possible  processes  from  the  impossible 
by  defining  a  single  numerical  quantity,  called  energy ,  which  is  conserved.  Energy  comes  in  many 
forms,  such  as  heat,  motion,  sound,  light,  the  energy  required  to  melt  a  solid,  and  gravitational 
energy  (e.g.  the  energy  that  depends  on  the  distance  between  a  rock  and  the  earth).  Because 
it  has  so  many  forms,  we  can  arbitrarily  choose  one  form,  heat,  in  order  to  define  a  standard 
unit  for  our  numerical  scale  of  energy.  Energy  is  measured  in  units  of  joules  (J),  and  one  joule 
can  be  defined  as  the  amount  of  energy  required  in  order  to  raise  the  termperature  of  a  certain 
amount  of  water  by  a  certain  number  of  degrees.  (The  numbers  are  not  worth  memorizing.) 
Power  is  defined  as  the  rate  of  change  of  energy  P  =  d E/  d t,  and  the  unit  of  power  is  the  watt, 
1  W  =  1  J/s. 

Once  we  have  defined  one  type  of  energy  numerically,  we  can  perform  experiments  that  es¬ 
tablish  the  mathematical  rules  governing  other  types  of  energy.  For  example,  in  his  paddlewheel 
experiment,  James  Joule  allowed  weights  to  drop  through  a  certain  height  and  spin  paddlewheels 
inside  sealed  canisters  of  water,  thereby  heating  the  water  through  friction.  Since  in  this  book 
we  define  the  joule  unit  in  terms  of  the  temperature  of  water,  we  can  think  of  the  paddlewheel 
experiment  as  establishing  a  rule  for  the  gravitational  energy  of  a  mass  which  is  at  a  certain 
height, 

d Ug  =  mg  dy, 

where  dUg  is  the  infinitesimal  change  in  the  gravitational  energy  of  a  mass  m  when  its  height 
is  changed  by  an  infinitesimal  amount  d y  in  the  vertical  direction.  The  quantity  g  is  called 
the  gravitational  field,  and  at  the  earth’s  surface  it  has  a  numerical  value  of  about  10  J/kg-m. 
That  is,  about  10  joules  of  energy  are  required  in  order  to  raise  a  one-kilogram  mass  by  one 
meter.  (The  gravitational  field  g  also  has  the  interpretation  that  when  we  drop  an  object,  its 
acceleration,  <#  y/ d t2,  is  equal  to  g.) 

Using  similar  techniques,  we  find  that  the  energy  of  a  moving  object,  called  its  kinetic  energy, 
is  given  by 

1  2 
A  =  -mv  , 

where  m  is  its  mass  and  v  its  velocity.  The  proportionality  factor  equals  1/2  exactly  by  the 
design  of  the  SI  system  of  units,  and  since  the  SI  is  based  on  the  meter,  the  kilogram,  and  the 
second,  the  joule  is  considered  to  be  a  derived  unit,  1  J  =  1  kg-m2/s2. 

When  the  interaction  energy  U  has  a  local  maximum  or  minimum  with  respect  to  the  position 
of  an  object  (dU /  d.x  =  0),  then  the  object  is  in  equilibrium  at  that  position.  For  example,  if 
a  weight  is  hanging  from  a  rope,  and  is  initially  at  rest  at  the  bottom,  then  it  must  remain  at 
rest,  because  this  is  a  position  of  minimum  gravitational  energy  Ug,  to  move,  it  would  have  to 
increase  both  its  kinetic  and  its  gravitational  energy,  which  would  violate  conservation  of  energy, 
since  the  total  energy  would  increase. 

Since  kinetic  energy  is  independent  of  the  direction  of  motion,  conservation  of  energy  is 
often  insufficient  to  predict  the  direction  of  an  object’s  motion.  However,  many  of  the  physically 
impossible  motions  can  be  ruled  out  by  the  trick  of  imposing  conservation  of  energy  in  some  other 
frame  of  reference.  By  this  device,  we  can  solve  the  important  problem  of  projectile  motion:  even 
if  the  projectile  has  horizontal  motion,  we  can  imagine  ourselves  in  a  frame  of  reference  in  which 
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we  are  moving  along  with  the  projectile  horizontally.  In  this  frame  of  reference,  the  projectile 
has  no  horizontal  motion,  and  its  vertical  motion  has  constant  acceleration  g.  Switching  back 
to  a  frame  of  reference  in  which  its  horizontal  velocity  is  not  zero,  we  find  that  a  projectile’s 
horizontal  and  vertical  motions  are  independent,  and  that  the  horizontal  motion  is  at  constant 
velocity. 

Even  in  one-dimensional  motion,  it  is  seldom  possible  to  solve  real-world  problems  and 
predict  the  motion  of  an  object  in  closed  form.  However,  there  are  straightforward  numerical 
techniques  for  solving  such  problems. 

From  observations  of  the  motion  of  the  planets,  we  infer  that  the  gravitational  interaction 
between  any  two  objects  is  given  by  Ug  =  —Gm\milr,  where  r  is  the  distance  between  them. 
When  the  sizes  of  the  objects  are  not  small  compared  to  their  separation,  the  definition  of  r 
becomes  vague;  for  this  reason,  we  should  interpret  this  fundamentally  as  the  law  governing  the 
gravitational  interactions  between  individual  atoms.  However,  in  the  special  case  of  a  spherically 
symmetric  mass  distribution,  there  is  a  shortcut:  the  shell  theorem  states  that  the  gravitational 
interaction  between  a  spherically  symmetric  shell  of  mass  and  a  particle  on  the  outside  of  the 
shell  is  the  same  as  if  the  shell’s  mass  had  all  been  concentrated  at  its  center.  An  astronomical 
body  like  the  earth  can  be  broken  down  into  concentric  shells  of  mass,  and  so  its  gravitational 
interactions  with  external  objects  can  also  be  calculated  simply  by  using  the  center-to-center 
distance. 

Energy  appears  to  come  in  a  bewildering  variety  of  forms,  but  matter  is  made  of  atoms,  and 
thus  if  we  restrict  ourselves  to  the  study  of  mechanical  systems  (containing  material  objects, 
not  light),  all  the  forms  of  energy  we  observe  must  be  explainable  in  terms  of  the  behabior  and 
interactions  of  atoms.  Indeed,  at  the  atomic  level  the  picture  is  much  simpler.  Fundamentally, 
all  the  familiar  forms  of  mechanical  energy  arise  from  either  the  kinetic  energy  of  atoms  or  the 
energy  they  have  because  they  interact  with  each  other  via  gravitational  or  electrical  forces.  For 
example,  when  we  stretch  a  spring,  we  distort  the  latticework  of  atoms  in  the  metal,  and  this 
change  in  the  interatomic  distances  involves  an  increase  in  the  atoms’  electrical  energies. 

An  equilibrium  is  a  local  minimum  of  U ( x ),  and  up  close,  any  minimum  looks  like  a  parabola. 
Therefore,  small  oscillations  around  an  equilibrium  exhibit  universal  behavior,  which  depends 
only  on  the  object’s  mass,  m,  and  on  the  tightness  of  curvature  of  the  minimum,  parametrized 
by  the  quantity  k  =  cP  U /  d.x2.  The  oscillations  are  sinusoidal  as  a  function  of  time,  and  the 
period  is  T  =  2n^/m/k,  independent  of  amplitude.  When  oscillations  are  small  enough  for  these 
statements  to  be  good  approximations,  we  refer  to  them  the  oscillations  as  simple  harmonic. 

Chapter  3,  Conservation  of  Momentum,  page  129 

Since  the  kinetic  energy  of  a  material  object  depends  on  v2,  it  isn’t  obvious  that  conservation 
of  energy  is  consistent  with  Galilean  relativity.  Even  if  a  certain  mechanical  system  conserves 
energy  in  one  frame  of  reference,  the  velocities  involved  will  be  different  as  measured  in  another 
frame,  and  therefore  so  will  the  kinetic  energies.  It  turns  out  that  consistency  is  achieved  only 
if  there  is  a  new  conservation  law,  conservation  of  momentum, 

p  =  mv. 

In  one  dimension,  the  direction  of  motion  is  described  using  positive  and  negative  signs  of 
the  velocity  v,  and  since  mass  is  always  positive,  the  momentum  carries  the  same  sign.  Thus 
conservation  of  momentum,  unlike  conservation  of  energy,  makes  direct  predictions  about  the 
direction  of  motion.  Although  this  line  of  argument  was  based  on  the  assumption  of  a  mechanical 
system,  momentum  need  not  be  mechanical.  Light  has  momentum. 
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A  moving  object’s  momentum  equals  the  sum  of  the  momenta  of  all  its  atoms.  To  avoid 
having  to  carry  out  this  sum,  we  can  use  the  concept  of  the  center  of  mass.  The  center  of  mass 
can  be  defined  as  a  kind  of  weighted  average  of  the  positions  of  all  the  atoms  in  the  object, 

„  _  E  mjXj 

and  although  the  definition  does  involve  a  sum,  we  can  often  locate  the  center  of  mass  by 
symmetry  or  by  physically  determining  an  object’s  balance  point.  The  total  momentum  of  the 
object  is  then  given  by 

P  total  —  TTT'total^cm- 


The  rate  of  transfer  of  momentum  is  called  force,  F  =  dp/  d t,  and  is  measured  in  units  of 
newtons,  1  N  =  1  kg-rn/s2.  As  a  direct  consequence  of  conservation  of  momentum,  we  have  the 
following  statements,  known  as  Newton’s  laws  of  motion: 


If  the  total  force  on  an  object  is  zero,  it  remains  in  the  same  state  of  motion. 

F  =  dp/d  t 

Forces  always  come  in  pairs:  if  object  A  exerts  a  force  on  object  B,  then  object  B  exerts 
a  force  on  object  A  which  is  the  same  strength,  but  in  the  opposite  direction. 

Although  the  fundamental  forces  at  the  atomic  level  are  gravity,  electromagnetism,  and 
nuclear  forces,  we  use  a  different  and  more  practical  classification  scheme  in  everyday  situations. 
In  this  scheme,  the  forces  between  solid  objects  are  described  as  follows: 


A  normal  force,  Fn, 

is  perpendicular  to  the  surface  of  contact,  and  prevents 
objects  from  passing  through  each  other  by  becoming 
as  strong  as  necessary  (up  to  the  point  where  the  ob¬ 
jects  break).  “Normal”  means  perpendicular. 

Static  friction,  Fs , 

is  parallel  to  the  surface  of  contact,  and  prevents  the 
surfaces  from  starting  to  slip  by  becoming  as  strong  as 
necessary,  up  to  a  maximum  value  of  FStmax.  “Static” 
means  not  moving,  i.e.  not  slipping. 

Kinetic  friction,  F^, 

is  parallel  to  the  surface  of  contact,  and  tends  to  slow 
down  any  slippage  once  it  starts.  “Kinetic”  means 
moving,  i.e.  slipping. 

Work  is  defined  as  the  transfer  of  energy  by  a  force.  ( “By  a  force”  is  meant  to  exclude  energy 
transfer  by  heat  conduction.)  The  work  theorem  states  that  when  a  force  occurs  at  a  single  point 
of  contact,  the  amount  of  energy  transferred  by  that  force  is  given  by  dW  =  F  •  dx,  where  dx  is 
the  distance  traveled  by  the  point  of  contact.  The  kinetic  energy  theorem  is  d I\cm  =  Ftotn;-dxcm, 
where  d Kcm  is  the  change  in  the  energy,  (1/2 )mu2m,  an  object  possesses  due  to  the  motion  of 
its  center  of  mass,  F total  is  the  total  force  acting  on  the  object,  and  dxcm  is  the  distance  traveled 
by  the  center  of  mass. 

The  relationship  between  force  and  interaction  energy  is  U  =  —  dF /  da:.  Any  interaction  can 
be  described  either  by  giving  the  force  as  a  function  of  distance  or  the  interaction  energy  as  a 
function  of  distance;  the  other  quantity  can  then  be  found  by  integration  or  differentiation. 

An  oscillator  subject  to  friction  will,  if  left  to  itself,  suffer  a  gradual  decrease  in  the  amplitude 
of  its  motion  as  mechanical  energy  is  transformed  into  heat.  The  quality  factor,  Q,  is  defined  as 
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the  number  of  oscillations  required  for  the  mechanical  energy  to  fall  off  by  a  factor  of  e2n  ~  535. 
To  maintain  an  oscillation  indefinitely,  an  external  force  must  do  work  to  replace  this  energy. 
We  assume  for  mathematical  simplicity  that  the  external  force  varies  sinusoidally  with  time, 
F  =  Fm  sin  ojt.  If  this  force  is  applied  for  a  long  time,  the  motion  approaches  a  steady  state, 
in  which  the  oscillator’s  motion  is  sinusoidal,  matching  the  driving  force  in  frequency  but  not 
in  phase.  The  amplitude  of  this  steady-state  motion,  A,  exhibits  the  phenomenon  of  resonance: 
the  amplitude  is  maximized  at  a  driving  frequency  which,  for  large  Q,  is  essentially  the  same 
as  the  natural  frequency  of  the  free  vibrations,  ojf  (and  for  large  Q  this  is  also  nearly  the  same 
as  oj0  =  y/kjw).  When  the  energy  of  the  steady-state  oscillations  is  graphed  as  a  function  of 
frequency,  both  the  height  and  the  width  of  the  resonance  peak  depend  on  Q.  The  peak  is  taller 
for  greater  Q,  and  its  full  width  at  half-maximum  is  A oj  ~  ujq/Q.  For  small  values  of  Q,  all 
these  approximations  become  worse,  and  at  Q  <  1/2  qualitatively  different  behavior  sets  in. 

For  three-dimensional  motion,  a  moving  object’s  motion  can  be  described  by  three  different 
velocities,  vx  =  dx /  df,  and  similarly  for  vy  and  vz.  Thus  conservation  of  momentum  becomes 
three  different  conservation  laws:  conservation  of  px  =  mvx ,  and  so  on.  The  principle  of 
rotational  invariance  says  that  the  laws  of  physics  are  the  same  regardless  of  how  we  change 
the  orientation  of  our  laboratory:  there  is  no  preferred  direction  in  space.  As  a  consequence  of 
this,  no  matter  how  we  choose  our  x,  y,  and  z  coordinate  axes,  we  will  still  have  conservation  of 
px,  py,  and  pz.  To  simplify  notation,  we  define  a  momentum  vector,  p,  which  is  a  single  symbol 
that  stands  for  all  the  momentum  information  contained  in  the  components  px,  py.  and  pz.  The 
concept  of  a  vector  is  more  general  than  its  application  to  the  momentum:  any  quantity  that 
has  a  direction  in  space  is  considered  a  vector,  as  opposed  to  a  scalar  like  time  or  temperature. 
The  following  table  summarizes  some  vector  operations. 


operation 

definition 

vector 

vector  +  vector 

vector  —  vector 

vector  •  scalar 
vector  /  scalar 

^ j  vector  |  +  vector  y  +  vector \ 

Add  component  by  component. 

Subtract  component  by  component. 
Multiply  each  component  by  the  scalar. 
Divide  each  component  by  the  scalar. 

Differentiation  and  integration  of  vectors  is  defined  component  by  component. 

There  is  only  one  meaningful  (rotationally  invariant)  way  of  defining  a  multiplication  of 
vectors  whose  result  is  a  scalar,  and  it  is  known  as  the  vector  dot  product: 

b  •  c  —  bxcx  T  byCy  T  bzcz 
=  |b|  |c|  cos  Obc- 

The  dot  product  has  most  of  the  usual  properties  associated  with  multiplication,  except  that 
there  is  no  “dot  division.” 

Chapter  4,  Conservation  of  Angular  Momentum,  page  245 

A?igular  momentum  is  a  conserved  quantity.  For  motion  confined  to  a  plane,  the  angular  mo¬ 
mentum  of  a  material  particle  is 

L  =  mv±r , 

where  r  is  the  particle’s  distance  from  the  point  chosen  as  the  axis,  and  v±_  is  the  component 
of  its  velocity  vector  that  is  perpendicular  to  the  line  connecting  the  particle  to  the  axis.  The 
choice  of  axis  is  arbitrary.  In  a  plane,  only  two  directions  of  rotation  are  possible,  clockwise  and 
counterclockwise.  One  of  these  is  considered  negative  and  the  other  positive.  Geometrically, 
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angular  momentum  is  related  to  rate  at  which  area  is  swept  out  by  the  line  segment  connecting 
the  particle  to  the  axis. 

Torque  is  the  rate  of  change  of  angular  momentum,  r  =  d L/  dt.  The  torque  created  by  a 
given  force  can  be  calculated  using  any  of  the  relations 

r  =  rF  sin  9rf 
=  rF± 

=  r±F, 


where  the  subscript  _L  indicates  a  component  perpendicular  to  the  line  connecting  the  axis  to 
the  point  of  application  of  the  force. 

In  the  special  case  of  a  rigid  body  rotating  in  a  single  plane,  we  define 


and 


a  = 


dc j 
dt  ’ 


in  terms  of  which  we  have 


[angular  velocity] 


[angular  acceleration] 


L  =  Iuj 


and 


r  =  la, 


where  the  moment  of  inertia,  I,  is  defined  as 

1  -  muj, 

summing  over  all  the  atoms  in  the  object  (or  using  calculus  to  perform  a  continuous  sum,  i.e. 
an  integral).  The  relationship  between  the  angular  quantities  and  the  linear  ones  is 


vt  =  cor 
vr  =  0 
at  =  ar. 

ar  =  u2r 


[tangential  velocity  of  a  point] 
[radial  velocity  of  a  point] 
[radial  acceleration  of  a  point] 
at  a  distance  r  from  the  axis] 
[radial  acceleration  of  a  point] 
at  a  distance  r  from  the  axis] 


In  three  dimensions,  torque  and  angular  momentum  are  vectors,  and  are  expressed  in  terms  of 
the  vector  cross  product,  which  is  the  only  rotationally  invariant  way  of  defining  a  multiplication 
of  two  vectors  that  produces  a  third  vector: 

L  =  r  x  p 
r  =  r  x  F 
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In  general,  the  cross  product  of  vectors  b  and  c  has  magnitude 

|b  x  c|  =  |b|  |c|  sin0fec, 

which  can  be  interpreted  geometrically  as  the  area  of  the  parallelogram  formed  by  the  two 
vectors  when  they  are  placed  tail-to-tail.  The  direction  of  the  cross  product  lies  along  the  line 
which  is  perpendicular  to  both  vectors;  of  the  two  such  directions,  we  choose  the  one  that  is 
right-handed,  in  the  sense  that  if  we  point  the  fingers  of  the  flattened  right  hand  along  b,  then 
bend  the  knuckles  to  point  the  fingers  along  c,  the  thumb  gives  the  direction  of  b  X  c.  In  terms 
of  components,  the  cross  product  is 


(b  X  c)^,  —  byCz  Cybz 

(b  x  c^y  —  bzcx  czbx 
(b  x  c^z  —  bxCy  cxby 

The  cross  product  has  the  disconcerting  properties 

a  x  b  =  -b  x  a  [noncommutative] 


and 


ax(bxc)/(axb)xc  [nonassociative] , 
and  there  is  no  “cross-division.” 

For  rigid-body  rotation  in  three  dimensions,  we  define  an  angular  velocity  vector  u>,  which 
lies  along  the  axis  of  rotation  and  bears  a  right-hand  relationship  to  it.  Except  in  special  cases, 
there  is  no  scalar  moment  of  inertia  for  which  L  =  Iu;  the  moment  of  inertia  must  be  expressed 
as  a  matrix. 

Chapter  5,  Thermodynamics,  page  299 

A  fluid  is  any  gas  or  liquid,  but  not  a  solid;  fluids  do  not  exhibit  shear  forces.  A  fluid  in  equilib¬ 
rium  exerts  a  force  on  any  surface  which  is  proportional  to  the  surface’s  area  and  perpendicular 
to  the  surface.  We  can  therefore  define  a  quantity  called  the  pressure,  P,  which  is  ratio  of  force 
to  area, 


where  the  subscript  _L  indicates  the  component  of  the  fluid’s  force  which  is  perpendicular  to  the 
surface. 

Usually  it  is  only  the  difference  in  pressure  between  the  two  sides  of  a  surface  that  is  physically 
significant.  Pressure  doesn’t  just  “press  down”  on  things;  air  pressure  upward  under  your  chin 
is  the  same  as  air  pressure  downward  on  your  shoulders.  In  a  fluid  acted  on  by  gravity,  pressure 
varies  with  depth  according  to  the  equation 

dP  =  pg  ■  dy. 

This  equation  is  only  valid  if  the  fluid  is  in  equilibrium,  and  if  g  and  r  are  constant  with  respect 
to  height. 

Temperature  can  be  defined  according  to  the  volume  of  an  ideal  gas  under  conditions  of 
standard  pressure.  The  Kelvin  scale  of  temperature  used  throughout  this  book  equals  zero  at 
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absolute  zero,  the  temperature  at  which  all  random  molecular  motion  ceases,  and  equals  273  K 
at  the  freezing  point  of  water.  We  can  get  away  with  using  the  Celsius  scale  as  long  as  we  are 
only  interested  in  temperature  differences;  a  difference  of  1  degree  C  is  the  same  as  a  difference 
of  1  degree  K. 

It  is  an  observed  fact  that  ideal  gases  obey  the  ideal  gas  law, 


PV  =  nkT, 


and  this  equation  can  be  explained  by  the  kinetic  theory  of  heat,  which  states  that  the  gas 
experts  pressure  on  its  container  because  its  molecules  are  constantly  in  motion.  In  the  kinetic 
theory  of  heat,  the  temperature  of  a  gas  is  proportional  to  the  average  energy  per  molecule. 


Not  all  the  heat  energy  in  an  object  can  be  extracted  to  do  mechanical  work.  We  therefore 
describe  heat  as  a  lower  grade  of  energy  than  other  forms  of  energy.  Entropy  is  a  measure  of 
how  much  of  a  system’s  energy  is  inaccessible  to  being  extracted,  even  by  the  most  efficient  heat 
engine;  a  high  entropy  corresponds  to  a  low  grade  of  energy.  The  change  in  a  system’s  entropy 
when  heat  Q  is  deposited  into  it  is 


AS  = 


Q 

T' 


The  efficiency  of  any  heat  engine  is  defined  as 


energy  we  get  m  useful  form 

efficiency  =  - - - 

energy  we  pay  for 


and  the  efficiency  of  a  Carnot  engine,  the  most  efficient  of  all,  is 


efficiency  =  1  —  — — . 

Th 

These  results  are  all  closely  related.  For  instance,  example  11  on  page  315  uses  AS  =  Q/T  and 
efficiency  =  1  —  Tl/Tjj  to  show  that  a  Carnot  engine  doesn’t  change  the  entropy  of  the  universe. 

Fundamentally,  entropy  is  defined  as  the  being  proportional  to  the  natural  logarithm  of 
the  number  of  states  available  to  a  system,  and  the  above  equation  then  serves  as  a  definition 
of  temperature.  The  entropy  of  a  closed  system  always  increases;  this  is  the  second  law  of 
thermodynamics. 

Chapter  6,  Waves,  page  343 

Wave  motion  differs  in  three  important  ways  from  the  motion  of  material  objects: 


Waves  obey  the  principle  of  superposition.  When  two  waves  collide,  they  simply  add 
together. 

The  medium  is  not  transported  along  with  the  wave.  The  motion  of  any  given  point  in 
the  medium  is  a  vibration  about  its  equilibrium  location,  not  a  steady  forward  motion. 

The  velocity  of  a  wave  depends  on  the  medium,  not  on  the  amount  of  energy  in  the  wave. 
(For  some  types  of  waves,  notably  water  waves,  the  velocity  may  also  depend  on  the  shape 
of  the  wave.) 
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Sound  waves  consist  of  increases  and  decreases  (typically  very  small  ones)  in  the  density  of 
the  air.  Light  is  a  wave,  but  it  is  a  vibration  of  electric  and  magnetic  fields,  not  of  any  physical 
medium.  Light  can  travel  through  a  vacuum. 

A  periodic  wave  is  one  that  creates  a  periodic  motion  in  a  receiver  as  it  passes  it.  Such 
a  wave  has  a  well-defined  period  and  frequency,  and  it  will  also  have  a  wavelength,  which  is 
the  distance  in  space  between  repetitions  of  the  wave  pattern.  The  velocity,  frequency,  and 
wavelength  of  a  periodic  wave  are  related  by  the  equation 

v  =  /A. 

A  wave  emitted  by  a  moving  source  will  undergo  a  Doppler  shift  in  wavelength  and  frequency. 
The  shifted  wavelength  is  given  by  the  equation 


where  v  is  the  velocity  of  the  waves  and  u  is  the  velocity  of  the  source,  taken  to  be  positive 
or  negative  so  as  to  produce  a  Doppler-lengthened  wavelength  if  the  source  is  receding  and  a 
Doppler-shortened  one  if  it  approaches.  A  similar  shift  occurs  if  the  observer  is  moving,  and 
in  general  the  Doppler  shift  depends  approximately  only  on  the  relative  motion  of  the  source 
and  observer  if  their  velocities  are  both  small  compared  to  the  waves’  velocity.  (This  is  not  just 
approximately  but  exactly  true  for  light  waves,  as  required  by  Einstein’s  theory  of  relativity.) 

Whenever  a  wave  encounters  the  boundary  between  two  media  in  which  its  speeds  are 
different,  part  of  the  wave  is  reflected  and  part  is  transmitted.  The  reflection  is  always  reversed 
front-to-back,  but  may  also  be  inverted  in  amplitude.  Whether  the  reflection  is  inverted  depends 
on  how  the  wave  speeds  in  the  two  media  compare,  e.g.  a  wave  on  a  string  is  uninverted  when  it 
is  reflected  back  into  a  segment  of  string  where  its  speed  is  lower.  The  greater  the  difference  in 
wave  speed  between  the  two  media,  the  greater  the  fraction  of  the  wave  energy  that  is  reflected. 
Surprisingly,  a  wave  in  a  dense  material  like  wood  will  be  strongly  reflected  back  into  the  wood 
at  a  wood-air  boundary. 

A  one-dimensional  wave  confined  by  highly  reflective  boundaries  on  two  sides  will  display 
motion  which  is  periodic.  For  example,  if  both  reflections  are  inverting,  the  wave  will  have  a 
period  equal  to  twice  the  time  required  to  traverse  the  region,  or  to  that  time  divided  by  an 
integer.  An  important  special  case  is  a  sinusoidal  wave;  in  this  case,  the  wave  forms  a  stationary 
pattern  composed  of  a  superposition  of  sine  waves  moving  in  opposite  direction. 

Chapter  7,  Relativity,  page  385 

Experiments  show  that  space  and  time  do  not  have  the  properties  claimed  by  Galileo  and 
Newton.  Time  and  space  as  seen  by  one  observer  are  distorted  compared  to  another  observer’s 
perceptions  if  they  are  moving  relative  to  each  other.  This  distortion  is  quantified  by  the  factor 


1 


where  v  is  the  relative  velocity  of  the  two  observers,  and  c  is  a  universal  velocity  that  is  the  same 
in  all  frames  of  reference.  Light  travels  at  c.  A  clock  appears  to  run  fastest  to  an  observer  who 
is  not  in  motion  relative  to  it,  and  appears  to  run  too  slowly  by  a  factor  of  7  to  an  observer  who 
has  a  velocity  v  relative  to  the  clock.  Similarly,  a  meter-stick  appears  longest  to  an  observer 
who  sees  it  at  rest,  and  appears  shorter  to  other  observers.  Time  and  space  are  relative,  not 
absolute.  In  particular,  there  is  no  well-defined  concept  of  simultaneity. 
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All  of  these  strange  effects,  however,  are  very  small  when  the  relative  velocities  are  small 
compared  to  c.  This  makes  sense,  because  Newton’s  laws  have  already  been  thoroughly  tested 
by  experiments  at  such  speeds,  so  a  new  theory  like  relativity  must  agree  with  the  old  one  in 
their  realm  of  common  applicability.  This  requirement  of  backwards-compatibility  is  known  as 
the  correspondence  principle. 

Relativity  has  implications  not  just  for  time  and  space  but  also  for  the  objects  that  inhabit 
time  and  space.  The  correct  relativistic  equation  for  momentum  is 

p  =  m'yv, 

which  is  similar  to  the  classical  p  =  mv  at  low  velocities,  where  7  ~  1,  but  diverges  from  it 
more  and  more  at  velocities  that  approach  c.  Since  7  becomes  infinite  at  v  =  c,  an  infinite  force 
would  be  required  in  order  to  give  a  material  object  enough  momentum  to  move  at  the  speed  of 
light.  In  other  words,  material  objects  can  only  move  at  speeds  lower  than  c.  Relativistically, 
mass  and  energy  are  not  separately  conserved.  Mass  and  energy  are  two  aspects  of  the  same 
phenomenon,  known  as  mass-energy,  and  they  can  be  converted  to  one  another  according  to  the 
equation 

E  =  me2. 

The  mass-energy  of  a  moving  object  is  £  =  mqc2.  When  an  object  is  at  rest,  7  =  1,  and  the 
mass-energy  is  simply  the  energy-equivalent  of  its  mass,  me2.  When  an  object  is  in  motion,  the 
excess  mass-energy,  in  addition  to  the  me2,  can  be  interpreted  as  its  kinetic  energy. 

Chapter  8,  Atoms  and  Electromagnetism,  page  459 

All  the  forces  we  encounter  in  everyday  life  boil  down  to  two  basic  types:  gravitational  forces 
and  electrical  forces.  A  force  such  as  friction  or  a  “sticky  force”  arises  from  electrical  forces 
between  individual  atoms. 

Just  as  we  use  the  word  mass  to  describe  how  strongly  an  object  participates  in  gravitational 
forces,  we  use  the  word  charge  for  the  intensity  of  its  electrical  forces.  There  are  two  types  of 
charge.  Two  charges  of  the  same  type  repel  each  other,  but  objects  whose  charges  are  different 
attract  each  other.  Charge  is  measured  in  units  of  coulombs  (C). 

Mobile  charged  particle  model:  A  great  many  phenomena  are  easily  understood  if  we  imagine 
matter  as  containing  two  types  of  charged  particles,  which  are  at  least  partially  able  to  move 
around. 

Positive  and  negative  charge:  Ordinary  objects  that  have  not  been  specially  prepared  have 
both  types  of  charge  spread  evenly  throughout  them  in  equal  amounts.  The  object  will  then 
tend  not  to  exert  electrical  forces  on  any  other  object,  since  any  attraction  due  to  one  type  of 
charge  will  be  balanced  by  an  equal  repulsion  from  the  other.  (We  say  “tend  not  to”  because 
bringing  the  object  near  an  object  with  unbalanced  amounts  of  charge  could  cause  its  charges 
to  separate  from  each  other,  and  the  force  would  no  longer  cancel  due  to  the  unequal  distances.) 
It  therefore  makes  sense  to  describe  the  two  types  of  charge  using  positive  and  negative  signs, 
so  that  an  unprepared  object  will  have  zero  total  charge. 

The  Coulomb  force  law  states  that  the  magnitude  of  the  electrical  force  between  two  charged 
particles  is  given  by 

|F|  =  fc|gi||g2| 

Conservation  of  charge:  An  even  more  fundamental  reason  for  using  positive  and  negative 
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signs  for  charge  is  that  with  this  definition  the  total  charge  of  a  closed  system  is  a  conserved 
quantity. 

Quantization  of  charge:  Millikan’s  oil  drop  experiment  showed  that  the  total  charge  of  an 
object  could  only  be  an  integer  multiple  of  a  basic  unit  of  charge,  e.  This  supported  the  idea 
the  the  “flow”  of  electrical  charge  was  the  motion  of  tiny  particles  rather  than  the  motion  of 
some  sort  of  mysterious  electrical  fluid. 

Einstein’s  analysis  of  Brownian  motion  was  the  first  definitive  proof  of  the  existence  of 
atoms.  Thomson’s  experiments  with  vacuum  tubes  demonstrated  the  existence  of  a  new  type  of 
microscopic  particle  with  a  very  small  ratio  of  mass  to  charge.  Thomson  correctly  interpreted 
these  as  building  blocks  of  matter  even  smaller  than  atoms:  the  first  discovery  of  subatomic 
particles.  These  particles  are  called  electrons. 

The  above  experimental  evidence  led  to  the  first  useful  model  of  the  interior  structure  of 
atoms,  called  the  raisin  cookie  model.  In  the  raisin  cookie  model,  an  atom  consists  of  a  relatively 
large,  massive,  positively  charged  sphere  with  a  certain  number  of  negatively  charged  electrons 
embedded  in  it. 


Rutherford  and  Marsden  observed  that  some  alpha  particles  from  a  beam  striking  a  thin 
gold  foil  came  back  at  angles  up  to  180  degrees.  This  could  not  be  explained  in  the  then-favored 
raisin-cookie  model  of  the  atom,  and  led  to  the  adoption  of  the  planetary  model  of  the  atom,  in 
which  the  electrons  orbit  a  tiny,  positively-charged  nucleus.  Further  experiments  showed  that 
the  nucleus  itself  was  a  cluster  of  positively-charged  protons  and  uncharged  neutrons. 


Radioactive  nuclei  are  those  that  can  release  energy.  The  most  common  types  of  radioactivity 
are  alpha  decay  (the  emission  of  a  helium  nucleus),  beta  decay  (the  transformation  of  a  neutron 
into  a  proton  or  vice-versa),  and  gamma  decay  (the  emission  of  a  type  of  very-high-frequency 
light).  Stars  are  powered  by  nuclear  fusion  reactions,  in  which  two  light  nuclei  collide  and  form 
a  bigger  nucleus,  with  the  release  of  energy. 


Human  exposure  to  ionizing  radiation  is  measured  in  units  of  millirem.  The  typical  person 
is  exposed  to  about  100  mrern  worth  of  natural  background  radiation  per  year. 

Chapter  9,  DC  Circuits,  page  515 

All  electrical  phenomena  are  alike  in  that  that  arise  from  the  presence  or  motion  of  charge.  Most 
practical  electrical  devices  are  based  on  the  motion  of  charge  around  a  complete  circuit,  so  that 
the  charge  can  be  recycled  and  does  not  hit  any  dead  ends.  The  most  useful  measure  of  the  flow 
of  charge  is  current , 

/  =  — 
dt  ’ 

An  electrical  device  whose  job  is  to  transform  energy  from  one  form  into  another,  e.g.  a  lightbulb, 
uses  power  at  a  rate  which  depends  both  on  how  rapidly  charge  is  flowing  through  it  and  on  how 
much  work  is  done  on  each  unit  of  charge.  The  latter  quantity  is  known  as  the  voltage  difference 
between  the  point  where  the  current  enters  the  device  and  the  point  where  the  current  leaves  it. 
Since  there  is  a  type  of  electrical  energy  associated  with  electrical  forces,  the  amount  of  work 
they  do  is  equal  to  the  difference  in  potential  energy  between  the  two  points,  and  we  therefore 
define  voltage  differences  directly  in  terms  of  electrical  energy, 

A Uelec 


AV  = 


q 


The  rate  of  power  dissipation  is 


P  =  IAV. 


999 


Many  important  electrical  phenomena  can  only  be  explained  if  we  understand  the  mecha¬ 
nisms  of  current  flow  at  the  atomic  level.  In  metals,  currents  are  carried  by  electrons,  in  liquids 
by  ions.  Gases  are  normally  poor  conductors  unless  their  atoms  are  subjected  to  such  intense 
electrical  forces  that  the  atoms  become  ionized. 

Many  substances,  including  all  solids,  respond  to  electrical  forces  in  such  a  way  that  the 
flow  of  current  between  two  points  is  proportional  to  the  voltage  difference  between  those  points 
(assuming  the  voltage  difference  is  small).  Such  a  substance  is  called  ohmic,  and  an  object  made 
out  of  an  ohmic  substance  can  be  rated  in  terms  of  its  resistance, 


An  important  corollary  is  that  a  perfect  conductor,  with  R  =  0,  must  have  constant  voltage 
everywhere  within  it. 

A  schematic  is  a  drawing  of  a  circuit  that  standardizes  and  stylizes  its  features  to  make  it 
easier  to  understand.  Any  circuit  can  be  broken  down  into  smaller  parts.  For  instance,  one  big 
circuit  may  be  understood  as  two  small  circuits  in  series,  another  as  three  circuits  in  parallel. 
When  circuit  elements  are  combined  in  parallel  and  in  series,  we  have  two  basic  rules  to  guide 
us  in  understanding  how  the  parts  function  as  a  whole: 

The  junction  rule:  In  any  circuit  that  is  not  storing  or  releasing  charge,  conservation  of 
charge  implies  that  the  total  current  flowing  out  of  any  junction  must  be  the  same  as  the 
total  flowing  in. 

The  loop  rule:  Assuming  the  standard  convention  for  plus  and  minus  signs,  the  sum  of 
the  voltage  drops  around  any  closed  loop  in  a  circuit  must  be  zero. 

The  simplest  application  of  these  rules  is  to  pairs  of  resistors  combined  in  series  or  parallel. 
In  such  cases,  the  pair  of  resistors  acts  just  like  a  single  unit  with  a  certain  resistance  value,  called 
their  equivalent  resistance.  Resistances  in  series  add  to  produce  a  larger  equivalent  resistance, 

R  =  R\  +  i?2; 

because  the  current  has  to  fight  its  way  through  both  resistances.  Parallel  resistors  combine  to 
produce  an  equivalent  resistance  that  is  smaller  than  either  individual  resistance, 

1  _  i  1 

R  ~  Rl  +  ~R2' 

because  the  current  has  two  different  paths  open  to  it. 

An  important  example  of  resistances  in  parallel  and  series  is  the  use  of  voltmeters  and 
ammeters  in  resistive  circuits.  A  voltmeter  acts  as  a  large  resistance  in  parallel  with  the  resistor 
across  which  the  voltage  drop  is  being  measured.  The  fact  that  its  resistance  is  not  infinite 
means  that  it  alters  the  circuit  it  is  being  used  to  investigate,  producing  a  lower  equivalent 
resistance.  An  ammeter  acts  as  a  small  resistance  in  series  with  the  circuit  through  which  the 
current  is  to  be  determined.  Its  resistance  is  not  quite  zero,  which  leads  to  an  increase  in  the 
resistance  of  the  circuit  being  tested. 

Chapter  10,  Fields,  page  563 

Newton  conceived  of  a  universe  where  forces  reached  across  space  instantaneously,  but  we  now 
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know  that  there  is  a  delay  in  time  before  a  change  in  the  configuration  of  mass  and  charge  in 
one  corner  of  the  universe  will  make  itself  felt  as  a  change  in  the  forces  experienced  far  away. 
We  imagine  the  outward  spread  of  such  a  change  as  a  ripple  in  an  invisible  universe-filling  field 
of  force. 

As  an  alternative  to  our  earlier  energy-based  definition,  we  can  define  the  gravitational  field 
at  a  given  point  as  the  force  per  unit  mass  exerted  on  objects  inserted  at  that  point,  and  likewise 
the  electric  field  is  defined  as  the  force  per  unit  charge.  These  fields  are  vectors,  and  the  fields 
generated  by  multiple  sources  add  according  to  the  rules  of  vector  addition. 

The  relationship  between  the  electric  field  and  the  voltage  is 

w__ 

rv  - 

OX 

By  ~  ' 


which  can  be  notated  more  compactly  as  a  gradient, 

E  =  -W. 


Fields  of  force  contain  energy,  and  the  density  of  energy  is  proportional  to  the  square  of  the 
magnitude  of  the  field, 

d Uq  = - —  g2  dv 

9  8tt  Gy 

d  Ue  =  ^—E2dv 
8irk 

dUm  oc  B 2  dv 


The  equation  for  the  energy  stored  in  the  magnetic  field  is  given  explicitly  in  the  next  chapter; 
for  now,  we  only  need  the  fact  that  it  behaves  in  the  same  general  way  as  the  first  two  equations: 
the  energy  density  is  proportional  to  the  square  of  the  field.  In  the  case  of  static  electric  fields, 
we  can  calculate  potential  energy  either  using  the  previous  definition  in  terms  of  mechanical 
work  or  by  calculating  the  energy  stored  in  the  fields.  If  the  fields  are  not  static,  the  old  method 
gives  incorrect  results  and  the  new  one  must  be  used. 

Capacitance,  C,  and  inductance,  L,  are  defined  as 


Uc  =  2Cq 


and 


measured  in  units  of  farads  and  henries,  respectively.  The  voltage  across  a  capacitor  or  inductor 
is  given  by 


Ec  = 


fi_ 

C 
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In  the  equation  for  the  inductor,  the  direction  of  the  voltage  drop  (plus  or  minus  sign)  is  such 
that  the  inductor  resists  the  change  in  current.  Although  the  equation  for  the  voltage  across 
an  inductor  follows  directly  from  fundamental  arguments  concerning  the  energy  stored  in  the 
magnetic  field,  the  result  is  a  surprise:  the  voltage  drop  implies  the  existence  of  electric  fields 
which  are  not  created  by  charges.  This  is  an  induced  electric  field,  discussed  in  more  detail  in 
the  next  chapter. 

A  series  LRC  circuit  exhibits  oscillation,  and,  if  driven  by  an  external  voltage,  resonates. 
The  Q  of  the  circuit  relates  to  the  resistance  value.  For  large  Q,  the  resonant  frequency  is 

A  series  RC  or  RL  circuit  exhibits  exponential  decay, 

‘,  =  ,foexp(-i^ 


or 


/  =  I0  exp 


and  the  quantity  RC  or  L/R  is  known  as  the  time  constant. 

When  driven  by  a  sinusoidal  AC  voltage  with  amplitude  V,  a  capacitor,  resistor,  or  inductor 
responds  with  a  current  having  amplitude 


where  the  impedance,  Z,  is  a  frequency-dependent  quantity  having  units  of  ohms.  In  a  capacitor, 
the  current  has  a  phase  that  is  90°  ahead  of  the  voltage,  while  in  an  inductor  the  current  is 
90°  behind.  We  can  represent  these  phase  relationships  by  defining  the  impedances  as  complex 
numbers: 


Zr  =  R 
Zl  =  iooL 


The  arguments  of  the  complex  impedances  are  to  be  interpreted  as  phase  relationships  between 
the  oscillating  voltages  and  currents.  The  complex  impedances  defined  in  this  way  combine  in 
series  and  parallel  according  to  the  same  rules  as  resistances. 

When  a  voltage  source  is  driving  a  load  through  a  transmission  line,  the  maximum  power  is 
delivered  to  the  load  when  the  impedances  of  the  line  and  the  load  are  matched. 

Gauss  ’  law  states  that  for  any  region  of  space,  the  flux  through  the  surface, 

*  =  E 
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is  related  by 


$  =  4-7T  kqin 

to  the  charge  enclosed  within  the  surface. 

Chapter  11,  Electromagnetism,  page  653 

Relativity  implies  that  there  must  be  an  interaction  between  moving  charges  and  other  moving 
charges.  This  magnetic  interaction  is  in  addition  to  the  usual  electrical  one.  The  magnetic  field 
can  be  defined  in  terms  of  the  magnetic  force  exerted  on  a  test  charge, 

F  =  qw  x  B, 

or,  alternatively,  in  terms  of  the  torque  on  a  magnetic  test  dipole, 


_  I  |  •  Q1 

|  mj  |  sin  9 

where  6  is  the  angle  between  the  dipole  vector  and  the  field.  The  magnetic  dipole  moment  m  of 
a  loop  of  current  has  magnitude  m  =  I  A,  and  is  in  the  (right-handed)  direction  perpendicular 
to  the  loop. 

The  magnetic  held  has  no  sources  or  sinks.  Gauss’  law  for  magnetism  is 

&B  =  0. 


The  external  magnetic  held  of  a  long,  straight  wire  is 

B=2kI 


c2R' 


forming  a  right-handed  circular  pattern  around  the  wire. 
The  energy  of  the  magnetic  held  is 


dUm  =  - — -B2  dt;. 

8irk 

The  magnetic  held  resulting  from  a  set  of  currents  can  be  computed  by  finding  a  set  of 
dipoles  that  combine  to  give  those  currents.  The  held  of  a  dipole  is 


km, 


Bz  =  — r  (3  cos2  6  —  l) 


km, 


3 


Br  =  (3  sin  0  cos  9)  r  3, 


which  reduces  to  Bz  =  km/c2r 3  in  the  plane  perpendicular  to  the  dipole  moment.  By  construct¬ 
ing  a  current  loop  out  of  dipoles,  one  can  prove  the  Biot.-Savart  law, 

kl  df  xr 

dB  =  - o  3 - ’ 

which  gives  the  held  when  we  integrate  over  a  closed  current  loop.  All  of  this  is  valid  only  for 
static  magnetic  helds. 
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Ampere’s  law  is  another  way  of  relating  static  magnetic  fields  to  the  static  currents  that 
created  them,  and  it  is  more  easily  extended  to  nonstatic  fields  than  is  the  Biot-Savart  law. 
Ampere’s  law  states  that  the  circulation  of  the  magnetic  field, 

Fs  =  ’ 

around  the  edge  of  a  surface  is  related  to  the  current  Ithrough  passing  through  the  surface, 

47i-/c 

—  ^2  through' 

In  the  general  nonstatic  case,  the  fundamental  laws  of  physics  governing  electric  and  magnetic 
fields  are  Maxwell’s  equations ,  which  state  that  for  any  closed  surface,  the  fluxes  through  the 
surface  are 


&E  =  47t  kqin  and 

=  0. 


For  any  surface  that  is  not  closed,  the  circulations  around  the  edges  of  the  surface  are  given  by 

d$B 


Te  =  - 


dt 


and 


2  _  d$E  A  UT 

C  1-  B  Q  T  Mtklthrough- 


The  most  important  result  of  Maxwell’s  equations  is  the  existence  of  electromagnetic  waves 
which  propagate  at  the  velocity  of  light  —  that’s  what  light  is.  The  waves  are  transverse,  and 
the  electric  and  magnetic  fields  are  perpendicular  to  each  other.  There  are  no  purely  electric  or 
purely  magnetic  waves;  their  amplitudes  are  always  related  to  one  another  by  B  =  E/c.  They 
propagate  in  the  right-handed  direction  given  by  the  cross  product  E  x  B,  and  carry  momentum 
p  =  U/c. 

A  complete  statement  of  Maxwell’s  equations  in  the  presence  of  electric  and  magnetic  ma¬ 
terials  is  as  follows: 


=  Qiree 
=  0 

d$B 


rE  =  — 


d  t 


TH  =  ^  +  4ee, 

df 


where  the  auxiliary  fields  D  and  H  are  defined  as 


D  =  eE  and 

h  =  5, 

h 

and  e  and  p  are  the  permittivity  and  permeability  of  the  substance. 

Chapter  12,  Optics,  page  741 

The  ray  model  of  light:  We  can  understand  many  phenomena  involving  light  without  having 
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to  use  sophisticated  models  such  as  the  wave  model  or  the  particle  model.  Instead,  we  simply 
describe  light  according  to  the  path  it  takes,  which  we  call  a  ray.  The  ray  model  of  light  is  useful 
when  light  is  interacting  with  material  objects  that  are  much  larger  than  a  wavelength  of  light. 
Since  a  wavelength  of  visible  light  is  so  short  compared  to  the  human  scale  of  existence,  the  ray 
model  is  useful  in  many  practical  cases. 

A  smooth  surface  produces  specular  reflection,  in  which  the  reflected  ray  exits  at  the  same 
angle  with  respect  to  the  normal  as  that  of  the  incoming  ray.  A  rough  surface  gives  diffuse 
reflection,  where  a  single  ray  of  light  is  divided  up  into  many  weaker  reflected  rays  going  in 
many  directions. 

Images:  A  large  class  of  optical  devices,  including  lenses  and  flat  and  curved  mirrors,  operates 
by  bending  light  rays  to  form  an  image.  A  real  image  is  one  for  which  the  rays  actually  cross  at 
each  point  of  the  image.  A  virtual  image,  such  as  the  one  formed  behind  a  flat  mirror,  is  one 
for  which  the  rays  only  appear  to  have  crossed  at  a  point  on  the  image.  A  real  image  can  be 
projected  onto  a  screen;  a  virtual  one  cannot. 

Mirrors  and  lenses  will  generally  make  an  image  that  is  either  smaller  than  or  larger  than  the 
original  object.  The  scaling  factor  is  called  the  magnification.  In  many  situations,  the  angular 
magnification  is  more  important  than  the  actual  magnification. 

Every  lens  or  mirror  has  a  property  called  the  focal  length,  which  is  defined  as  the  distance 
from  the  lens  or  mirror  to  the  image  it  forms  of  an  object  that  is  infinitely  far  away.  A  stronger 
lens  or  mirror  has  a  shorter  focal  length. 

Locating  images:  The  relationship  between  the  locations  of  an  object  and  its  image  formed 
by  a  lens  or  mirror  can  always  be  expressed  by  equations  of  the  form 

Of  =  ±  do 

1  _  i  i 

f  di  dQ 

The  choice  of  plus  and  minus  signs  depends  on  whether  we  are  dealing  with  a  lens  or  a  mirror, 
whether  the  lens  or  mirror  is  converging  or  diverging,  and  whether  the  image  is  real  or  virtual. 
A  method  for  determining  the  plus  and  minus  signs  is  as  follows: 

1.  Use  ray  diagrams  to  decide  whether  di  and  d0  vary  in  the  same  way  or  in  opposite  ways. 
Based  on  this,  decide  whether  the  two  signs  in  the  equation  are  the  same  or  opposite.  If 
the  signs  are  opposite,  go  on  to  step  2  to  determine  which  is  positive  and  which  is  negative. 

2.  If  the  signs  are  opposite,  we  need  to  decide  which  is  the  positive  one  and  which  is  the 
negative.  Since  the  focal  angle  is  never  negative,  the  smaller  angle  must  be  the  one  with 
a  minus  sign. 


Once  the  correct  form  of  the  equation  has  been  determined,  the  magnification  can  be  found 
via  the  equation 


This  equation  expresses  the  idea  that  the  entire  image-world  is  shrunk  consistently  in  all  three 
dimensions. 
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Refraction:  Refraction  is  a  change  in  direction  that  occurs  when  a  wave  encounters  the 
interface  between  two  media.  Together,  refraction  and  reflection  account  for  the  basic  principles 
behind  nearly  all  optical  devices. 

Snell  discovered  the  equation  for  refraction, 

n\  sin  6\  =  n2  sin  62. 

[angles  measured  with  respect  to  the  normal] 

through  experiments  with  light  rays,  long  before  light  was  proven  to  be  a  wave.  Snell’s  law  can 
be  proven  based  on  the  geometrical  behavior  of  waves.  Here  n  is  the  index  of  refraction.  Snell 
invented  this  quantity  to  describe  the  refractive  properties  of  various  substances,  but  it  was  later 
found  to  be  related  to  the  speed  of  light  in  the  substance, 

c 

n  =  -, 

v 

where  c  is  the  speed  of  light  in  a  vacuum.  In  general  a  material’s  index  of  refraction  is  different 
for  different  wavelengths  of  light.  Total  internal  reflection  occurs  when  there  is  no  angle  that 
satisfies  Snell’s  law. 

Wave  optics:  Wave  optics  is  a  more  general  theory  of  light  than  ray  optics.  When  light 
interacts  with  material  objects  that  are  much  larger  then  one  wavelength  of  the  light,  the  ray 
model  of  light  is  approximately  correct,  but  in  other  cases  the  wave  model  is  required. 

Huygens’  principle  states  that,  given  a  wavefront  at  one  moment  in  time,  the  future  behavior 
of  the  wave  can  be  found  by  breaking  the  wavefront  up  into  a  large  number  of  small,  side-by-side 
wave  peaks,  each  of  which  then  creates  a  pattern  of  circular  or  spherical  ripples.  As  these  sets 
of  ripples  add  together,  the  wave  evolves  and  moves  through  space.  Since  Huygens’  principle  is 
a  purely  geometrical  construction,  diffraction  effects  obey  a  simple  scaling  rule:  the  behavior  is 
unchanged  if  the  wavelength  and  the  dimensions  of  the  diffracting  objects  are  both  scaled  up 
or  down  by  the  same  factor.  If  we  wish  to  predict  the  angles  at  which  various  features  of  the 
diffraction  pattern  radiate  out,  scaling  requires  that  these  angles  depend  only  on  the  unitless 
ratio  A/d,  where  d  is  the  size  of  some  feature  of  the  diffracting  object. 

Double-slit  diffraction  is  easily  analyzed  using  Huygens’  principle  if  the  slits  are  narrower 
than  one  wavelength.  We  need  only  construct  two  sets  of  ripples,  one  spreading  out  from  each 
slit.  The  angles  of  the  maxima  (brightest  points  in  the  bright  fringes)  and  minima  (darkest 
points  in  the  dark  fringes)  are  given  by  the  equation 

A  sin  6 
d  m  ’ 

where  d  is  the  center-to-center  spacing  of  the  slits,  and  m  is  an  integer  at  a  maximum  or  an 
integer  plus  1/2  at  a  minimum. 

If  some  feature  of  a  diffracting  object  is  repeated,  the  diffraction  fringes  remain  in  the  same 
places,  but  become  narrower  with  each  repetition.  By  repeating  a  double-slit  pattern  hundreds 
or  thousands  of  times,  we  obtain  a  diffraction  grating. 

A  single  slit  can  produce  diffraction  fringes  if  it  is  larger  than  one  wavelength.  Many  practical 
instances  of  diffraction  can  be  interpreted  as  single-slit  diffraction,  e.g.,  diffraction  in  telescopes. 
The  main  thing  to  realize  about  single-slit  diffraction  is  that  it  exhibits  the  same  kind  of  rela¬ 
tionship  between  A,  d ,  and  angles  of  fringes  as  in  any  other  type  of  diffraction. 
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Chapter  13,  Quantum  Physics,  page  829 

Quantum  physics  differs  from  classical  physics  in  many  ways,  the  most  dramatic  of  which  is 
that  certain  processes  at  the  atomic  level,  such  as  radioactive  decay,  are  random  rather  than 
deterministic.  There  is  a  method  to  the  madness,  however:  quantum  physics  still  rules  out  any 
process  that  violates  conservation  laws,  and  it  also  offers  methods  for  calculating  probabilities 
numerically.  The  most  important  of  these  generic  methods  is  the  law  of  independent  probabil¬ 
ities,  which  states  that  if  two  random  events  are  not  related  in  any  way,  then  the  probability 
that  they  will  both  occur  equals  the  product  of  the  two  probabilities, 

probability  of  A  and  B 

=  PaPb  [if  A  and  B  are  independent]. 

When  discussing  a  random  variable  x  that  can  take  on  a  continuous  range  of  values,  we  cannot 
assign  any  finite  probability  to  any  particular  value.  Instead,  we  define  the  probability  dis¬ 
tribution  D(x),  defined  so  that  its  integral  over  some  range  of  x  gives  the  probability  of  that 
range. 

In  radioactive  decay,  the  time  that  a  radioactive  atom  has  a  50%  chance  of  surviving  is  called 
the  half-life,  t\/ 2.  The  probability  of  surviving  for  two  half-lives  is  (1/2)  (1/2)  =  1/4,  and  so  on. 
In  general,  the  probability  of  surviving  a  time  t  is  given  by 

Psurv(t)  =  0.5^i/2. 

Related  quantities  such  as  the  rate  of  decay  and  probability  distribution  for  the  time  of  decay 
are  given  by  the  same  type  of  exponential  function,  but  multiplied  by  certain  constant  factors. 

Around  the  turn  of  the  twentieth  century,  experiments  began  to  show  problems  with  the 
classical  wave  theory  of  light.  In  any  experiment  sensitive  enough  to  detect  very  small  amounts 
of  light  energy,  it  becomes  clear  that  light  energy  cannot  be  divided  into  chunks  smaller  than  a 
certain  amount.  Measurements  involving  the  photoelectric  effect  demonstrate  that  this  smallest 
unit  of  light  energy  equals  hf ,  where  /  is  the  frequency  of  the  light  and  h  is  a  number  known 
as  Planck’s  constant.  We  say  that  light  energy  is  quantized  in  units  of  hf,  and  we  interpret  this 
quantization  as  evidence  that  light  has  particle  properties  as  well  as  wave  properties.  Particles 
of  light  are  called  photons. 

The  only  method  of  reconciling  the  wave  and  particle  natures  of  light  that  has  stood  the  test 
of  experiment  is  the  probability  interpretation:  the  probability  that  the  particle  is  at  a  given 
location  is  proportional  to  the  square  of  the  amplitude  of  the  wave  at  that  location. 

One  important  consequence  of  wave-particle  duality  is  that  we  must  abandon  the  concept 
of  the  path  the  particle  takes  through  space.  To  hold  on  to  this  concept,  we  would  have 
to  contradict  the  well  established  wave  nature  of  light,  since  a  wave  can  spread  out  in  every 
direction  simultaneously. 

Light  is  both  a  particle  and  a  wave.  Matter  is  both  a  particle  and  a  wave.  The  equations 
that  connect  the  particle  and  wave  properties  are  the  same  in  all  cases: 

E  =  hf 
p  =  h/X 

Unlike  the  electric  and  magnetic  fields  that  make  up  a  photon-wave,  the  electron  wavefunction 
is  not  directly  measurable.  Only  the  square  of  the  wavefunction,  which  relates  to  probability, 
has  direct  physical  significance. 
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A  particle  that  is  bound  within  a  certain  region  of  space  is  a  standing  wave  in  terms  of 
quantum  physics.  The  two  equations  above  can  then  be  applied  to  the  standing  wave  to  yield 
some  important  general  observations  about  bound  particles: 

1.  The  particle’s  energy  is  quantized  (can  only  have  certain  values). 

2.  The  particle  has  a  minimum  energy. 

3.  The  smaller  the  space  in  which  the  particle  is  confined,  the  higher  its  kinetic  energy  must 
be. 

These  immediately  resolve  the  difficulties  that  classical  physics  had  encountered  in  explaining 
observations  such  as  the  discrete  spectra  of  atoms,  the  fact  that  atoms  don’t  collapse  by  radiating 
away  their  energy,  and  the  formation  of  chemical  bonds. 

A  standing  wave  confined  to  a  small  space  must  have  a  short  wavelength,  which  corresponds 
to  a  large  momentum  in  quantum  physics.  Since  a  standing  wave  consists  of  a  superposition 
of  two  traveling  waves  moving  in  opposite  directions,  this  large  momentum  should  actually  be 
interpreted  as  an  equal  mixture  of  two  possible  momenta:  a  large  momentum  to  the  left,  or  a 
large  momentum  to  the  right.  Thus  it  is  not  possible  for  a  quantum  wave-particle  to  be  confined 
to  a  small  space  without  making  its  momentum  very  uncertain.  In  general,  the  Heisenberg 
uncertainty  principle  states  that  it  is  not  possible  to  know  the  position  and  momentum  of  a 
particle  simultaneously  with  perfect  accuracy.  The  uncertainties  in  these  two  quantities  must 
satisfy  the  approximate  inequality 

ApAx  >  h. 


When  an  electron  is  subjected  to  electric  forces,  its  wavelength  cannot  be  constant.  The 
“wavelength”  to  be  used  in  the  equation  p  =  h/X  should  be  thought  of  as  the  wavelength  of  the 
sine  wave  that  most  closely  approximates  the  curvature  of  the  wavefunction  at  a  specific  point. 

Infinite  curvature  is  not  physically  possible,  so  realistic  wavefunctions  cannot  have  kinks  in 
them,  and  cannot  just  cut  off  abruptly  at  the  edge  of  a  region  where  the  particle’s  energy  would 
be  insufficient  to  penetrate  according  to  classical  physics.  Instead,  the  wavefunction  “tails  off” 
in  the  classically  forbidden  region,  and  as  a  consequence  it  is  possible  for  particles  to  “tunnel” 
through  regions  where  according  to  classical  physics  they  should  not  be  able  to  penetrate.  If 
this  quantum  tunneling  effect  did  not  exist,  there  would  be  no  fusion  reactions  to  power  our 
sun,  because  the  energies  of  the  nuclei  would  be  insufficient  to  overcome  the  electrical  repulsion 
between  them. 

Hydrogen,  with  one  proton  and  one  electron,  is  the  simplest  atom,  and  more  complex  atoms 
can  often  be  analyzed  to  a  reasonably  good  approximation  by  assuming  their  electrons  occupy 
states  that  have  the  same  structure  as  the  hydrogen  atom’s.  The  electron  in  a  hydrogen  atom 
exchanges  very  little  energy  or  angular  momentum  with  the  proton,  so  its  energy  and  angular 
momentum  are  nearly  constant,  and  can  be  used  to  classify  its  states.  The  energy  of  a  hydrogen 
state  depends  only  on  its  n  quantum  number. 

In  quantum  physics,  the  angular  momentum  of  a  particle  moving  in  a  plane  is  quantized  in 
units  of  h.  Atoms  are  three-dimensional,  however,  so  the  question  naturally  arises  of  how  to 
deal  with  angular  momentum  in  three  dimensions.  In  three  dimensions,  angular  momentum  is 
a  vector  in  the  direction  perpendicular  to  the  plane  of  motion,  such  that  the  motion  appears 
clockwise  if  viewed  along  the  direction  of  the  vector.  Since  angular  momentum  depends  on 
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both  position  and  momentum,  the  Heisenberg  uncertainty  principle  limits  the  accuracy  with 
which  one  can  know  it.  The  most  that  can  be  known  about  an  angular  momentum  vector  is  its 
magnitude  and  one  of  its  three  vector  components,  both  of  which  are  quantized  in  units  of  K. 

In  addition  to  the  angular  momentum  that  an  electron  carries  by  virtue  of  its  motion  through 
space,  it  possesses  an  intrinsic  angular  momentum  with  a  magnitude  of  K/2.  Protons  and 
neutrons  also  have  spins  of  K/ 2,  while  the  photon  has  a  spin  equal  to  h. 

Particles  with  half-integer  spin  obey  the  Pauli  exclusion  principle:  only  one  such  particle 
can  exist  is  a  given  state,  i.e.,  with  a  given  combination  of  quantum  numbers. 

We  can  enumerate  the  lowest-energy  states  of  hydrogen  as  follows: 

n  =  1,  t  =  0,  lz  =  0,  sz  =  +1/2  or  —1/2  two  states 

n  =  2,  i  =  0,  tz  =  0,  sz  =  +1/2  or  —1/2  two  states 

n  =  2,  1=1,  tz  =  —1,  0,  or  1,  sz  =  +1/2  or  —1/2  six  states 

The  periodic  table  can  be  understood  in  terms  of  the  filling  of  these  states.  The  nonreactive 
noble  gases  are  those  atoms  in  which  the  electrons  are  exactly  sufficient  to  fill  all  the  states  up  to 
a  given  n  value.  The  most  reactive  elements  are  those  with  one  more  electron  than  a  noble  gas 
element,  which  can  release  a  great  deal  of  energy  by  giving  away  their  high-energy  electron,  and 
those  with  one  electron  fewer  than  a  noble  gas,  which  release  energy  by  accepting  an  electron. 
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pair  production,  425 
gamma  rays,  16 
garage  paradox,  397 
gas 

spectrum  of,  868 
gas  discharge  tube,  604 
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gauss  (unit),  955 
Gauss’  law,  631 

differential  form,  635 
Gauss’  theorem,  625 
for  gravity,  631 
proof  of,  630 
Gaussian  pillbox,  632 
general  relativity,  429 
generator,  603,  693 
geothermal  vents,  829 
Gerrner,  L.,  860 
GFI,  667 

global  warming,  507 
goiters,  837 
gradient,  216 

graphical  addition  of  vectors,  199 
gravitational  field,  82,  565,  566 
energy  density  of,  592 
gravitational  time  dilation,  433 
gravitational  waves,  568 
Gravity  Probe  B,  430 
ground  fault  interrupter,  667 
group  velocity,  866 

half-life,  837 
Halley’s  comet,  133 
handedness,  665 
harmonics,  376 
Hawking  radiation,  318 
Hawking  singularity  theorem,  440 
Hawking,  Stephen,  440 
heat,  74 

compared  to  temperature,  74 
compared  to  thermal  energy,  75 
heat  capacity 

at  constant  pressure,  334 
at  constant  volume,  334 
heat  engine,  299 
Heisenberg 

Werner,  870 

Heisenberg  uncertainty  principle,  870 
in  three  dimensions,  890 
helium,  903 

Helmholtz  resonator,  336 
Hertz 

Heinrich,  709 
Hertz,  Heinrich,  847 
Heinrich,  794 


Hiroshima,  505 

homogeneity  of  spacetime,  391 
Hooke,  460 
Hooke’s  law,  169 
hormesis,  506 
Hubble,  Edwin,  360 
Hugo,  Victor,  459 
Huygens’  principle,  793 
hydrogen  atom,  893 

angular  momentum  in,  889 
classification  of  states,  888 
energies  of  states  in,  895 
energy  in,  889 
momentum  in,  889 
quantum  numbers,  893 
hysteresis,  721 

ideal  gas  law,  310 
images 

formed  by  curved  mirrors,  759 
formed  by  plane  mirrors,  756 
location  of,  766 
of  images,  761 
real,  760 
virtual,  756 
impedance,  612 

of  an  inductor,  614 
impedance  matching,  618,  718 
incoherent  light,  791 
independence 

statistical,  831,  832 
independent  probabilities 
law  of,  832 
index  of  refraction 
defined,  780 

related  to  speed  of  light,  781 
inductance 

defined,  596 
induction,  603 
inductor,  594 

inductance,  594 
inertial  frame  of  reference,  63 
information  paradox,  437 
inner  product,  411 

in  quantum  mechanics,  922 
insulator 

defined,  526 
invariance 
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rotational,  190 
inverted  reflection,  366 
Io,  744 
iodine,  837 
ion  drive,  132 
isotopes,  494 

Ives-Stilwell  experiment,  414 
Jeans 

James,  830 
joule  (unit),  74 
Joule,  James,  73 

paddlewheel  experiment,  76 
junction  rule,  539 
Jupiter,  744 

kelvin  (unit),  306 
Kelvin  scale,  75 
Kepler’s  laws,  96 
Keynes,  John  Maynard,  460 
kilo-  (metric  prefix),  24 
kilogram,  26,  56 
standard,  57 
kinetic  energy,  76 

compared  to  momentum,  134 
kinetic  energy  theorem,  165 
kinetic  friction,  152 
coefficient  of,  153 

Lagrange,  Joseph-Louis,  55 
Laplace,  16 

Laplace,  Pierre  Simon  de,  829 
Laplacian,  880 
Lavoisier,  Pierre- Andre 

conservation  of  mass,  59 
execution,  55 

least  time,  principle  of,  754,  787,  802 
Leibniz,  94 
lens,  784 

lensmaker’s  equation,  786 
light,  16 

absorption  of,  745 
angular  momentum  of,  712 
brightness  of,  747 
defined,  468 
Doppler  shift  for,  412 
electromagnetic  wave,  703 
momentum  of,  133,  434,  446,  709 
particle  model  of,  747 


ray  model  of,  747 
speed  of,  743 
wave  model  of,  747 
waves,  354 
light  cone,  406 
lightlike,  406 
LIGO,  569 
line  integral,  216 
linear  no-threshold,  506 
Lipkin  linkage,  817 
LNT,  506 
loop  rule,  544 
Lorentz  invariance,  409 
Lorentz  transformation,  392 
Lorentz,  Hendrik,  392 
LRC  circuit,  621 
lumped-circuit  approximation 
for  capacitors,  584 

magnetic  dipole,  658 
field  of,  674 
magnetic  field 

defined,  658,  659 
energy  density  of,  593,  671 
magnetic  monopoles,  663 
magnification 
angular,  762 

by  a  converging  mirror,  759 
mass 

conservation  of,  56,  466 
equivalence  to  energy,  419 
gravitational,  57 
inertial,  57 
quantization  of,  69 
mass-energy 

conservation  of,  421 
correspondence  principle,  423 
of  a  moving  particle,  422 
matter,  16 

as  a  wave,  859 
defined,  468 

Maxwell’s  equations,  701 
for  static  fields,  682 
in  cgs  units,  955 
in  differential  form,  954 
Maxwell,  James  Clerk,  794 
measurement  in  quantum  physics,  872 
mechanical  system,  130 
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median,  837 

mega-  (metric  prefix),  24 
Mendeleev,  Dmitri,  469 
meter  (metric  unit),  25 
meter  (unit),  56 
metric  system,  24,  56 
prefixes,  24,  986 

Michelson-Morley  experiment,  400 

micro-  (metric  prefix),  24 

microwaves,  16 

milli-  (metric  prefix),  24 

Millikan,  Robert,  471 

Millikan,  Robert,  849 

mirror 

converging,  766 
inks  units,  26 
molecules 

nonexistence  in  classical  physics,  859 
mollusc,  778 
moment 

dipole,  571 

moment  of  inertia,  268 

tabulated  for  various  shapes,  275 
momentum,  130,  131 

compared  to  kinetic  energy,  134 
nonmechanical,  133 
of  light,  133,  434,  446,  709 
relativistic,  415,  423 
monopoles 

magnetic,  663 
MRI  scan,  661 

naked  singularity,  440 
nano-  (metric  prefix),  24 
nautilus,  777 
neutral  (electrically),  463 
neutral  equilibrium,  86 
neutron 

discovery  of,  138 
spin  of,  901 
Newton 

Isaac,  829 
newton  (unit),  145 
Newton,  Isaac,  24,  94 
alchemy,  459 
definition  of  time,  27 
Newtonian  telescope,  761 
particle  theory  of  light,  793 


Nichols-Hull  experiment  on  momentum  of  light, 
711 

normal  force,  152 
normalization,  833 
nuclear  forces,  495,  666 
nuclear  reactions,  68 
nucleus 

discovery,  485 

Ohm’s  law,  526 
ohmic 

defined,  526 
op-amp,  600 
open  circuit,  520 

operational  amplifier  (op-amp),  600 

operational  definitions,  25 

orbit 

circular,  98 

order-of-magnitude  estimates,  44 
oscillations,  113 
damped,  172 
overdamped 
electrical,  605 
mechanical,  185 
steady  state,  176 
Otto  cycle,  332 
Otto,  Nikolaus,  332 
overdamped  oscillations 
electrical,  605 
mechanical,  185 
ozone  layer,  844 

paddlewheel  experiment,  76 
pair  production,  425 
parallel  axis  theorem,  270,  295 
parallel  circuit 
defined,  533 
paramagnetism,  719 
parity 

operator  in  quantum  mechanics,  921 
Parmenides,  55 
partial  derivative,  216,  635 
particle 

definition  of,  852 
particle  in  a  box,  867 
particle  model  of  light,  747,  794 
pascal 

unit,  301 

pascal  (unit),  204 
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path  of  a  photon  undefined,  853 
Pauli  exclusion  principle,  18,  904 
Peaucellier  linkage,  817 
Pelton  waterwheel,  189 
Penrose  singularity  theorem,  439 
Penrose,  Roger,  439 
period 

of  waves,  355 

periodic  table,  469,  487,  904 
permeability,  716 
permittivity,  715 
perpetual  motion  machine,  73 
phase  velocity,  866 
photoelectric  effect,  847 
photon 

Einstein’s  early  theory,  846 
energy  of,  849 
in  three  dimensions,  857 
spin  of,  901 
physics,  16 
pillbox 

Gaussian,  632 
pilot  wave  hypothesis,  854 
Planck’s  constant,  849 
Planck,  Max,  849 
polarization,  705 
Pope,  37 
Porro  prism,  812 
positron,  421,  499 
positron  decay,  498 
Pound- Rebka  experiment,  434 
power,  80 

electrical,  522 
Poynting  vector,  732 
praxinoscope,  757 
pressure,  300 

as  a  function  of  depth,  204 
defined,  204 
prism 

Porro,  812 
probabilities 

addition  of,  833 
normalization  of,  833 
probability  distributions 
averages  of,  836 
widths  of,  836 

probability  distributions,  835 
probability  interpretation,  854 


protein  molecules,  888 
proton 

spin  of,  901 
Pythagoras,  742 

quality  factor,  176 
quantization,  471 
of  mass,  69 
quantum  dot,  867 
quantum  moat,  889 
quantum  numbers,  894 
quantum  physics,  830 
quark,  941 

radar,  844 

radiation  hormesis,  506 

radio,  844 

radio  waves,  16 

raisin  cookie  model,  478 

randomness,  830 

ray  diagrams,  749 

ray  model  of  light,  747,  794 

RC  circuit,  606 

RC  time  constant,  606 

reactions 

chemical,  68 
nuclear,  68 
reductionism,  19 
reflection 

diffuse,  746 
of  waves,  364 
specular,  750 
reflections 

inverted  and  uninverted,  366 
refraction 

and  color,  783 
defined,  778 
relativity 

Galilean,  62,  191 
general,  429 
Renaissance,  13 

repetition  of  diffracting  objects,  798 
resistance 

defined,  526 
in  parallel,  539 
in  series,  543 
resistivity 

defined,  545 
resistor,  531 
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resistors 

in  parallel,  540 
retina,  761 
reversibility,  752 
RHIC  accelerator,  397 
RL  circuit,  606 
RMS  (root  mean  square),  617 
Roerner,  744 
root  mean  square,  617 
rotational  invariance,  190 
Russell 

Bertrand,  830 
Rutherford 
Ernest,  829 

scalar 

defined,  193 
scaling,  36 
schematic,  538 
schematics,  538 
Schrodinger 
Erwin,  872 

Schrodinger  equation,  875 
time-dependent,  913 
Schrodinger ’s  cat,  872 
scientific  method,  14 
sea-of-arrows  representation,  567 
second  (unit),  25,  56 
series  circuit 
defined,  533 
shell  theorem,  102 
short  circuit 
defined,  531 
SI  units,  26,  78 
Sievert  (unit),  504 
sigma  notation,  141 
significant  figures,  31 
simple  machine,  168 
single-slit 

diffraction,  799 
singularity 

Big  Bang,  439 
black  hole,  438 
naked,  440 
singularity  theorem 
Hawking,  440 
Penrose,  439 
sinks  in  fields,  567 


Sirius,  868 
skin  depth,  714 
Snell’s  law,  779 

derivation  of,  782 
mechanical  model  of,  781 
sodium,  904 
solar  constant,  733 
solar  sail,  204 
solenoid,  595 

magnetic  field  of,  680 
sound 

speed  of,  379 
waves,  353 
sources  of  fields,  567 
spacelike,  406 
spacetime 

curvature  of,  431 
spark  plug,  607 
specific  heat,  74 
spectrum 

absorption,  868 
emission,  868 
spherical  harmonics,  892 
spin,  901 

neutron’s,  901 
of  electron,  901 
photon’s,  901 
proton’s,  901 
spin  theorem,  253 
proof,  954 
spiral 

Archimedean,  737 
spring  constant,  114 
Squid,  778 
stability,  86 
standing  wave,  378 
standing  waves,  378 
Star  Trek,  869 
states 

bound, 867 
static  friction,  152 
coefficient  of,  153 
statvold,  955 
steady  state,  176 
strong  nuclear  force,  495 
strong  nuclear  force,  495 
superposition 
of  waves,  344 
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tunneling,  875 
twin  paradox,  411 


superposition  of  fields,  567 
Swift,  Jonathan,  36 
symmetry,  665 
system 

closed,  58 

Taylor,  G.I.,  853 
telescope,  761,  800 
temperature,  74,  300 

absolute  zero,  75,  306 
Celsius,  306 
compared  to  heat,  74 
Kelvin,  306 

macroscopic  definition,  306 
microscopic  definition,  322 
tension,  159,  167 
Tesla 

Nikola,  178 
tesla  (unit),  658 
thermal  energy 

compared  to  heat,  75 
thermodynamics,  299 
first  law  of,  300,  331 
laws  of 

summarized,  331 
second  law  of,  317,  329,  331 
third  law  of,  331 
zeroth  law  of,  305,  331 
thermometer,  306 
Thomson,  J.J. 

cathode  ray  experiments,  475 
time 

arrow  of,  329 
time  constant 
RC,  606 
time  dilation 

gravitational,  433 
time  reversal,  752 
timelike,  406 
torque 

defined,  254 

related  to  force,  255,  284 
total  internal  reflection,  784 
transformer,  603,  693 
transmission 

of  waves,  364 

transmission  of  forces,  158 
triangle  inequality,  411 


ultraviolet  light,  844 
uncertainty  principle,  870 
in  three  dimensions,  890 
unitary  evolution  of  the  wavefunction,  913 
units 

nonmetric,  986 
units,  conversion  of,  29 
unstable  equilibrium,  87 

vector 

addition,  194 
defined,  193 

division  by  a  scalar,  194 
dot  product,  212 
four- vector,  411 
magnitude  of,  194 
multiplication  by  a  scalar,  194 
subtraction,  194 
vector  addition 
analytic,  199 
graphical,  199 
vector  cross  product,  280 
vector  product,  cross,  280 
velocity 

addition  of 

relativistic,  414,  447,  978 
vector,  201 
group,  866 
phase,  866 
velocity  filter,  662 
vision,  742 
volt  (unit) 

defined,  521 
voltage 

defined,  522 

related  to  electric  field,  574 
volume 

operational  definition,  35 
scaling  of,  36 
Voyager  space  probe,  444 

water 

specific  heat,  74 
wave 

definition  of,  852 
dispersive,  787,  864 
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electromagnetic,  703 

energy  related  to  amplitude,  367 

light,  703 

wave  model  of  light,  747,  794 
wave-particle  duality,  852 

pilot-wave  interpretation  of,  854 
probability  interpretation  of,  854 
wavefunction 

complex  numbers  in,  881 
of  the  electron,  863 
wavelength,  356 
wavenumber,  925 
waves 

absorption  of,  364 
dispersive,  357 
frequency  of,  355 
gravitational,  568 
interference,  372 
light,  354 

medium  not  transported  with,  346 
on  a  string,  350 
patterns,  348 
period  of,  355 
reflection  of,  364 
sound,  353 
standing,  378 
superposition  of,  344 
transmission  of,  364 
velocity  of,  347 
wavelength,  356 
weak  nuclear  force,  666 
weak  nuclear  force,  498 
whale  songs,  366 
Wicked  Witch  of  the  West,  859 
Wigner,  Eugene,  766 
work 

defined,  162 
work  theorem,  164 
world-line,  411 

x-rays,  16 

Yarkovsky  effect,  189 
Young,  Thomas,  794 
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